VMware vSphere Fault Tolerance (FT) is the ultimate virtual machine (VM) protection. The VM will never be down because of an underlying hardware problem. This is not High Availability (HA) where the VM simply restarts on another host within a cluster.

vSphere FT protects the VM all the time, and when there is an underlying host problem, the "shadow" VM (the secondary VM created when you activate FT) replaces the failed primary VM instantly without the users noticing.

However, you can't simply activate FT for every data center VM because there are some constraints, some rules to respect, and finally, some licensing limitations.

How does FT work, and what's the history? ^

Over the years, vSphere FT has evolved. It works by continuously replicating an entire running VM from one physical server to another. The protected VM is the primary VM. Activating FT creates a duplicate VM called a secondary VM, which runs on another host.

FT continuously replicates the primary VM to the secondary VM so the secondary VM can take over at any point, thereby providing fault-tolerant protection. In case of a hardware problem on the host that runs the primary VM, it will start a new secondary VM and reestablish FT redundancy automatically.

Primary and secondary VMs

Primary and secondary VMs

Note: This also works the other way around. In case there is a hardware problem on a machine that hosts a secondary VM, the primary VM detects it and re-creates a new secondary VM on another host within the vSphere cluster.

Use cases for FT ^

  • An application that needs to be bulletproof: Its workload cannot have five minutes of downtime. It's a business-critical enterprise application. Every minute counts.
  • A custom application that cannot do clustering any other way.
  • Temporary protection: You might have a monthly report running one day per month every month, and you want to be absolutely sure your VM won't be down because of a hardware problem. You activate FT for a day for that particular VM.
  • Clustering is too complicated to maintain and configure: There might be cases where your application can be clustered, but it's really problematic and difficult to install and maintain.

Requirements, limits, and licensing ^

  • The physical CPUs need to support vMotion and hardware memory management unit (MMU) virtualization (Intel EPT or AMD RVI). They have to be Intel Sandy Bridge or later, and for AMD, they have to be AMD Bulldozer or later.
  • You'll need 10 Gb networking for FT logging; check whether the network is a low-latency one. VMware recommends dedicated networking.

Limits

  • The number of VMs protected by FT per cluster: by default, this is four (but configurable with the host advanced configuration parameter: maxftvmsperhost).
  • The number of virtual CPUs (vCPUs) configured within VMs protected by FT: by default, this is eight (but configurable with the host advanced configuration parameter: maxftvcpusperhost).

Licensing

The number of vCPUs supported by a single fault-tolerant VM is limited by the level of licensing you have purchased for vSphere.

FT support is as follows:

  • vSphere Standard and Enterprise allow up to two vCPUs
  • vSphere Enterprise Plus allows up to eight vCPUs

VM limitations

The VM needs to meet some requirements as well.

  • There must be no unsupported devices attached to the VM.
  • You must store the VM on shared storage, but not the VM disks (VMDKs).
  • You can't have CD-ROMs, floppies, network interface controller (NIC) passthroughs, hotplug devices, serial or parallel ports, video cards with 3D-enabled features, or 2 TB+ VMDKs.

Best practices for VMware FT ^

You should follow some best practices when thinking of using FT within your virtual infrastructure.

  • You should use approximately the same CPU frequencies on the hosts where the primary and secondary VMs will run. This is because the secondary VM might restart more frequently; configuring the host enforces low-frequency mode to save power. (Check the BIOS setup and configure your host not to use the economy energy mode for the CPU).
  • You should configure your BIOS across all hosts to apply the same instruction set (enabled or disabled). You should also check your motherboard documentation on how to enable or disable instruction sets on your systems.

And you should also use:

  • Processors from the same compatible processor group
  • The same ESXi version
  • Common access to data stores the VMs use
  • The same VM network configuration

Where do you activate VMware Fault Tolerance? ^

First, on each host, create a new VMkernel network adapter and activate Fault Tolerance logging.

Start the vSphere Web Client and go to Host > Manage > Networking. Then go to VMkernel adapters  > Add host networking; select the VMkernel network adapter and select an existing standard switch or a new vSwitch. On the Port properties, enable Fault Tolerance.

Configure the VMkernel network adapter for Fault Tolerance logging

Configure the VMkernel network adapter for Fault Tolerance logging

In addition, you must enable the vMotion VMkernel interface. Also, make sure the host has a license covering Fault Tolerance. Then, at the VM level, you can configure VMware FT.

It's fairly simple. Once you've met the requirements, simply right-click a VM and go to Fault Tolerance > Turn on Fault Tolerance.

Turn On Fault Tolerance

Turn On Fault Tolerance

Final words ^

VMware FT is the ultimate protection for your VMs and your workloads. It does not need any "in-guest" configuration, and you don't need to install any software inside the VM or do any special configuration inside of your VM(s).

Subscribe to 4sysops newsletter!

However, you must follow the host, VM, and cluster requirements to activate FT with success. Once it's active, you don't really have to pay particular attention as long as your cluster and your hosts within your cluster have identical or homogenous configurations.

avataravatar
3 Comments
  1. julian 4 years ago

    hey, fyi, vSphere Enterprise doesnt exist anymore.  only Standard and Enterprise Plus.

  2. Author
    Vladan Seget 4 years ago

    Thanks for the heads up. -:) Sure it does not. 

  3. Nezil Locked 2 years ago

    Hi

    i am facing issue , when i disconnect management network FT is not working. while shutdown the host FT is working .any idea about it??

    VSwitch 1 – management network

    vSwitch 2 – vMotion (10Gig )

    vSwitch 3 -FT(10Gig)

     

Leave a reply

Your email address will not be published.

*

© 4sysops 2006 - 2022

CONTACT US

Please ask IT administration questions in the forums. Any other messages are welcome.

Sending

Log in with your credentials

or    

Forgot your details?

Create Account