11. vMotion Fails – VM Stuck at 14% or 90%
Scenario:
When trying to migrate a VM using vMotion, the operation gets stuck at 14% or 90% and fails.
Possible Causes:
- Insufficient network bandwidth on the vMotion network.
- VM has an active CD/DVD drive connected to a local datastore.
- High CPU/RAM utilization causing resource contention.
- vMotion network is misconfigured.
Solution Steps:
- Check vMotion Network Configuration
- Ensure the vMotion NIC is configured correctly: bashCopyEdit
esxcli network ip interface list
- Verify vMotion network reachability using: bashCopyEdit
vmkping <target_host_IP>
- Ensure the vMotion NIC is configured correctly: bashCopyEdit
- Check for Open VM Hardware (CD/DVD, ISOs, Floppy)
- If a VM has an attached CD/DVD, disconnect it.
- Right-click VM → Edit Settings → Uncheck “Connected” for CD/DVD.
- If a VM has an attached CD/DVD, disconnect it.
- Check Storage and CPU Load
- If the VM is using a high CPU load, try again when usage is lower.
- Use “Force vMotion” to Bypass CPU Compatibility Check bashCopyEdit
vim-cmd vmsvc/getallvms vim-cmd vmsvc/migrate <VM_ID> <Target_ESXi_ID>
- Reconfigure vMotion if Needed
- Disable and re-enable vMotion on both hosts.
- Restart ESXi management services: bashCopyEdit
/etc/init.d/hostd restart /etc/init.d/vpxa restart
12. ESXi Host Fails to Enter Maintenance Mode
Scenario:
When trying to put an ESXi host into Maintenance Mode, the process gets stuck or fails.
Possible Causes:
- A VM is still running on the host.
- vSphere HA is preventing the action.
- Storage or network issues causing lockups.
Solution Steps:
- Manually Migrate or Power Off VMs
- Use vMotion to move running VMs to another host.
- If vMotion is unavailable, power off VMs.
- Check HA Admission Control Settings
- Navigate to Cluster → Configure → vSphere HA.
- Disable Admission Control and retry.
- Force Maintenance Mode from CLI bashCopyEdit
vim-cmd hostsvc/maintenance_mode_enter
- Restart ESXi Services and Retry bashCopyEdit
/etc/init.d/hostd restart /etc/init.d/vpxa restart
- Check for Storage Connectivity Issues
- If the host is waiting for storage access, check if a datastore is unresponsive.
13. VM Network Disconnected After ESXi Host Reboot
Scenario:
After rebooting an ESXi host, some VMs lose network connectivity.
Possible Causes:
- vSwitch or port group misconfiguration.
- Physical NIC failed or disconnected.
- VLAN configuration mismatch.
Solution Steps:
- Check if vSwitch and Port Groups Exist bashCopyEdit
esxcli network vswitch standard list esxcli network vswitch standard portgroup list
- Verify Physical NIC Status bashCopyEdit
esxcli network nic list
- Ensure VLAN Tagging is Correct
- Check if VLAN ID on the port group matches switch configuration.
- Restart Network Services bashCopyEdit
/etc/init.d/networking restart
- Reconnect VM to the Correct Port Group
- Right-click VM → Edit Settings → Network Adapter → Reassign Port Group.
14. ESXi Host Logs Show “Lost Connectivity to Storage Device”
Scenario:
The ESXi host loses connectivity to a shared iSCSI/FC storage device.
Possible Causes:
- Network failure (for iSCSI storage).
- Storage controller failure or LUN not assigned.
- HBA/Multipathing issue.
Solution Steps:
- Check Storage Connectivity bashCopyEdit
esxcli storage core path list
- Rescan All Storage Adapters bashCopyEdit
esxcli storage core adapter rescan --all
- Check Multipathing and HBA Configuration bashCopyEdit
esxcli storage nmp device list
- Restart Storage Services bashCopyEdit
/etc/init.d/lwiod restart /etc/init.d/hostd restart /etc/init.d/vpxa restart
- Verify Storage Visibility on SAN/NAS Side
- Check SAN/NAS logs to see if the ESXi host is recognized.
15. vCenter Appliance (VCSA) is Not Accessible
Scenario:
The vCenter Server Appliance (VCSA) is unreachable via web or SSH.
Possible Causes:
- vCenter service failure.
- Storage or disk space issues.
- Network misconfiguration.
Solution Steps:
- Check if VCSA is Running on ESXi bashCopyEdit
vim-cmd vmsvc/getallvms vim-cmd vmsvc/power.getstate <VCSA_ID>
- Access VCSA Directly via Console
- Open DCUI (Direct Console UI) → Check Network & Services.
- Restart vCenter Services bashCopyEdit
service-control --stop --all service-control --start --all
- Check Storage and Disk Space bashCopyEdit
df -h
- Verify vCenter Certificate Expiry bashCopyEdit
/usr/lib/vmware-vmca/bin/certificate-manager
16. Unable to Delete a VM – “File is Locked” Error
Scenario:
When trying to delete a VM, you get “Cannot delete file. File is locked.”
Possible Causes:
- VM files are in use by another process.
- ESXi host lock preventing deletion.
Solution Steps:
- Check Which Host Owns the Lock bashCopyEdit
vmkfstools -D /vmfs/volumes/datastore/VM_Name/VM.vmdk
- Power Off the VM and Retry bashCopyEdit
vim-cmd vmsvc/power.off <VM_ID>
- Manually Remove Locked Files bashCopyEdit
rm -rf /vmfs/volumes/datastore/VM_Name
- Restart ESXi Host (If Needed)
- If the lock persists, restart the host holding the lock.
17. VM is Running but Not Responding
Scenario:
A VM is powered on but does not respond to ping, RDP, or SSH.
Possible Causes:
- Guest OS crash or freeze.
- Network adapter disabled or misconfigured.
Solution Steps:
- Check VM’s Console for Any Errors
- Open vSphere Client → VM Console → Look for OS errors.
- Verify VM Network Adapter Status bashCopyEdit
esxcli network vm list esxcli network vm port list -w <VM_WORLD_ID>
- Check for Disk or CPU Contention Issues
- Run
esxtop
and checkCPU %RDY
andDAVG/cmd
(Disk Latency).
- Run
- Force Reboot the VM bashCopyEdit
vim-cmd vmsvc/power.reset <VM_ID>