Advanced Storage Issues in ESXi
18. Datastore Becomes Read-Only
Scenario:
- An ESXi host suddenly marks a VMFS datastore as read-only, preventing VMs from writing to the disk.
Possible Causes:
- Storage device path failure.
- VMFS corruption.
- ESXi lost access to the LUN.
Solution Steps:
- Check if the Datastore is Marked as Read-Only bashCopyEdit
vmkfstools -P /vmfs/volumes/DatastoreName
- If output shows “Read-Only”, continue with recovery.
- Rescan the Storage Adapters bashCopyEdit
esxcli storage core adapter rescan --all
- Check for Active Paths bashCopyEdit
esxcli storage core path list | grep "Active"
- If no active paths exist, storage connectivity is lost.
- Try to Remount the Datastore bashCopyEdit
esxcli storage filesystem mount -l DatastoreName
- Check for VMFS Errors and Repair bashCopyEdit
vmkfstools -y /vmfs/devices/disks/naa.6xxxxxxxxx
- Restart Storage Services bashCopyEdit
/etc/init.d/hostd restart /etc/init.d/vpxa restart
- If All Fails, Restore from Backup or Recreate the Datastore bashCopyEdit
esxcli storage filesystem unmount -l DatastoreName esxcli storage vmfs extent delete -l DatastoreName
19. ESXi Host Loses Access to iSCSI Storage (APD/PDL Errors)
Scenario:
- The ESXi host loses connection to iSCSI storage and shows APD (All Paths Down) or PDL (Permanent Device Loss) errors.
Solution Steps:
- Check Storage Paths bashCopyEdit
esxcli storage core path list
- If no active paths, the storage is disconnected.
- Restart the iSCSI Service bashCopyEdit
/etc/init.d/iscsi restart
- Rescan iSCSI Targets bashCopyEdit
esxcli storage core adapter rescan --adapter=vmhbaXX
- Verify Firewall Rules (If iSCSI is Blocked) bashCopyEdit
esxcli network firewall ruleset list
- Reconnect iSCSI Targets Manually bashCopyEdit
esxcli iscsi adapter discovery sendtarget add -a <Storage_IP>
- If PDL Condition Exists, Remove the LUN and Re-add It bashCopyEdit
esxcli storage core device detach -d naa.xxxxxxxx esxcli storage core device set -d naa.xxxxxxxx -O off
20. High Storage Latency Causing VM Performance Issues
Scenario:
- VMs running on an ESXi host experience high I/O latency, affecting performance.
esxtop
shows highDAVG/cmd
values.
Solution Steps:
- Check Storage Performance in
esxtop
bashCopyEditesxtop
- Press
d
for disk statistics. - Check
DAVG/cmd
(Disk Average Latency):- Below 10ms = Good
- 10-20ms = Moderate
- Above 20ms = Bad
- Press
- Check for Storage Congestion bashCopyEdit
esxcli storage core stats get
- Identify High I/O VMs bashCopyEdit
vscsiStats -l
- Reduce Storage Load
- Move VMs to another datastore using Storage vMotion.
- Enable Storage I/O Control (SIOC) to balance I/O.
- Verify Multipathing Policy bashCopyEdit
esxcli storage nmp device list
- Upgrade Storage Firmware and Drivers
- Check if the storage controller is on VMware’s HCL (Hardware Compatibility List).
Advanced Network Issues in ESXi
21. VM Network Latency or Packet Loss
Scenario:
- VMs experience high ping latency or packet drops.
- vMotion and iSCSI traffic is slow.
Solution Steps:
- Check Physical NIC Statistics bashCopyEdit
esxcli network nic stats get -n vmnicX
- Look for high
droppedTx
ordroppedRx
.
- Look for high
- Check for Network Congestion in
esxtop
bashCopyEditesxtop
- Press
n
for network stats.
- Press
- Verify MTU Mismatch (If using Jumbo Frames) bashCopyEdit
esxcli network ip interface list
- Ensure all devices (VMkernel, vSwitch, and physical switch) have the same MTU.
- Ensure vSwitch and Port Group Configurations are Correct bashCopyEdit
esxcli network vswitch standard list esxcli network vswitch standard portgroup list
- Check VLAN Tagging Issues
- Ensure VLAN IDs match between ESXi and the physical switch.
- Enable RSS (Receive Side Scaling) for Better Performance bashCopyEdit
esxcli system settings advanced set -o /Net/RSS -i 1
22. vMotion Fails Due to Network Misconfiguration
Scenario:
- vMotion fails with “Host not reachable” error.
- Migration is slow or gets stuck.
Solution Steps:
- Check if vMotion is Enabled on the VMkernel Port bashCopyEdit
esxcli network ip interface tag list
- Test vMotion Network Connectivity bashCopyEdit
vmkping -I vmk1 <destination_host_IP>
- Verify MTU Configuration bashCopyEdit
esxcli network ip interface list
- Ensure Jumbo Frames (9000 MTU) is configured correctly.
- Check for Firewall Blocking vMotion Traffic bashCopyEdit
esxcli network firewall ruleset list
- Enable vMotion if necessary: bashCopyEdit
esxcli network firewall ruleset set -e true -r vmotion
- Enable vMotion if necessary: bashCopyEdit
- Try vMotion Again Using CLI bashCopyEdit
vim-cmd vmsvc/getallvms vim-cmd vmsvc/migrate <VM_ID> <Target_Host_ID>
23. ESXi Host Loses Network Connectivity After vSwitch Change
Scenario:
- After changing vSwitch or VLAN settings, the ESXi host loses network access.
Solution Steps:
- Check Physical NIC Status bashCopyEdit
esxcli network nic list
- Verify vSwitch Configuration bashCopyEdit
esxcli network vswitch standard list
- Reconfigure the VMkernel Network Manually bashCopyEdit
esxcli network ip interface remove -i vmk0 esxcli network ip interface add -i vmk0 -p "Management Network" -i 192.168.1.10 -n 255.255.255.0 -g 192.168.1.1
- Restart the Network Stack bashCopyEdit
/etc/init.d/networking restart