Server is a VMware ESXi 5.5 host. Uptime on it is 65 days. LCD on the server is showing an error: Persistent correctable memory error rate has increased for a memory device at location DIMM B4
I'd like to look in DRAC to see what's going on, but cannot access it. The last time I needed DRAC (a few months ago, can't really recall) I was able to access it via the web portal. However, now it is not responding to PING, not even when I just try to ping the IP (we also have a DNS A record for it). I am in the same office as the server, on the same LAN.
Is there anything I can do while the server is up, or do I just have to try and reboot it during the next maintenance window? This is a ESXi host running several VM's. I cannot try anything risky during normal business hours. Thanks.
On the LCD interface, you should be able to navigate to view the iDRAC IP address. Verify it is still valid. If so, the firmware could be frozen.
As for your memory error rate, this may be a bad RAM chip in slot B4. If not corrected, your RAM amount may go down or the server may crash (purple screen of death). Do you currently have a support contract with Dell? If so, log a case with them and they should overnight a new RAM module. If not, you'll need to consider replacing it soon.
If your host is in a cluster, I would advise trying to vMotion your VMs and put this server in maintenance mode until it could be addressed.
Got it. The issue was NIC0 was being used for vMotion so was on the iSCSI switch, not the LAN switch. When I changed the patch cable on NIC0 over to the LAN switch I started getting a PING response on the IP and I was able to open the iDRAC web console.