I'm monitoring my IBM ServeRAID M5015 controller for RAID status with MegaCLI, I have this on one of the disk :
Enclosure Device ID: 252
Slot Number: 6
Enclosure position: 0
Device Id: 14
Sequence Number: 2
Media Error Count: 32
Other Error Count: 0
Predictive Failure Count: 18
Last Predictive Failure Event Seq Number: 8119
PD Type: SAS
Raw Size: 279.396 GB [0x22ecb25c Sectors]
Non Coerced Size: 278.896 GB [0x22dcb25c Sectors]
Coerced Size: 278.464 GB [0x22cee000 Sectors]
Firmware state: Online, Spun Up
SAS Address(0): 0x5000c50042c319c9
SAS Address(1): 0x0
Connected Port Number: 5(path0)
Inquiry Data: IBM-ESXSST9300653SS B6336XN04HC10525B633
IBM FRU/CRU: 81Y9671
FDE Capable: Not Capable
FDE Enable: Disable
Secured: Unsecured
Locked: Unlocked
Needs EKM Attention: No
Foreign State: None
Device Speed: 6.0Gb/s
Link Speed: 6.0Gb/s
Media Type: Hard Disk Device
Drive: Not Certified
Drive Temperature :33 Celsius
What does this mean exactly ? I can't find an exact description, is there a way to have more details ? The RAID array has the Optimal state.
Media Error Count: 32
Predictive Failure Count: 18
Is there a way through the CLI to power-on the front LED so I physically know which disk I need to replace ?
There are errors on your disk. S.M.A.R.T. stands for Self-Monitoring, Analysis and Reporting Technology
The specific errors you mention correlate to mechanical degradation of the drive. You can possibly use this report to obtain a warranty replacement fomr IBM. The drive WILL eventually fail.
From a Seagate doc:
Here's out to locate the faulty disk:
The drive is physically failing at this point. The most important thing to worry about right now is having a good backup of your data, and a plan to get that drive replaced ASAP.