My LSI MegaRaid just told me one disk is "UBad" which I assume means it failed:
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp Type
--------------------------------------------------------------------------------
252:7 13 UBad F 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68EUZN0 U -
I have a hot spare installed:
EID:Slt DID State DG Size Intf Med SED PI SeSz Model Sp Type
--------------------------------------------------------------------------------
252:6 14 DHS 0 2.728 TB SATA HDD N N 512B WDC WD30EFRX-68EUZN0 D -
but the status of the hot spare didn't change. Is it being used to save my RAID array?
If not, how do I tell the controller to add the hot spare to the disk group 0?
First, get some information about your controller, volumes and drive:
/c0
is the controller to check. If unsure, try/cALL
and look for a line likeController =
to get the controller numbers.We need EID and Slot/Slt from the output. The
TOPOLOGY
orPD LIST
are good:In order to make the controller to reconsider the drive, set it to
good
:/e252
is the enclosure (EID
in the output) and/s14
is the slot (Slt
in the output).The state of the disk should now be
uGood
:If the controller already knew the disk (it was installed before but for some reason, the controller thought the disk was bad), it may show up as
DHS
(dedicated hot spare).To check that the disk is OK, run a self test:
14
is theDID
(Disk ID) from thePD LIST
(storcli /c0 show all
). That's the disk that you want./dev/sdb
is the Linux device which is attached to the megaraid driver.-t long
starts a long self test.To find the Linux device, use
lsscsi
:To check the state of the self test, use
smartctl -d megaraid,14 /dev/sdb -c
orsmartctl -d megaraid,14 /dev/sdb -a