I'm trying to monitor an HP Smart Array controller using nagios. I have 4 disks on 2 RAID1 arrays, so 2 disks per RAID1.
I installed nrpe-server and nrpe-plugins and I'm using the almighty check_raid
plugin, but it seems to only be seeing my first array and not the second:
# /usr/lib/nagios/plugins/check_raid
OK: cciss:[/dev/cciss/c0d0: (Smart Array E200i) RAID 1 Volume 0 status: OK, /dev/cciss/c0d0: (Smart Array E200i) RAID 1 Volume 1 status: OK]
cciss_vol_status
reports correctly for both arrays:
# cciss_vol_status /dev/cciss/c0d0
/dev/cciss/c0d0: (Smart Array E200i) RAID 1 Volume 0 status: OK.
/dev/cciss/c0d0: (Smart Array E200i) RAID 1 Volume 1 status: OK.
# cciss_vol_status /dev/cciss/c0d1
/dev/cciss/c0d1: (Smart Array E200i) RAID 1 Volume 0 status: OK.
/dev/cciss/c0d1: (Smart Array E200i) RAID 1 Volume 1 status: OK.
How do I get check_raid
to report on both arrays?
The server is an HP Proliant DL360 G5 with a Smart Array E200i controller. It's currently running debian wheezy.
My recommendation would be to use the HP Management and Health Agents for the server. Those can email, dump to syslog and send SNMP traps.
If you absolutely must have this reported in Nagios, it seems like there are plenty of options. But a component of the HP set of tools is the
hpssacli
orhpacucli
utility, whose output could be parsed:or
or maybe: