Question

ZaphodB

Asked: 2012-02-21 07:03:27 +0800 CST

nagios check_crash || how to detect when a server has crashed and rebooted?

Thanks to the Intel TCO watchdog, some servers I manage now reboot automatically after a kernel or hardware crash, and the init scripts are now 'reboot-safe'. Sadly, this means I no longer get a notification from Nagios when a machine has crashed, because the service is back up before the check has failed enough times to send a notification.

Is there a reliable script or Nagios check that will notify me if, say, a machine has crashed 3 times during the last 48 hours?

2 Answers

Michael Lowman · Answer 1 · 2012-02-21T07:16:20+08:00

Best Answer

How about you write one? An easy way would be to run uptime in the script. A slightly better way would be to add an init script that echoes the time to a rotating logfile at boot. Grab the last three entries in the file and check the elapsed time since the first of them.
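A minimal Python sketch of the logfile approach, under some assumptions that are not in the answer itself: an init script appends one Unix timestamp per boot (for example with `date +%s`) to a hypothetical file `/var/log/boot-times.log`, and the thresholds of 3 boots in 48 hours are taken from the question. The function names are illustrative only. Rather than comparing the last three entries directly, it counts the entries that fall inside the window, which amounts to the same test.

```python
import time

# Hypothetical path and thresholds; adjust to your own setup.
BOOT_LOG = "/var/log/boot-times.log"  # one Unix timestamp per line, appended at boot
WINDOW_SECONDS = 48 * 3600
MAX_BOOTS = 3

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def count_recent_boots(timestamps, now, window=WINDOW_SECONDS):
    """Count boot timestamps that fall within the last `window` seconds."""
    return sum(1 for ts in timestamps if 0 <= now - ts <= window)


def evaluate(timestamps, now, max_boots=MAX_BOOTS, window=WINDOW_SECONDS):
    """Return (exit_code, message) in the usual Nagios plugin style."""
    boots = count_recent_boots(timestamps, now, window)
    hours = window // 3600
    if boots >= max_boots:
        return CRITICAL, f"CRITICAL - {boots} boots in the last {hours}h"
    return OK, f"OK - {boots} boots in the last {hours}h"


def read_boot_log(path):
    """Parse the boot log, skipping blank lines."""
    with open(path) as fh:
        return [float(line) for line in fh if line.strip()]


def main():
    """Read the log, print one status line, and return the exit code."""
    try:
        timestamps = read_boot_log(BOOT_LOG)
    except (OSError, ValueError) as exc:
        print(f"UNKNOWN - cannot read {BOOT_LOG}: {exc}")
        return UNKNOWN
    status, message = evaluate(timestamps, time.time())
    print(message)
    return status
```

To run it as a plugin (for instance via NRPE), the script would end with `import sys; sys.exit(main())` so that Nagios sees the exit code. Note that this counts every boot, planned or not, so a deliberate reboot also counts toward the threshold.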

1

Keith · Answer 2 · 2012-02-21T11:24:11+08:00

There are a number of "check_uptime" variants on Nagios Exchange. These let you catch quick reboots without setting max_check_attempts to 1 or 2 for the host check, which would otherwise invite false positives.

This one, for example, can be run via NRPE (uses uptime), but can also check via SNMP (Linux, Windows, etc.).
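For illustration, the idea behind such an uptime check can be sketched in Python roughly as follows. This is not the code of any particular Nagios Exchange plugin; it assumes a Linux host where `/proc/uptime` is readable, and the function names and the 2-day threshold are hypothetical.

```python
# Hypothetical threshold: warn if the host booted less than 2 days ago.
MIN_UPTIME_SECONDS = 2 * 24 * 3600

# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3


def parse_proc_uptime(text):
    """Return uptime in seconds from the contents of /proc/uptime."""
    # The first field of /proc/uptime is seconds since boot.
    return float(text.split()[0])


def evaluate_uptime(uptime_seconds, min_uptime=MIN_UPTIME_SECONDS):
    """Return (exit_code, message); a low uptime means a recent reboot."""
    if uptime_seconds < min_uptime:
        return WARNING, f"WARNING - rebooted {uptime_seconds / 3600:.1f}h ago"
    return OK, f"OK - up {uptime_seconds / 86400:.1f} days"


def main():
    """Read /proc/uptime, print one status line, and return the exit code."""
    try:
        with open("/proc/uptime") as fh:
            uptime = parse_proc_uptime(fh.read())
    except (OSError, ValueError, IndexError) as exc:
        print(f"UNKNOWN - cannot read /proc/uptime: {exc}")
        return UNKNOWN
    status, message = evaluate_uptime(uptime)
    print(message)
    return status
```

Because the uptime stays low for as long as the threshold says, the check keeps reporting the reboot across several check intervals, so the notification is not lost even when the host comes back quickly.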

1
