We use nagios to monitor our server farm, and generally it works great. From time to time, though, the host where nagios runs loses connectivity for a couple of minutes, which makes nagios believe that all servers and services it monitors are down. The result is hundreds of alert mails, shortly followed by hundreds of recovery mails.
Is there any way to configure nagios in such a way that it tests its own connectivity before releasing an avalanche of alert mails?
Yes, you can set parents and childs. If a parent is down, no notification about the child is given. You do need to set the timings properly though (in generic_service and generic_host or whatever templates you use), because when the services are no longer available, it needs to have decideded the parent is down before it would send notifications out for those services.
What I did, is this: