OpenNMS's main dashboard is fantastic, showing all current outages, updated continuously. I'm also using OpenNMS to monitor thresholds - for example, to get notified whenever a disk is more than 90% full. However, I can't find any way to view all outstanding exceeded thresholds the same way I can view outstanding outages. Is this possible?
The thing is, OpenNMS can send lots of notifications (in some circumstances, one outage can generate dozens of notifications) and "threshold exceeded" notifications can get lost in the noise - if I don't catch one, then there's absolutely no indication anywhere in the OpenNMS GUI that something is currently wrong!
Is there some way to set this up? A list of outstanding issues seems like a pretty fundamental feature for an NMS.
EDIT: If it's not possible, what other tools might provide such functionality while also giving the strong graphing features of OpenNMS (I suspect that cacti+nagios will do it, but I'd rather not have to manually configure 2 different monitoring systems for each new computer I want to monitor!)
It seems other people have had the same itch to scratch: OpenNMS Enhancement request, and subsequent blog post about it. I haven't installed this, so can't comment on how easy it is to add to the current 1.8 branch.
The quick solution is to restarting opennms to resend the current thresholds. But the proper approach is to create a "nagging alarm" as part of the thresholds and vacuumd functions.
See: http://www.opennms.org/wiki/Creating_Threshold_Alarms
Look for:
This will give you an indicator in the dashboard as well as a granular solution for creating notification intervals of different lengths.