I've been using Nagios for a while now and recently started using Cacti after being dissatisfied with the lack of scaling and lack of any GUI in MRTG. I'm interested in adding trending to my Nagios installation and wondered what was the best route to go. I've looked around a bit and have seen what's available, but there's not a lot of information around to differentiate them from each other. My Nagios install has about 250 hosts and 1100 service checks, but many of them are just simple network devices and there's only about 20 servers and 300 services associated with them. All servers but 2 are running Windows Server 2003. What are the main highlights of PNP4Nagios vs. nagiosgraph, or would I be better off using some sort of tool to convert the data to RRD form and just view it directly in Cacti? Is there a completely different direction I could go that would be even better? Please comment if you need any more information, I tend to be too wordy and tried to keep this question brief.
Thanks!
I ended up going with pnp4nagios. Installation was a breeze and there was no additional configuration needed to get it automatically graphing trends for every service (that produces graphable data) on every host. The entire process took maybe 30 minutes.
did you try installing nagiosgraph? any feedback you have about the process would be appreciated.
nagiosgraph has seen a lot of updates in the past couple of months, but there are always ways to improve. some things added in nagiosgraph 1.4.x include:
I highly recommend Munin if you want a standalone tool that's easy to set up and extend; you get a good amount of trend graphs enabled out of the box and there are plugins for pretty much anything else you might need. Munin is RRD based but stands alone from Nagios; installation is pretty much analagous to Nagios, you install a central server and a daemon on each node you want to monitor (via package install on most Linux distros) and then optionally do a small amount of configuration on the nodes if you want to add non-default data.
I tried Zenoss and to start with it looks quite impressive, but unfortunately that's where it ends. Underneath it all you're left with a system that isn't very useful, the reports aren't useful at all and when you start adding more than about 50 devices it starts slowing down to a point where it's almost un-usable.
*disclaimer, I'm the Zenoss Community Manager
Zenoss does exactly what you are looking for. It is an Open Source network monitoring platform that does availability monitoring (like Nagios) and performance monitoring (like Cacti). It's built to easily scale to thousands of devices and does Windows machines via WMI and SNMP. There are no configuration files and you get your monitoring, graphing and it's easy to extend. You can also re-use any custom Nagios and Cacti plugins you have and there are no agents to install on the nodes.
http://community.zenoss.org