I am debugging a problem with one of our systems. Every Sunday, it stops communicating with another server. If we reboot both servers, communication works again.
I was wondering if there are any small footprint apps that monitor TCP port availability and network connectivity, possibly logging any downtime. I'd also like it to be open source if possible, but if there is another solution that is proprietary, I'd like to hear about it also.
I'll buck the trend and give you a scripted solution.
Grab a copy of netcat for Windows and modify the script with the path of netcat (or just dump nc.exe in the %SystemRoot%\system32 directory and change that line to just nc.exe). Also, modify the script to store its log file wherever you want.
There ya' go. You could run that as a "scheduled task".
This isn't fancy at all, but it would work. If you wanted to make it fancier, you could use the date or time to change the log file name such that you get a new log every day, week, etc. You could delete old logs, etc. There's a lot you could do with a simple script like this... heh heh...
Addendum:
Here's the fancier "command line arguments" version. It logs into whatever directory you specify, in the filename "MONTIOR_HOST_YYYY-MM-DD.log". Call with the syntax:
This would be suitable for calling from multiple scheduled tasks, scripts, etc, to monitor multiple servers or multiple ports.
I have too much fun writing scripts...
I can recommend Zenoss. It is quite comprehensive, but easy to set up, free software and can do everything you need.
Of course, you could also just whip up a few Perl/Shell scripts, but why reinvent the wheel?
My current favorite is Zabbix. There is also Nagios and maybe a few others. But if it's just one service I would probably write a perl script to monitor it and log to sqlite.
There is always Nagios.
I've had good luck with What's Up Gold in the past. It's easy to configure and does keep historical information. Unfortunately, it is not open source. I would love to see an open source product with the same features.
I would go to zabbix too (I'm using it and it works very great in a distributed environment) and also splunk which is a wonderful tool.
You have several options, with differents between them:
There are others like Ganglia and Ximon, but are even older than nagios.