Ping a Specific Port

Question

aaronk6

Asked: 2012-09-04 06:20:28 +0800 CST2012-09-04 06:20:28 +0800 CST 2012-09-04 06:20:28 +0800 CST

How to find the reason for a weekly downtime on an Ubuntu web server hosted by AWS?

772

We started monitoring our web server using Pingdom and found out that we have a downtime of a few minutes every Sunday at 0:00 UTC.

The test runs every minute and checks if a successful HTTP response (code 200) is returned on port 80. The test fails due to a timeout (no response after 30 seconds).

Here's what we've already checked – without success:

Since we run our webserver behind a load balancer, I've set the Pingdom test on the load balancer's public DNS and the webserver's public DNS in order to find out if there's a problem with the AWS load balancer – both tests return the same result
We set up Munin on our webserver. Everything looked fine even after the failure. Since the last failure lasted only 2 minutes I suppose Munin couldn't capture a potential problem (it only checks every 5 minutes)
I have checked /var/log/apache2/error.log and /var/log/syslog for suspicious entries
I have checked /etc/cron.weekly and /etc/crontab for suspicious entries
I have searched for files created or last-modified during 0:00 and 0:15 using this method:

touch -t 201209020000 start
touch -t 201209020015 end
find / -newer start -and ! -newer end

(nothing found)

Has anybody experienced a similar problem? Any proposals on how to find the reason for this behavior?

It's Ubuntu 10.04 LTS running on an AWS m1.large instance.

Thanks!

1 Answers

Voted

Doka · Answer 1 · 2012-11-25T23:51:21+08:00

Doka

2012-11-25T23:51:21+08:002012-11-25T23:51:21+08:00

There are some reports out, that the update-apt-xapi process takes lot of cpu usage for couple of minutes. It runs on a weekly schedule. It can take your box down, if the regular load is also high. The command runs update-apt-xapian-index to update the index of software packages.

See few hints for workarounds here: http://empoccz.wordpress.com/2012/01/02/ubuntu-update-apt-xapi-takes-lot-of-cpu-usage-ii/ or https://askubuntu.com/questions/79481/is-100-cpu-usage-harmful-while-update-apt-xapi-runs

1

How to find the reason for a weekly downtime on an Ubuntu web server hosted by AWS?

Can you pass user/pass for HTTP Basic Authentication in URL parameters?

Ping a Specific Port

Check if port is open or closed on a Linux server?

How to automate SSH login with password?

How do I tell Git for Windows where to find my private RSA key?

What's the default superuser username/password for postgres after a new install?

What port does SFTP use?

Command line to list users in a Windows Active Directory group?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?