Ping a Specific Port

Question

Glen Solsberry

Asked: 2010-01-06 05:35:07 +0800 CST2010-01-06 05:35:07 +0800 CST 2010-01-06 05:35:07 +0800 CST

Not all cron jobs in /etc/cron.daily are running

772

I have a Debian GNU/Linux 4.0 box (that cannot be upgraded) running 24x7. It has several jobs in /etc/cron.daily, including our backup scripts. I noticed several weeks ago that the backup script was not running with any regularity.

This morning, I ran the cron directory manually (nice run-parts --report /etc/cron.daily) which is seen in both /etc/anacrontab and /etc/crontab. I got an email for logwatch, but not for any of the other jobs. Our backup scripts, specifically, have a large amount of output, and take a few hours. I have tried rearranging the jobs in /etc/cron.daily, with no effect, and I've recently removed anacron, since this box should "never" experience downtime.

Running any of the jobs individually seems to work fine. I've just added the backup script to /etc/crontab manually to see if it runs properly.

Does anyone have other suggestions?

6 Answers

Voted

Glen Solsberry · Answer 1 · 2010-01-15T09:06:48+08:00

Best Answer

Glen Solsberry

2010-01-15T09:06:48+08:002010-01-15T09:06:48+08:00

The problem turns out to be that Debian does not allow '.' in the filename of a cron job stored in /etc/cron.(d|daily|weekly|monthly). Remove the '.', and the job runs fine.

11

Bart Silverstrim · Answer 2 · 2010-01-06T06:03:59+08:00

Are the logs for cron showing any errors, or that they're running at the specified times?

What happens if you watch the processes run at the times they're supposed to be running (i.e., if they are scheduled to run at 4:00PM what does the system look like in the process list and logs at 4:01?

Silly question, but you said you get emails from logwatch but not for any other jobs. Did you double check that the jobs are actually failing, and not that there's a communication issue with emails notifying you of completed jobs?

Are the jobs running as the proper user context, with permissions to do things needed? These are failing only sometimes but not others?

Can you find anything happening during those times that they fail but not other times (you said this is a no-downtime system...is it doing something where the scripts are overlapping so they can't complete? or there are processes that would block them?)

Is there anything configured on the system that kills processes at a certain load level? Watchdog timers, etc. may kill a process if the load is too great or processor/ram quota goes too high, etc. or the system becomes unresponsive. Another reason to see if someone can keep an eye on it through a ssh session at a point when the server should be running the job.

Richard · Answer 3 · 2010-01-06T06:18:49+08:00

Richard

2010-01-06T06:18:49+08:002010-01-06T06:18:49+08:00

Bart nailed about everything I would look for, except maybe disk space. When the jobs all run together to they run out of space? Is there something else happening at that time that might put a big, temp load on the drive space?

Another thing you might try, if you can, is to run them at a different time. Either all at once, say 5:00, or individually, 4:00 / 4:30 / 5:00 / 5:30 / etc...

2

Christian · Answer 4 · 2010-01-06T08:24:32+08:00

Christian

2010-01-06T08:24:32+08:002010-01-06T08:24:32+08:00

another thing could be the environment. have the scripts already successfully run via cron? the cron environment can be different to your testing environment (PATH, ...). is it possible to add logging to your backup scripts with logger or echo commands?

1

peterpan69007 · Answer 5 · 2010-01-19T09:38:51+08:00

peterpan69007

2010-01-19T09:38:51+08:002010-01-19T09:38:51+08:00

I also had the same problem with '.' in script names. Removing any '.' in the script name resolved my problem (even the extension ".sh"!)

0

dubiousjim · Answer 6 · 2010-02-17T08:51:19+08:00

dubiousjim

2010-02-17T08:51:19+08:002010-02-17T08:51:19+08:00

The --lsbsysinit or --regex options to run-parts allow you to change which filenames are considered valid.

0

Not all cron jobs in /etc/cron.daily are running

Ping a Specific Port

How do I tell Git for Windows where to find my private RSA key?

How do you restart php-fpm?

What's the default superuser username/password for postgres after a new install?

What port does SFTP use?

Resolve host name from IP address

How can I sort du -h output by size

Command line to list users in a Windows Active Directory group?

What is a Pem file and how does it differ from other OpenSSL Generated Key File Formats?

How to determine if a bash variable is empty?