last friday there was a power outage at one of the sites i run, it was long enough that the UPSs didn't last and much of the equipment crashed. it all came back up ok, but on one system(haven't noticed it on others yet) crontab does not appear to be running the jobs anymore.
I have a cronjob that runs twice a day, everyday and emails me output. it has not done this since friday night. if i run the script manually it works fine, cron is running, running an strace on it, shows activity (doing a bunch of stat calls). the crontab entries have not changed: 0 7,16 * * * /usr/local/bin/sav-update.sh i had 1>/dev/null 2>/dev/null on the first one but took them off to try to get it to generate output to email. it did not. runinng strace on the cron process i do not see it even attempting to run the script. i set it to run 16:41, and watched the clock on the server as well as watched strace and all it did was: mail:~# tail -f debug.log --- SIGCHLD (Child exited) --- wait4(-1, [WIFEXITED(s) && WEXITSTATUS(s) == 0], WNOHANG, NULL) = 16594 wait4(-1, 0xbffff9f8, WNOHANG, NULL) = -1 ECHILD (No child processes) sigreturn() = ? (mask now []) time(NULL) = 1014770402 time(NULL) = 1014770402 rt_sigprocmask(SIG_BLOCK, [CHLD], [], 8) = 0 rt_sigaction(SIGCHLD, NULL, {0x8049d98, [], SA_RESTART|0x4000000}, 8) = 0 rt_sigprocmask(SIG_SETMASK, [], NULL, 8) = 0 nanosleep({59, 0}, {59, 0}) = 0 time(NULL) = 1014770461 stat("crontabs", {st_mode=S_IFDIR|0700, st_size=4096, ...}) = 0 stat("/etc/crontab", {st_mode=S_IFREG|0644, st_size=596, ...}) = 0 stat("/etc/cron.d", {st_mode=S_IFDIR|0755, st_size=4096, ...}) = 0 stat("/etc/cron.d/sendmail", {st_mode=S_IFREG|0644, st_size=2324, ...}) = 0stat("/etc/cron.d/mailman", {st_mode=S_IFREG|0644, st_size=260, ...}) = 0 stat("/etc/cron.d/sysstat", {st_mode=S_IFREG|0644, st_size=236, ...}) = 0 time(NULL) = 1014770461 rt_sigprocmask(SIG_BLOCK, [CHLD], [], 8) = 0 rt_sigaction(SIGCHLD, NULL, {0x8049d98, [], SA_RESTART|0x4000000}, 8) = 0 rt_sigprocmask(SIG_SETMASK, [], NULL, 8) = 0 nanosleep({60, 0}, {60, 0}) = 0 time(NULL) = 1014770521 this is driving me up the wall. i modified the script to create a file in /root when it runs, just incase something else in it is broken, but the file is not created so clearly cron is not working right anymore im gonna try to reinstall cron, does anyone have any ideas what could be going wrong ? the system was up for 331 days before the outage ... nate