On Monday 02 February 2009 15:43:19 Victor Duchovni wrote: > On Mon, Feb 02, 2009 at 01:50:30PM +0100, Gaute Amundsen wrote: > > Jan 25 05:59:19 hotell01 postfix/smtp[595]: fatal: watchdog timeout > > Jan 25 05:59:20 hotell01 postfix/master[734]: warning: process > > /usr/libexec/postfix/smtp pid 595 exit status 1 > > Jan 25 05:59:20 hotell01 postfix/master[734]: warning: > > /usr/libexec/postfix/smtp: bad command startup -- throttling > > This happens when the smtp(8) process has been stuck waiting for something > to happen for 5 hours. What was happening around 00:59:xx on the same day?
Apparently nothing in particular: http://pastebin.ca/1325397 But there _is_ some kind of change.. Nothing unusual in /var/log/messages Hm.. our Munin http://munin.projects.linpro.no/ has lost the fine details that far back but there is a regular high peak on IOstsat just before 01:00 every night. Backup related I guess. both today and Jan 25 was a monday, so I had a look at cron.weekly which runs at sunday at 04:22 but nothing suspicious so far. What would be the usual things smtp(8) waits for? Any suggestions what to do next time I catch the problem live? ps -U postfix u ? How about the DBERROR? Could that be related? I found something about this being related to having several bdb versions. We have these: /lib/libdb-4.1.so /lib/libdb-3.3.so /lib/libdb-4.2.so /lib/libdb.so.3 /lib/libdb2.so.3 > > Also, > > http://www.postfix.org/DEBUG_README.html#mail Ah, thanks :) Gaute btw: # /usr/sbin/postconf -n alias_database = hash:/etc/aliases alias_maps = hash:/etc/aliases command_directory = /usr/sbin config_directory = /etc/postfix daemon_directory = /usr/libexec/postfix debug_peer_level = 2 html_directory = no inet_interfaces = localhost mail_owner = postfix mailq_path = /usr/bin/mailq.postfix manpage_directory = /usr/share/man mydestination = $myhostname, localhost.$mydomain, localhost newaliases_path = /usr/bin/newaliases.postfix queue_directory = /var/spool/postfix readme_directory = /usr/share/doc/postfix-2.2.10/README_FILES sample_directory = /usr/share/doc/postfix-2.2.10/samples sendmail_path = /usr/sbin/sendmail.postfix setgid_group = postdrop unknown_local_recipient_reject_code = 550 -- Programmerer - Pixelhospitalet AS Prinsessealleen 50, 0276 Oslo Tlf. 24 12 97 81 - 9074 7344