On Monday 02 February 2009 15:43:19 Victor Duchovni wrote:
> On Mon, Feb 02, 2009 at 01:50:30PM +0100, Gaute Amundsen wrote:
> > Jan 25 05:59:19 hotell01 postfix/smtp[595]: fatal: watchdog timeout
> > Jan 25 05:59:20 hotell01 postfix/master[734]: warning: process
> > /usr/libexec/postfix/smtp pid 595 exit status 1
> > Jan 25 05:59:20 hotell01 postfix/master[734]: warning:
> > /usr/libexec/postfix/smtp: bad command startup -- throttling
>
> This happens when the smtp(8) process has been stuck waiting for something
> to happen for 5 hours. What was happening around 00:59:xx on the same day?
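
The 00:59 figure is simply the timestamp of the fatal line minus the 5 hours
mentioned above. A minimal sketch of that arithmetic, assuming GNU date; the
helper name stuck_since is made up for illustration, and since syslog lines
carry no year, date fills in the current one:

```shell
# Hypothetical helper: given a syslog timestamp, print the moment 5 hours
# (18000 seconds) earlier. Assumes GNU date (-d and @epoch support).
stuck_since() {
    # Convert the timestamp to epoch seconds, working in UTC so that
    # daylight saving changes cannot shift the result.
    ts=$(date -u -d "$1" +%s) || return 1
    # Step back 18000 seconds and print month-day and time.
    date -u -d "@$((ts - 18000))" '+%m-%d %H:%M:%S'
}

stuck_since "Jan 25 05:59:19"
```

The result is the point in the logs around which to look for whatever the
process started waiting on.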

Apparently nothing in particular:
 
http://pastebin.ca/1325397

But there _is_ some kind of change..

Nothing unusual in /var/log/messages

Hm..

Our Munin (http://munin.projects.linpro.no/) has lost the fine detail that
far back, but there is a regular high peak in iostat just before 01:00 every
night. Backup related, I guess.

Both today and Jan 25 were Mondays, so I had a look at cron.weekly, which runs
on Sundays at 04:22, but found nothing suspicious so far.

What would be the usual things smtp(8) waits for?

Any suggestions what to do next time I catch the problem live?
 ps -U postfix u  ?
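
A slightly more detailed variant of that ps call, as a rough sketch only. It
assumes the Linux procps version of ps; the function name snapshot_procs is
made up for illustration. The wchan column shows the kernel function a
sleeping process is blocked in, and etime shows how long it has been running:

```shell
# Hypothetical helper: list every process owned by a user together with
# its state, wait channel and elapsed run time.
# Assumes procps ps (Linux); other ps implementations use other options.
snapshot_procs() {
    user="${1:-postfix}"
    ps -U "$user" -o pid,stat,wchan:20,etime,args
}
```

Running `snapshot_procs postfix` while an smtp process is hung should show
which process has been alive for hours and what it is sleeping in.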

How about the DBERROR? Could that be related?

I found something about this being related to having several bdb versions.
We have these:
/lib/libdb-4.1.so
/lib/libdb-3.3.so
/lib/libdb-4.2.so
/lib/libdb.so.3
/lib/libdb2.so.3
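
To see which of these the Postfix binaries actually link against, something
like the following might help. It is a sketch under assumptions: it relies on
ldd being available, and the helper name libdb_deps is made up for
illustration.

```shell
# Hypothetical helper: print the Berkeley DB shared libraries that a
# binary is dynamically linked against, as reported by ldd.
libdb_deps() {
    # Match libdb.so.N, libdb2.so.N, libdb-4.2.so and similar names,
    # but not unrelated libraries such as libdbus.
    ldd "$1" 2>/dev/null | grep -E 'libdb[-0-9.]*\.so'
}
```

For example, `libdb_deps /usr/libexec/postfix/smtp` prints nothing if that
binary does not link to any libdb at all.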

>
> Also,
>
>       http://www.postfix.org/DEBUG_README.html#mail

Ah, thanks :)

Gaute

btw:

# /usr/sbin/postconf -n
alias_database = hash:/etc/aliases
alias_maps = hash:/etc/aliases
command_directory = /usr/sbin
config_directory = /etc/postfix
daemon_directory = /usr/libexec/postfix
debug_peer_level = 2
html_directory = no
inet_interfaces = localhost
mail_owner = postfix
mailq_path = /usr/bin/mailq.postfix
manpage_directory = /usr/share/man
mydestination = $myhostname, localhost.$mydomain, localhost
newaliases_path = /usr/bin/newaliases.postfix
queue_directory = /var/spool/postfix
readme_directory = /usr/share/doc/postfix-2.2.10/README_FILES
sample_directory = /usr/share/doc/postfix-2.2.10/samples
sendmail_path = /usr/sbin/sendmail.postfix
setgid_group = postdrop
unknown_local_recipient_reject_code = 550


-- 
Programmerer - Pixelhospitalet AS
Prinsessealleen 50, 0276 Oslo 
Tlf. 24 12 97 81 - 9074 7344
