>>>>> "Tore" == Tore Anderson <[EMAIL PROTECTED]> writes:
Tore> * Brian May
>> note: the names of the lock files it complained about were
>> different for each email, including:
>> /var/run/munin/munin-{update,graph,html}.lock.
Tore> Hmm. Is there some stuck Munin process on your system?
Tore> If so, could you try to see what it's doing (strace, ls -l
Tore> /proc/<pid>/fd/)?
Currently there is:
--- cut ---
>ps auwx | grep munin
bam 20901 0.0 0.3 5856 3556 pts/1 S+ May27 0:00 w3m
http://bugs.debian.org/munin
root 6502 0.0 0.5 6724 5420 ? Ss May26 0:01
/usr/sbin/munin-node
bam 10138 0.0 0.0 1556 532 pts/9 S+ 22:19 0:00 grep munin
--- cut ---
I am too sleepy to even consider if this is
right/wrong/normal/abnormal. I suspect it is normal.
The problem may not even have been munin's fault[1]. If you agree,
feel free to close the bug.
In any case, because I don't want you to feel I didn't read your
instructions: ;-)
--- cut ---
>sudo ls -l /proc/6502/fd
Password:
total 6
lr-x------ 1 root root 64 May 30 22:18 0 -> /dev/null
l-wx------ 1 root root 64 May 30 22:18 1 -> /dev/null
l-wx------ 1 root root 64 May 30 22:18 2 -> /var/log/munin/munin-node.log
lr-x------ 1 root root 64 May 30 22:18 3 -> /etc/munin/munin-node.conf
l-wx------ 1 root root 64 May 30 22:18 4 -> /var/log/munin/munin-node.log
lrwx------ 1 root root 64 May 30 22:18 5 -> socket:[12252]
--- cut ---
--- cut ---
>sudo strace -p 6502
Process 6502 attached - interrupt to quit
select(8, [5], NULL, NULL, {4, 544000} <unfinished ...>
Process 6502 detached
--- cut ---
Note:
[1] At the time I also got several other emails, so I suspect the
problem may have been excessive load on my system due to some unknown
problem (DOS attack???). I don't know why the process count got so
high. This probably slowed processes down more then usual, causing
conflicts with lock files.
--- cut ---
From: [EMAIL PROTECTED]
Subject: ** PROBLEM alert - gateway/Total Processes is CRITICAL **
To: [EMAIL PROTECTED]
Date: Thu, 26 May 2005 23:33:30 +1000 (EST)
Return-Path: <[EMAIL PROTECTED]>
X-Original-To: [EMAIL PROTECTED]
Delivered-To: [EMAIL PROTECTED]
Received: from localhost (localhost [127.0.0.1])
by snoopy.microcomaustralia.com.au (Postfix) with ESMTP id CDA62D817E
for <[EMAIL PROTECTED]>; Thu, 26 May 2005 23:33:35 +1000 (EST)
Received: from snoopy.microcomaustralia.com.au ([127.0.0.1])
by localhost (snoopy [127.0.0.1]) (amavisd-new, port 10024) with LMTP
id 14933-04 for <[EMAIL PROTECTED]>;
Thu, 26 May 2005 23:33:30 +1000 (EST)
Received: by snoopy.microcomaustralia.com.au (Postfix, from userid 118)
id 929D8D8179; Thu, 26 May 2005 23:33:30 +1000 (EST)
Message-Id: <[EMAIL PROTECTED]>
X-Virus-Scanned: by amavisd-new-20030616-p10 (Debian) at snoopy.apana.org.au
***** Nagios *****
Notification Type: PROBLEM
Service: Total Processes
Host: gateway
Address: 127.0.0.1
State: CRITICAL
Date/Time: Thu May 26 23:33:30 EST 2005
Additional Info:
PROCS CRITICAL: 353 processes
--- cut ---
From: [EMAIL PROTECTED] (Cron Daemon)
Subject: Cron <[EMAIL PROTECTED]> [ -x /usr/sbin/amavis-stats ] &&
/usr/sbin/amavis-stats -q /var/log/mail.info
To: [EMAIL PROTECTED]
Date: Fri, 27 May 2005 02:09:31 +1000 (EST)
Return-Path: <[EMAIL PROTECTED]>
X-Original-To: [EMAIL PROTECTED]
Delivered-To: [EMAIL PROTECTED]
Received: from localhost (localhost [127.0.0.1])
by snoopy.microcomaustralia.com.au (Postfix) with ESMTP id C5C6FD8A1A
for <[EMAIL PROTECTED]>; Fri, 27 May 2005 02:09:35 +1000 (EST)
Received: from snoopy.microcomaustralia.com.au ([127.0.0.1])
by localhost (snoopy [127.0.0.1]) (amavisd-new, port 10024) with LMTP
id 26964-05-7 for <[EMAIL PROTECTED]>;
Fri, 27 May 2005 02:09:31 +1000 (EST)
Received: by snoopy.microcomaustralia.com.au (Postfix, from userid 117)
id 8E6D8D817E; Fri, 27 May 2005 02:09:31 +1000 (EST)
X-Cron-Env: <SHELL=/bin/sh>
X-Cron-Env: <HOME=/var/lib/amavis-stats>
X-Cron-Env: <PATH=/usr/bin:/bin>
X-Cron-Env: <LOGNAME=amavis-stats>
Message-Id: <[EMAIL PROTECTED]>
X-Virus-Scanned: by amavisd-new-20030616-p10 (Debian) at snoopy.apana.org.au
amavis-stats: error: warning: Could not lock /var/lock/amavis-stats: Resource
temporarily unavailable
--- cut ---
From: [EMAIL PROTECTED]
Subject: ** RECOVERY alert - gateway/Total Processes is OK **
To: [EMAIL PROTECTED]
Date: Fri, 27 May 2005 02:13:30 +1000 (EST)
Return-Path: <[EMAIL PROTECTED]>
X-Original-To: [EMAIL PROTECTED]
Delivered-To: [EMAIL PROTECTED]
Received: from localhost (localhost [127.0.0.1])
by snoopy.microcomaustralia.com.au (Postfix) with ESMTP id C1312D823B
for <[EMAIL PROTECTED]>; Fri, 27 May 2005 02:13:34 +1000 (EST)
Received: from snoopy.microcomaustralia.com.au ([127.0.0.1])
by localhost (snoopy [127.0.0.1]) (amavisd-new, port 10024) with LMTP
id 01608-01-2 for <[EMAIL PROTECTED]>;
Fri, 27 May 2005 02:13:30 +1000 (EST)
Received: by snoopy.microcomaustralia.com.au (Postfix, from userid 118)
id C3CADD817E; Fri, 27 May 2005 02:13:30 +1000 (EST)
Message-Id: <[EMAIL PROTECTED]>
X-Virus-Scanned: by amavisd-new-20030616-p10 (Debian) at snoopy.apana.org.au
***** Nagios *****
Notification Type: RECOVERY
Service: Total Processes
Host: gateway
Address: 127.0.0.1
State: OK
Date/Time: Fri May 27 02:13:30 EST 2005
Additional Info:
PROCS OK: 175 processes
--- cut ---
--
Brian May <[EMAIL PROTECTED]>
--
To UNSUBSCRIBE, email to [EMAIL PROTECTED]
with a subject of "unsubscribe". Trouble? Contact [EMAIL PROTECTED]