This is the same issue that I've been hitting, and that requires the serial console / DDB stuff described in the debugging deadlocks web page that I pointed you at ...

So far *knock on wood* since adding all of the debugging to one of my server, none of mine have done it ... but the more ppl experiencing this, and getting the debugging in place to provide proper kernel traces, the better ...

On Sat, 1 Jul 2006, Francisco Reyes wrote:

I believe this may be related to the NFS issues mentioned recent, but hopefully I may have captured enough info to help others troubleshoot.. I got the header of some ps commands.. and when was about to do full listing of the same ps commands to files.. the machine hung up.

The machine is 6.1 Stable around 6-25 ( plus or minus 1 day).


iostat 5 (not much of a load)
    tty             da0             cpu
tin tout  KB/t tps  MB/s  us ni sy in id
 0   31 17.71 125  2.17  20  0  5  1 74
 0   26  8.57  23  0.19   0  0  1  0 99
 0    9 33.73  10  0.34   0  0  0  0 99
 0   21  8.42  18  0.15   0  0  1  1 99
 0    9 15.92  58  0.90   0  0  0  0 99
 0    9 15.18   7  0.10   0  0  0  0 99
 0   53 12.93   9  0.11   0  0  1  0 99
 0   31  5.17  58  0.29   0  0  1  1 99

vmstat 5 (very high 'b' column)
procs      memory      page                   disk   faults      cpu
r b w     avm    fre  flt  re  pi  po  fr  sr da0   in   sy  cs us sy id
0 248 2 1410436 110728 1519   2   0   0 1644 264   0 4481 8862 9168 20  6 74
0 248 0 1410436 110796    0   0   0   0  13   0   4  700   40 1426  0  1 99
0 248 0 1410436 110764    1   0   0   0  39   0  14 1253  722 2615  0  1 99
0 248 0 1410436 110720    1   0   0   0  10   0   5  407  396 899  0  1 99
0 248 0 1410436 110704    1   0   0   0  60   0  21 2822  360 5695  0  2 98
0 248 0 1410436 110684    1   0   0   0  10   0   7  538  434 1166  0  1 99
0 248 0 1410436 110668    0   0   0   0  75   0  51  576  163 1026  0  0 99
0 248 0 1410436 110696    0   0   0   0  23   0  31 1171  190 2271  0  1 99

vmstat 5
procs      memory      page                   disk   faults      cpu
r b w     avm    fre  flt  re  pi  po  fr  sr da0   in   sy  cs us sy id
0 250 1 1399688 152000 1517   2   0   0 1643 264   0 4479 8853 9163 20  6 74
0 250 0 1399688 151968    2   0   0   0  25   0  28 1395  966 2852  0  2 98
0 250 0 1399692 151892    1   0   0   0  12   0   6  446  540 986  0  0 99
0 250 2 1399692 151604    1   0   0   0  50   0  37  803  675 1611  0  1 99


Don't recall which ps..
411     1       0 ufs     ??  Ds     0:04.81 /usr/sbin/mountd -r
37675 650 0 ufs ?? D 0:00.46 /usr/bin/perl /data/backaway/mailarchive/client/bin/smtpproxy 127.0.0.1:10026 127.0.0.1:10025 (perl5.8.7) 37919 650 0 ufs ?? D 0:00.46 /usr/bin/perl /data/backaway/mailarchive/client/bin/smtpproxy 127.0.0.1:10026 127.0.0.1:10025 (perl5.8.7) 39306 650 0 ufs ?? D 0:00.39 /usr/bin/perl /data/backaway/mailarchive/client/bin/smtpproxy 127.0.0.1:10026 127.0.0.1:10025 (perl5.8.7) 40214 38649 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40220 32943 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40223 33257 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40226 32942 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40228 33199 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40231 38599 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40233 32896 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40236 33224 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40238 32876 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40240 32976 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40242 35580 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40246 35593 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40248 32923 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40252 35596 4100 ufs ?? Ds 0:00.01 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40253 29833 4100 ufs ?? Ds 0:00.01 /usr/local/bin/maildrop -d [EMAIL PROTECTED]



ps ax -O ppid,flags,mwchan | awk '$6 ~ /^D/ || $6 == "STAT"'
PID  PPID       F MWCHAN  TT  STAT      TIME COMMAND
  2     0     204 -       ??  DL     0:17.68 [g_event]
  3     0     204 -       ??  DL     9:14.85 [g_up]
  4     0     204 -       ??  DL    10:50.81 [g_down]
  5     0     204 -       ??  DL     0:02.93 [thread taskq]
  6     0     204 -       ??  DL     0:00.00 [acpi_task0]
  7     0     204 -       ??  DL     0:00.00 [acpi_task1]
  8     0     204 -       ??  DL     0:00.00 [acpi_task2]
  9     0     204 -       ??  DL     0:00.00 [kqueue taskq]
 15     0     204 -       ??  DL     8:47.55 [yarrow]
 27     0     204 -       ??  DL     0:01.72 [fdc0]
 28     0     204 psleep  ??  DL     0:43.74 [pagedaemon]
 29     0     204 psleep  ??  DL     0:00.00 [vmdaemon]
 30     0     20c pgzero  ??  DL     7:35.27 [pagezero]
 31     0     204 psleep  ??  DL     0:57.11 [bufdaemon]
 32     0     204 syncer  ??  DL     8:46.07 [syncer]
 33     0     204 vlruwt  ??  DL     0:28.29 [vnlru]
 34     0     204 sdflus  ??  DL     2:35.54 [softdepflush]
 35     0     204 -       ??  DL     1:01.20 [schedcpu]
411     1       0 ufs     ??  Ds     0:04.81 /usr/sbin/mountd -r
39306 650 0 ufs ?? D 0:00.39 /usr/bin/perl /data/backaway/mailarchive/client/bin/smtpproxy 127.0.0.1:10026 127.0.0.1:10025 (perl5.8.7) 40214 38649 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40220 32943 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40223 33257 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40226 32942 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40228 33199 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40231 38599 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40233 32896 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40236 33224 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40238 32876 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40240 32976 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40242 35580 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40246 35593 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED] 40248 32923 4100 ufs ?? Ds 0:00.00 /usr/local/bin/maildrop -d [EMAIL PROTECTED]


ps axlww
UID   PID  PPID CPU PRI NI   VSZ   RSS MWCHAN STAT  TT       TIME COMMAND
  0     0     0   0  12  0     0     0 -      WLs   ??    0:00.00 [swapper]
0 1 0 0 8 0 744 268 wait ILs ?? 0:00.01 /sbin/init --
  0     2     0   0  -8  0     0     8 -      DL    ??    0:17.68 [g_event]
  0     3     0   0  -8  0     0     8 -      DL    ??    9:14.93 [g_up]
  0     4     0   0  -8  0     0     8 -      DL    ??   10:50.90 [g_down]
0 5 0 0 8 0 0 8 - DL ?? 0:02.93 [thread taskq] 0 6 0 0 8 0 0 8 - DL ?? 0:00.00 [acpi_task0] 0 7 0 0 8 0 0 8 - DL ?? 0:00.00 [acpi_task1] 0 8 0 0 8 0 0 8 - DL ?? 0:00.00 [acpi_task2] 0 9 0 0 8 0 0 8 - DL ?? 0:00.00 [kqueue taskq] 0 10 0 153 171 0 0 8 - RL ?? 3939:36.70 [idle: cpu1] 0 11 0 148 171 0 0 8 - RL ?? 4416:30.08 [idle: cpu0] 0 12 0 2 -44 0 0 8 - WL ?? 50:41.76 [swi1: net] 0 13 0 0 -32 0 0 8 - WL ?? 8:56.20 [swi4: clock sio]
  0    14     0   0 -36  0     0     8 -      WL    ??    0:00.00 [swi3: vm]
  0    15     0   0  96  0     0     8 -      DL    ??    8:47.68 [yarrow]
UID   PID  PPID CPU PRI NI   VSZ   RSS MWCHAN STAT  TT       TIME COMMAND
  0     0     0   0  12  0     0     0 -      WLs   ??    0:00.00 [swapper]
0 1 0 0 8 0 744 268 wait ILs ?? 0:00.01 /sbin/init --
  0     2     0   0  -8  0     0     8 -      DL    ??    0:17.68 [g_event]
  0     3     0   0  -8  0     0     8 -      DL    ??    9:14.93 [g_up]
  0     4     0   0  -8  0     0     8 -      DL    ??   10:50.90 [g_down]
0 5 0 0 8 0 0 8 - DL ?? 0:02.93 [thread taskq] 0 6 0 0 8 0 0 8 - DL ?? 0:00.00 [acpi_task0] 0 7 0 0 8 0 0 8 - DL ?? 0:00.00 [acpi_task1] 0 8 0 0 8 0 0 8 - DL ?? 0:00.00 [acpi_task2] 0 9 0 0 8 0 0 8 - DL ?? 0:00.00 [kqueue taskq] 0 10 0 153 171 0 0 8 - RL ?? 3939:36.70 [idle: cpu1] 0 11 0 148 171 0 0 8 - RL ?? 4416:30.08 [idle: cpu0] 0 12 0 2 -44 0 0 8 - WL ?? 50:41.76 [swi1: net] 0 13 0 0 -32 0 0 8 - WL ?? 8:56.20 [swi4: clock sio]
  0    14     0   0 -36  0     0     8 -      WL    ??    0:00.00 [swi3: vm]
  0    15     0   0  96  0     0     8 -      DL    ??    8:47.68 [yarrow]
0 16 0 0 -24 0 0 8 - WL ?? 0:00.01 [swi6: task queue]
  0    17     0   0 -24  0     0     8 -      WL    ??    0:00.00 [swi6: +]
  0    18     0   0 -28  0     0     8 -      WL    ??    6:34.50 [swi5: +]
0 19 0 0 -40 0 0 8 - WL ?? 6:42.62 [swi2: cambio] 0 20 0 0 -52 0 0 8 - WL ?? 0:00.00 [irq9: acpi0] 0 21 0 0 -64 0 0 8 - WL ?? 0:00.00 [irq14: ata0] 0 22 0 0 -64 0 0 8 - WL ?? 0:00.00 [irq15: ata1] 0 23 0 0 -68 0 0 8 - WL ?? 8:13.27 [irq26: bge0] 0 24 0 0 -68 0 0 8 - WL ?? 50:29.26 [irq27: bge1] 0 25 0 0 -60 0 0 8 - WL ?? 0:00.01 [irq1: atkbd0] 0 26 0 0 -48 0 0 8 - WL ?? 0:00.00 [swi0: sio]
  0    27     0   0  -8  0     0     8 -      DL    ??    0:01.72 [fdc0]
0 28 0 0 -16 0 0 8 psleep DL ?? 0:43.74 [pagedaemon]
  0    29     0   0  20  0     0     8 psleep DL    ??    0:00.00 [vmdaemon]
  0    30     0   0 171  0     0     8 pgzero DL    ??    7:35.27 [pagezero]
0 31 0 0 -16 0 0 8 psleep DL ?? 0:57.11 [bufdaemon]
  0    32     0   0  20  0     0     8 syncer DL    ??    8:46.35 [syncer]
  0    33     0   0  -4  0     0     8 vlruwt DL    ??    0:28.29 [vnlru]
0 34 0 0 -16 0 0 8 sdflus DL ?? 2:35.54 [softdepflush]
  0    35     0   0 -40  0     0     8 -      DL    ??    1:01.29 [schedcpu]
0 116 1 255 20 0 1220 648 pause Is ?? 0:00.00 adjkerntz -i
  0   295     1   0   4  0   516   276 select Is    ??    0:05.71 /sbin/devd
0 337 1 0 96 0 1344 908 select Ss ?? 5:54.01 /usr/sbin/syslogd -s 0 354 1 0 96 0 1412 1032 select Ss ?? 0:07.06 /usr/sbin/rpcbind 0 411 1 0 -4 0 1536 1128 ufs Ds ?? 0:04.81 /usr/sbin/mountd -r 0 413 1 0 4 0 1364 956 accept Is ?? 0:00.02 nfsd: master (nfsd) 0 414 413 4 4 0 1240 716 - S ?? 101:39.74 nfsd: server (nfsd) 0 415 413 0 4 0 1240 716 - S ?? 24:34.31 nfsd: server (nfsd) 0 416 413 0 4 0 1240 716 - S ?? 9:23.71 nfsd: server (nfsd) 0 417 413 0 4 0 1240 716 - S ?? 4:21.56 nfsd: server (nfsd) 0 419 413 0 4 0 1240 716 - I ?? 2:24.04 nfsd: server (nfsd) 0 420 413 0 4 0 1240 716 - I ?? 0:01.46 nfsd: server (nfsd)

Any insights would be greatly appreciated.
We are likely to try and downgrade to 5.5 stable.. 6.X has been nothing but problems to us with regards to NFS.. both on the client and server. _______________________________________________
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"


----
Marc G. Fournier           Hub.Org Networking Services (http://www.hub.org)
Email . [EMAIL PROTECTED]                              MSN . [EMAIL PROTECTED]
Yahoo . yscrappy               Skype: hub.org        ICQ . 7615664
_______________________________________________
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Reply via email to