So it does appear that it is zpool that hangs, possibly during 
resilvering (we lost a HDD at midnight, this what was started all this).

After boot:

x4500-02:~# zpool status -x
   pool: zpool1
  state: DEGRADED
status: One or more devices is currently being resilvered.  The pool will
         continue to function, possibly in a degraded state.
action: Wait for the resilver to complete.
  scrub: resilver in progress, 11.10% done, 2h11m to go
config:

         NAME              STATE     READ WRITE CKSUM
         zpool1            DEGRADED     0     0     0
           raidz1          ONLINE       0     0     0
[snip]
             c7t3d0        ONLINE       0     0     0
             replacing     UNAVAIL      0     0     0  insufficient replicas
               c8t3d0s0/o  UNAVAIL      0     0     0  cannot open
               c8t3d0      UNAVAIL      0     0     0  cannot open
           raidz1          ONLINE       0     0     0


You can run zpool for about 4-5 minutes, then they start to hang. For 
example, I tried to issue;

# zpool offline zpool1 c8t3d0

.. and the system stops z-responding.

# mdb -k

::ps!grep pool

R    732    722    732    662      0 0x4a004000 ffffffffb92a8030 zpool

 > ffffffffb92a8030::walk thread|::findstack -v
stack pointer for thread fffffe85285d07e0: fffffe800283fc40
[ fffffe800283fc40 _resume_from_idle+0xf8() ]
   fffffe800283fc70 swtch+0x12a()
   fffffe800283fc90 cv_wait+0x68()
   fffffe800283fcc0 spa_config_enter+0x50()
   fffffe800283fce0 spa_vdev_enter+0x2a()
   fffffe800283fd10 vdev_offline+0x29()
   fffffe800283fd40 zfs_ioc_vdev_offline+0x58()
   fffffe800283fd80 zfsdev_ioctl+0x13e()
   fffffe800283fd90 cdev_ioctl+0x1d()
   fffffe800283fdb0 spec_ioctl+0x50()
   fffffe800283fde0 fop_ioctl+0x25()
   fffffe800283fec0 ioctl+0xac()
   fffffe800283ff10 sys_syscall32+0x101()


Similarly, nfs:

 > ::ps!grep nfsd
R    548      1    548    548      1 0x42000000 ffffffffb92ad6d0 nfsd
 > ffffffffb92ad6d0::walk thread|::findstack -v
stack pointer for thread ffffffff9af8e540: fffffe8001046cc0
[ fffffe8001046cc0 _resume_from_idle+0xf8() ]
   fffffe8001046cf0 swtch+0x12a()
   fffffe8001046d40 cv_wait_sig_swap_core+0x177()
   fffffe8001046d50 cv_wait_sig_swap+0xb()
   fffffe8001046da0 cv_waituntil_sig+0xd7()
   fffffe8001046e50 poll_common+0x420()
   fffffe8001046ec0 pollsys+0xbe()
   fffffe8001046f10 sys_syscall32+0x101()





-- 
Jorgen Lundman       | <[EMAIL PROTECTED]>
Unix Administrator   | +81 (0)3 -5456-2687 ext 1017 (work)
Shibuya-ku, Tokyo    | +81 (0)90-5578-8500          (cell)
Japan                | +81 (0)3 -3375-1767          (home)
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to