Ross,
I feel you here, but I don't have much of a solution.
The best I can suggest (and has been my solution) is to take out the
problematic disk, copy it to a fresh disk (preferably using something
like dd_rescue) and then re-install.
It seems the resilvering loop is generally a result of a faulty
device, but even if it is taken offline, you still have issues. I have
had so many zpool resilvering loops, it's not funny. I'm running
2009.06 with all updates applied. I've had a very, very bad batch of
disks.
I actually have a resilvering loop running right now, and I need to go
copy off the offending device. Again.
I wish I had a better solution, because the zpool functions fine, no
data errors, but resilvering loops forever. I love ZFS as an on-disk
format. I increasingly hate the implementation of ZFS software.
-Galen
On Jul 13, 2009, at 5:34 AM, Ross wrote:
Just look at this. I thought all the restarting resilver bugs were
fixed, but it looks like something odd is still happening at the
start:
Status immediately after starting resilver:
# zpool status
pool: rc-pool
state: DEGRADED
status: One or more devices has experienced an unrecoverable error.
An
attempt was made to correct the error. Applications are
unaffected.
action: Determine if the device needs to be replaced, and clear the
errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: http://www.sun.com/msg/ZFS-8000-9P
scrub: resilver in progress for 0h0m, 0.00% done, 57h3m to go
config:
NAME STATE READ WRITE CKSUM
rc-pool DEGRADED 0 0 0
mirror DEGRADED 0 0 0
c4t1d0 ONLINE 0 0 0 5.56M resilvered
replacing DEGRADED 0 0 0
c4t2d0s0/o FAULTED 1.71M 23.3M 0 too many errors
c4t2d0 ONLINE 0 0 0 5.43M resilvered
c5t1d0 ONLINE 0 0 0 5.55M resilvered
And a few minutes later:
# zpool status
pool: rc-pool
state: DEGRADED
status: One or more devices has experienced an unrecoverable error.
An
attempt was made to correct the error. Applications are
unaffected.
action: Determine if the device needs to be replaced, and clear the
errors
using 'zpool clear' or replace the device with 'zpool replace'.
see: http://www.sun.com/msg/ZFS-8000-9P
scrub: resilver in progress for 0h0m, 0.00% done, 245h21m to go
config:
NAME STATE READ WRITE CKSUM
rc-pool DEGRADED 0 0 0
mirror DEGRADED 0 0 0
c4t1d0 ONLINE 0 0 0 1.10M resilvered
replacing DEGRADED 0 0 0
c4t2d0s0/o FAULTED 1.71M 23.3M 0 too many errors
c4t2d0 ONLINE 0 0 0 824K resilvered
c5t1d0 ONLINE 0 0 0 1.10M resilvered
It's gone from 5MB resilvered to 1MB, and increased the estimated
time to 245 hours.
--
This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss