On Tue, Nov 20, 2007 at 11:39:30AM -0600, Albert Chin wrote: > On Tue, Nov 20, 2007 at 11:10:20AM -0600, [EMAIL PROTECTED] wrote: > > > > [EMAIL PROTECTED] wrote on 11/20/2007 10:11:50 AM: > > > > > On Tue, Nov 20, 2007 at 10:01:49AM -0600, [EMAIL PROTECTED] wrote: > > > > Resilver and scrub are broken and restart when a snapshot is created > > > > -- the current workaround is to disable snaps while resilvering, > > > > the ZFS team is working on the issue for a long term fix. > > > > > > But, no snapshot was taken. If so, zpool history would have shown > > > this. So, in short, _no_ ZFS operations are going on during the > > > resilvering. Yet, it is restarting. > > > > > > > Does 2007-11-20.02:37:13 actually match the expected timestamp of > > the original zpool replace command before the first zpool status > > output listed below? > > No. We ran some 'zpool status' commands after the last 'zpool > replace'. The 'zpool status' output in the initial email is from this > morning. The only ZFS command we've been running is 'zfs list', 'zpool > list tww', 'zpool status', or 'zpool status -v' after the last 'zpool > replace'.
I think the 'zpool status' command was resetting the resilvering. We upgraded to b77 this morning which did not exhibit this problem. Resilvering is now done. > Server is on GMT time. > > > Is it possible that another zpool replace is further up on your > > pool history (ie it was rerun by an admin or automatically from some > > service)? > > Yes, but a zpool replace for the same bad disk: > 2007-11-20.00:57:40 zpool replace tww c0t600A0B80002999660000059E4668CBD3d0 > c0t600A0B8000299966000006584741C7C3d0 > 2007-11-20.02:35:22 zpool detach tww c0t600A0B8000299966000006584741C7C3d0 > 2007-11-20.02:37:13 zpool replace tww c0t600A0B80002999660000059E4668CBD3d0 > c0t600A0B8000299CCC000006734741CD4Ed0 > > We accidentally removed c0t600A0B8000299966000006584741C7C3d0 from the > array, hence the 'zpool detach'. > > The last 'zpool replace' has been running for 15h now. > > > -Wade > > > > > > > > > > > > [EMAIL PROTECTED] wrote on 11/20/2007 09:58:19 AM: > > > > > > > > > On b66: > > > > > # zpool replace tww c0t600A0B80002999660000059E4668CBD3d0 \ > > > > > c0t600A0B8000299CCC000006734741CD4Ed0 > > > > > < some hours later> > > > > > # zpool status tww > > > > > pool: tww > > > > > state: DEGRADED > > > > > status: One or more devices is currently being resilvered. The > > pool > > > > will > > > > > continue to function, possibly in a degraded state. > > > > > action: Wait for the resilver to complete. > > > > > scrub: resilver in progress, 62.90% done, 4h26m to go > > > > > < some hours later> > > > > > # zpool status tww > > > > > pool: tww > > > > > state: DEGRADED > > > > > status: One or more devices is currently being resilvered. The > > pool > > > > will > > > > > continue to function, possibly in a degraded state. > > > > > action: Wait for the resilver to complete. > > > > > scrub: resilver in progress, 3.85% done, 18h49m to go > > > > > > > > > > # zpool history tww | tail -1 > > > > > 2007-11-20.02:37:13 zpool replace tww > > > > c0t600A0B80002999660000059E4668CBD3d0 > > > > > c0t600A0B8000299CCC000006734741CD4Ed0 > > > > > > > > > > So, why did resilvering restart when no zfs operations occurred? I > > > > > just ran zpool status again and now I get: > > > > > # zpool status tww > > > > > pool: tww > > > > > state: DEGRADED > > > > > status: One or more devices is currently being resilvered. The > > pool > > > > will > > > > > continue to function, possibly in a degraded state. > > > > > action: Wait for the resilver to complete. > > > > > scrub: resilver in progress, 0.00% done, 134h45m to go > > > > > > > > > > What's going on? > > > > > > > > > > -- > > > > > albert chin ([EMAIL PROTECTED]) > > > > > _______________________________________________ > > > > > zfs-discuss mailing list > > > > > zfs-discuss@opensolaris.org > > > > > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss > > > > > > > > _______________________________________________ > > > > zfs-discuss mailing list > > > > zfs-discuss@opensolaris.org > > > > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss > > > > > > > > > > > > > > -- > > > albert chin ([EMAIL PROTECTED]) > > > _______________________________________________ > > > zfs-discuss mailing list > > > zfs-discuss@opensolaris.org > > > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss > > > > _______________________________________________ > > zfs-discuss mailing list > > zfs-discuss@opensolaris.org > > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss > > > > > > -- > albert chin ([EMAIL PROTECTED]) > _______________________________________________ > zfs-discuss mailing list > zfs-discuss@opensolaris.org > http://mail.opensolaris.org/mailman/listinfo/zfs-discuss > > -- albert chin ([EMAIL PROTECTED]) _______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss