For the record, in case anyone else experiences this behaviour: I tried 
various things which failed, and finally as a last ditch effort, upgraded my 
freebsd, giving me zpool v14 rather than v13 - and now it's resilvering as it 
should.

Michael

On Monday 17 May 2010 09:26:23 Michael Donaghy wrote:
> Hi,
> 
> I recently moved to a freebsd/zfs system for the sake of data integrity,
>  after losing my data on linux. I've now had my first hard disk failure;
>  the bios refused to even boot with the failed drive (ad18) connected, so I
>  removed it. I have another drive, ad16, which had enough space to replace
>  the failed one, so I partitioned it and attempted to use "zpool replace"
>  to replace the failed partitions for new ones, i.e. "zpool replace tank
>  ad18s1d ad16s4d". This seemed to simply hang, with no processor or disk
>  use; any "zpool status" commands also hung. Eventually I attempted to
>  reboot the system, which also eventually hung; after waiting a while,
>  having no other option, rightly or wrongly, I hard-rebooted. Exactly the
>  same behaviour happened with the other zpool replace.
> 
> Now, my zpool status looks like:
> arcueid ~ $ zpool status
>   pool: tank
>  state: DEGRADED
>  scrub: none requested
> config:
> 
>         NAME           STATE     READ WRITE CKSUM
>         tank           DEGRADED     0     0     0
>           raidz2       DEGRADED     0     0     0
>             ad4s1d     ONLINE       0     0     0
>             ad6s1d     ONLINE       0     0     0
>             ad9s1d     ONLINE       0     0     0
>             ad17s1d    ONLINE       0     0     0
>             replacing  DEGRADED     0     0     0
>               ad18s1d  UNAVAIL      0 9.62K     0  cannot open
>               ad16s4d  ONLINE       0     0     0
>             ad20s1d    ONLINE       0     0     0
>           raidz2       DEGRADED     0     0     0
>             ad4s1e     ONLINE       0     0     0
>             ad6s1e     ONLINE       0     0     0
>             ad17s1e    ONLINE       0     0     0
>             replacing  DEGRADED     0     0     0
>               ad18s1e  UNAVAIL      0 11.2K     0  cannot open
>               ad16s4e  ONLINE       0     0     0
>             ad20s1e    ONLINE       0     0     0
> 
> errors: No known data errors
> 
> It looks like the replace has taken in some sense, but ZFS doesn't seem to
>  be resilvering as it should. Attempting to zpool offline doesn't work:
>  arcueid ~ # zpool offline tank ad18s1d
> cannot offline ad18s1d: no valid replicas
> Attempting to scrub causes a similar hang to before. Data is still readable
> (from the zvol which is the only thing actually on this filesystem),
>  although slowly.
> 
> What should I do to recover this / trigger a proper replace of the failed
> partitions?
> 
> Many thanks,
> Michael
> _______________________________________________
> zfs-discuss mailing list
> zfs-discuss@opensolaris.org
> http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
> 
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to