Geoff, I've hit this problem several times in the past, with OpenSolaris and then with OpenIndiana.
There are, to my knowledge, no available solutions, it is so by design! If a disk stops responding the pool waits until after it responds again (sometimes pulling it out of its slot and then reinserting the disk causes a reset of the link and it starts working again). I was not able to assess what happens if I set failmode to continue. I think it could be no better since you still cannot write to the pool. This is IMHO the biggest problem of ZFS, in that I cannot instruct it to stop using a failed device if it has some level of redundancy still available. Wait is OK only if an entire vdev stops responding, not if a disk in a vdev with redundancy has problems either fatal or transitory. Best regards. Maurilio. PS. Using server grade disks (those with TLER) makes it possibile to overcome this problem for transitory errors. Geoff Nordli wrote: > Part of my concern is why one disk would have completely brought down > the system. I have seen this come up on the list before, but I don't > remember any resolutions to fixing it. > > Anyone have any clues to try to prevent this from happening in the future? > > thanks, > > Geoff > -- __________ | | | |__| Maurilio Longo |_|_|_|____| farmaconsult s.r.l. _______________________________________________ OpenIndiana-discuss mailing list OpenIndiana-discuss@openindiana.org http://openindiana.org/mailman/listinfo/openindiana-discuss