Re: [zfs-discuss] Disk failure chokes all the disks attached to the failing disk HBA

Richard Elling Fri, 01 Jun 2012 08:12:48 -0700

On Jun 1, 2012, at 4:54 AM, Edward Ned Harvey wrote:

>> From: zfs-discuss-boun...@opensolaris.org [mailto:zfs-discuss-
>> boun...@opensolaris.org] On Behalf Of Richard Elling
>> 
>>> From: zfs-discuss-boun...@opensolaris.org [mailto:zfs-discuss-
>>> boun...@opensolaris.org] On Behalf Of "Antonio S. Cofiño"
>>> 
>>> My question is, there is anyway to anticipate this "choking" situation
> when a
>>> disk is failing, to avoid the general failure?
>> 
>> No.
> 
> Yes.


:-)

> But not necessarily using the setup that you are currently using - that is
> not quite clear from your original email.
> 
> If you have 4 HBA's, you want to arrange your raid such that you could
> survive the complete loss of the entire HBA.  This would mean you build your
> pool out of a bunch of 4-disk raidz vdev's, or perhaps a bunch of 8-disk
> raidz2 vdev's.
> 
> The whole problem you're facing is that some bad disk brings down the whole
> bus with it...  Make your redundancy able to survive the loss of a bus.

We've had luck eliminating expanders from the design, too :-)

But this is also one of those cases where the failure results in a "wounded
soldier" case -- not dead, but not able to keep fighting effectively. The result
is a massive slowdown of the system that can be best described as a DoS
condition.
 -- richard

--
ZFS Performance and Training
richard.ell...@richardelling.com
+1-760-896-4422

_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Re: [zfs-discuss] Disk failure chokes all the disks attached to the failing disk HBA

Reply via email to