On Tue, May 10, 2011 at 03:57:28PM -0700, Brandon High wrote: > On Tue, May 10, 2011 at 9:18 AM, Ray Van Dolson <rvandol...@esri.com> wrote: > > My question is -- is there a way to tune the MPT driver or even ZFS > > itself to be more/less aggressive on what it sees as a "failure" > > scenario? > > You didn't mention what drives you had attached, but I'm guessing they > were normal "desktop" drives. > > I suspect (but can't confirm) that using enterprise drives with TLER / > ERC / CCTL would have reported the failure up the stack faster than a > consumer drive. The drives will report an error after 7 seconds rather > than retry for several minutes. > > You may be able to enable the feature on your drives, depending on the > manufacturer and firmware revision. > > -B
Yup, shoulda included that. These are regular SATA drives -- supposedly "Enterprise" whatever that gives us (most likely a higher MTBF number). We'll probably look at going with nearline SAS drives (only increases cost slightly) and write a small SEC rule on our syslog server to watch for 0x31111000 errors on servers with SATA disks only so we can at least be alerted more quickly. Ray _______________________________________________ zfs-discuss mailing list zfs-discuss@opensolaris.org http://mail.opensolaris.org/mailman/listinfo/zfs-discuss