On Oct 23, 2009, at 5:32 PM, Tim Cook wrote:
On Fri, Oct 23, 2009 at 7:17 PM, Richard Elling <richard.ell...@gmail.com> wrote:

Tim has a valid point. By default, ZFS will queue 35 commands per disk. For 46 disks that is 1,610 concurrent I/Os. Historically, it has proven to be relatively easy to crater performance or cause problems with very, very, very expensive arrays that are easily overrun by Solaris. As a result, it is not uncommon to see references to setting throttles, especially in older docs.
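If you want to confirm what the host is actually doing, the current per-vdev
limit can be read from the live kernel with mdb (a quick check, assuming an
OpenSolaris/Solaris build where the zfs_vdev_max_pending tunable exists):

   # print the current per-vdev queue depth in decimal
   echo zfs_vdev_max_pending/D | mdb -k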

Fortunately, this is simple to test by reducing the number of I/Os ZFS
will queue.  See the Evil Tuning Guide
http://www.solarisinternals.com/wiki/index.php/ZFS_Evil_Tuning_Guide#Device_I.2FO_Queue_Size_.28I.2FO_Concurrency.29
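For example, to try a per-vdev queue depth of 10 (the value here is only a
test setting; the /etc/system entry takes effect at the next boot, while the
mdb form changes the running kernel):

   # /etc/system -- persistent across reboots
   set zfs:zfs_vdev_max_pending = 10

   # or, on the fly
   echo zfs_vdev_max_pending/W0t10 | mdb -kw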

The mpt source is not open, so the mpt driver's reaction to 1,610 concurrent
I/Os can only be guessed from afar -- public LSI docs mention a figure of 511
concurrent I/Os for the SAS1068, but it is not clear to me that this is an
explicit limit. If you have success with zfs_vdev_max_pending set to 10, then
the mystery might be solved. Use iostat to observe the wait and actv columns,
which show the number of transactions in the queues.  JCMP?
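For example (-x for extended statistics, -n for descriptive device names,
-z to suppress idle devices):

   iostat -xnz 1

wait is the number of I/Os queued in the host waiting for the device, and
actv is the number the device has accepted but not yet completed; with
zfs_vdev_max_pending at 10 you would expect actv to sit at or below roughly
10 per disk.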

NB: sometimes a driver's limit is configurable. For example, to get high
performance out of a high-end array attached to a qlc card, I've set
execution-throttle in /kernel/drv/qlc.conf to more than two orders of
magnitude above its default of 32. /kernel/drv/mpt*.conf does not seem to
have a similar throttle.
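For illustration only, the entry is a standard driver.conf name=value pair;
the number below is just an example of "a couple of orders of magnitude above
32", not a recommendation:

   # /kernel/drv/qlc.conf
   execution-throttle=4096;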
 -- richard



I believe there's a caveat here, though: that really only helps if the total I/O load is something the controller can actually handle. If the sustained workload is still 1,600 concurrent I/Os, lowering the batch size won't actually make any difference to the timeouts, will it? It would obviously eliminate burstiness (yes, I made that word up), but if the total sustained I/O load is greater than the ASIC can handle, it's still going to fall over and die with a queue of 10, correct?

Yes, but since they are disks, and I'm assuming HDDs here, there is no
chance the disks will be faster than the host's ability to send I/Os ;-)
iostat will show what the queues look like.
 -- richard
