On Fri, 3 Jul 2009, Victor Latushkin wrote:

> On 02.07.09 22:05, Bob Friesenhahn wrote:
>> On Thu, 2 Jul 2009, Zhu, Lejun wrote:
>>>
>>> Actually it seems to be 3/4:
>>
>> 3/4 is an awful lot. That would be 15 GB on my system, which explains
>> why the "5 seconds to write" rule is dominant.
>
> 3/4 is 1/8 * 6, where 6 is the worst-case inflation factor (for raid-z2
> it is actually 9, and considering a ganged 1k block on raid-z2 in a
> really bad case it should be even bigger than that). The DSL inflates
> write sizes too, so inflated write sizes are compared against an
> inflated limit, and that should be fine.
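For concreteness, the arithmetic can be sketched as follows. This is a rough illustration only: the 1/8-of-physmem base and the worst-case inflation factor of 6 are the figures quoted in this thread, not values read out of the ZFS source.

```python
# Rough sketch of the inflated write-limit arithmetic discussed above.
# The 1/8-of-physmem base and the worst-case inflation factor of 6 are
# the figures quoted in this thread, not values from the ZFS source.

def inflated_write_limit(physmem_bytes, inflation_factor=6):
    base_limit = physmem_bytes // 8          # 1/8 of physical memory
    return base_limit * inflation_factor     # compared against inflated write sizes

GIB = 1024 ** 3
physmem = 20 * GIB                           # e.g. a 20 GB system, as in Bob's case
limit = inflated_write_limit(physmem)
print(limit / GIB)                           # 6/8 = 3/4 of physmem -> 15.0 GiB
```

So on a 20 GB machine the inflated limit works out to 3/4 of physical memory, matching the 15 GB figure above.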

But blocking read I/O for several seconds is not so fine. There are various amounts of buffering and caching in the write pipeline, which suggests that a certain amount of write data is handled efficiently by it. Once the buffers and caches fill, and the disks are maximally busy with write I/O, there is no opportunity to do a read from the same disks for several seconds (up to five). When a TXG is written, the system writes just as fast and hard as it can (for up to five seconds) without considering other requirements.

ZFS's asynchronous write caching is speculative: it hopes that the application will update the just-written data several times, so that only the final version needs to be written, saving disk I/O and precious IOPS. Unfortunately, not all applications work that way.

Bob
--
Bob Friesenhahn
bfrie...@simple.dallas.tx.us, http://www.simplesystems.org/users/bfriesen/
GraphicsMagick Maintainer,    http://www.GraphicsMagick.org/
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
