> On Wed, 24 Jun 2009, Lejun Zhu wrote:
> 
> > There is a bug in the database about reads blocked
> by writes which may be related:
> >
> >
> http://bugs.opensolaris.org/view_bug.do?bug_id=6471212
> >
> > The symptom is sometimes reducing queue depth makes
> read perform better.
> 
> I have been banging away at this issue without
> resolution.  Based on 
> Roch Bourbonnais's blog description of the ZFS write
> throttle code, it 
> seems that I am facing a perfect storm.  Both the
> storage write 
> bandwidth (800+ MB/second) and the memory size of my
> system (20 GB) 
> result in the algorithm batching up 2.5 GB of user
> data to write. 

With ZFS write throttle, the number 2.5GB is tunable. From what I've read in 
the code, it is possible to e.g. set zfs:zfs_write_limit_override = 0x8000000 
(bytes) to make it write 128M instead.

> Since I am using mirrors, this results in 5 GB of
> data being written 
> at full speed to the array on a very precise schedule
> since my 
> application is processing fixed-sized files with a
> fixed algorithm. 
> The huge writes lead to at least 3 seconds of read
> starvation, 
> resulting in a stalled application and a square-wave
> of system CPU 
> utilization.  I could attempt to modify my
> application to read ahead 
> by 3 seconds but that would require gigabytes of
> memory, lots of 
> complexity, and would not be efficient.
> 
> Richard Elling thinks that my array is pokey, but
> based on write speed 
> and memory size, ZFS is always going to be batching
> up data to fill 
> the write channel for 5 seconds so it does not really
> matter how fast 
> that write channel is.  If I had 32GB of RAM and 2X
> the write speed, 
> the situation would be identical.
> 
> Hopefully someone at Sun is indeed working this read
> starvation issue 
> and it will be resolved soon.
> 
> Bob
> --
> Bob Friesenhahn
> bfrie...@simple.dallas.tx.us,
> http://www.simplesystems.org/users/bfriesen/
> GraphicsMagick Maintainer,
>    http://www.GraphicsMagick.org/
> ____________________________________________
> zfs-discuss mailing list
> zfs-discuss@opensolaris.org
> http://mail.opensolaris.org/mailman/listinfo/zfs-discu
> ss
-- 
This message posted from opensolaris.org
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss

Reply via email to