On Aug 23, 2009, at 9:59 AM, Ross Walker <rswwal...@gmail.com> wrote:
On Aug 23, 2009, at 12:11 AM, Tristan Ball <tristan.b...@leica-microsystems.com> wrote:
Ross Walker wrote:
[snip]
We turned up our X4540s, and this same tar unpack took over 17
minutes! We disabled the ZIL for testing, and we dropped this
to under 1 minute. With the X25-E as a slog, we were able to
run this test in 2-4 minutes, same as the old storage.
That's pretty impressive. So with an X25-E slog, ZFS is as fast
synchronously as your previous hardware was asynchronously, but with
no risk of data corruption.
Of course the hardware is different, so it's not really apples to
apples.
There was a thread not too long ago, either on the xfs mailing
list or the mysql mailing list, that talked about the Intel X25-E and
its onboard cache. The cache ignores flushes, but isn't
persistent on power failure. Pulling the drive during a sync
write caused data corruption. You can disable the write-back
cache on these, but the performance is nowhere near as good with
it disabled.
Here is the blog post:
http://www.mysqlperformanceblog.com/2009/03/02/ssd-xfs-lvm-fsync-write-cache-barrier-and-lost-transactions/
-Ross
Hang on, reading that, his initial results were 50 writes a
second with the default xfs write barriers, which to me implies
that the drive is honouring the cache flush. The fact that the write
rate jumps so significantly when he turns off barriers, but
keeps O_DIRECT and innodb_flush_log_at_trx_commit=1, to me
just says that xfs is returning success on writes as soon as the
data has been handed to the drive, not when the drive has flushed
its cache to make it persistent. Given that we told xfs to turn
off write barriers, isn't it doing what it's told? Why are we
expecting data to be consistent across power loss or device removal?
Couldn't this just be XFS only requesting cache flushes
when barriers are enabled?
I think it's more an illustration that write barriers on Linux need
a little work; even with flushes it should do a lot better than 50
IOPS.
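
For reference, here's roughly the kind of loop that produces those
numbers (my own sketch, not taken from the blog post; the path and
iteration count are placeholders): one 4K write plus an fdatasync per
iteration, run against whatever mount options you want to compare:

#include <fcntl.h>
#include <stdio.h>
#include <string.h>
#include <time.h>
#include <unistd.h>

int main(void)
{
    /* Placeholder path: point it at the filesystem under test. */
    int fd = open("/mnt/test/syncprobe.dat",
                  O_CREAT | O_WRONLY | O_TRUNC, 0644);
    if (fd < 0) { perror("open"); return 1; }

    char buf[4096];
    memset(buf, 'x', sizeof(buf));

    struct timespec start, end;
    clock_gettime(CLOCK_MONOTONIC, &start);

    const int iters = 1000;
    for (int i = 0; i < iters; i++) {
        if (pwrite(fd, buf, sizeof(buf), 0) != (ssize_t)sizeof(buf)) {
            perror("pwrite");
            return 1;
        }
        /* This is the expensive part: it should force a drive cache
         * flush when barriers/flushes are actually honoured. */
        if (fdatasync(fd) != 0) {
            perror("fdatasync");
            return 1;
        }
    }

    clock_gettime(CLOCK_MONOTONIC, &end);
    double secs = (end.tv_sec - start.tv_sec)
                + (end.tv_nsec - start.tv_nsec) / 1e9;
    printf("%d sync writes in %.2fs = %.0f IOPS\n", iters, secs, iters / secs);

    close(fd);
    return 0;
}
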
O_DIRECT does just that: with or without barriers, it flushes on
each write, with an ever-so-slight delay to allow the queue to
coalesce writes.
My bad, O_DIRECT does NOT do that; it just goes direct to the driver,
bypassing the page cache. That allows for low-latency IO and arbitrary
IO sizes for throughput (instead of page-sized IO), but it doesn't
enforce persistence.
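
To make that distinction concrete, here's a rough sketch (mine, with a
placeholder path and sizes): the O_DIRECT write skips the page cache,
but persistence still takes an explicit fdatasync/fsync, which is what
triggers the drive cache flush when barriers are in effect:

#define _GNU_SOURCE             /* for O_DIRECT on Linux */
#include <fcntl.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

int main(void)
{
    /* Placeholder path; O_DIRECT support depends on the filesystem. */
    int fd = open("/mnt/test/direct.dat",
                  O_CREAT | O_WRONLY | O_DIRECT, 0644);
    if (fd < 0) { perror("open"); return 1; }

    /* O_DIRECT needs block-aligned buffers and transfer sizes. */
    void *buf = NULL;
    if (posix_memalign(&buf, 4096, 4096) != 0) {
        fprintf(stderr, "posix_memalign failed\n");
        return 1;
    }
    memset(buf, 'x', 4096);

    /* Bypasses the page cache, but the data may still be sitting in the
     * drive's volatile write cache when this returns. */
    if (pwrite(fd, buf, 4096, 0) != 4096) { perror("pwrite"); return 1; }

    /* Only this asks for the data to be made durable, i.e. this is what
     * triggers the cache flush when barriers/flushes are in effect. */
    if (fdatasync(fd) != 0) { perror("fdatasync"); return 1; }

    free(buf);
    close(fd);
    return 0;
}
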
A barrier is more about enforcing order and persistence when IO is async.
I suspect that since XFS can use an internal or external log like ZFS
does, when a barrier is issued it is issued across all devices in
the file system, because XFS doesn't know about the actual physical
layout the way ZFS does, and that is why the IOPS are so low with XFS
barriers.
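
For what it's worth, the ordering I mean is the classic journaling
pattern, something like this sketch (hypothetical file and record
format): force the data stable before the commit marker is written and
flushed:

#include <fcntl.h>
#include <stdio.h>
#include <string.h>
#include <unistd.h>

/* Write a data record, force it stable, then write the commit marker and
 * force that stable too, so a crash can never leave a commit pointing at
 * data that never reached the disk. */
static int commit_record(int log_fd, const char *data, size_t len)
{
    if (write(log_fd, data, len) != (ssize_t)len)
        return -1;
    if (fdatasync(log_fd) != 0)          /* ordering point: data first...   */
        return -1;
    static const char marker[] = "COMMIT\n";
    if (write(log_fd, marker, sizeof(marker) - 1) != (ssize_t)(sizeof(marker) - 1))
        return -1;
    return fdatasync(log_fd);            /* ...then the commit record.      */
}

int main(void)
{
    /* Placeholder path and record format. */
    int fd = open("/mnt/test/journal.log",
                  O_CREAT | O_WRONLY | O_APPEND, 0644);
    if (fd < 0) { perror("open"); return 1; }

    if (commit_record(fd, "record-1\n", 9) != 0) {
        perror("commit_record");
        return 1;
    }

    close(fd);
    return 0;
}
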
-Ross