On Aug 1, 2006, at 03:43, [EMAIL PROTECTED] wrote:


So what does this exercise leave me thinking? Is Linux 2.4.x really
screwed up in NFS-land? This Solaris NFS replaces a Linux-based NFS
server that the clients (linux and IRIX) liked just fine.


Yes; the Linux NFS server and client work together just fine, but generally
only because the Linux NFS server replies that writes are done before
they are committed to disk (async operation).

The Linux NFS client is not optimized for servers that do not do this,
and it appears to write very little before waiting for the commit replies.

Well .. Linux clients with Linux servers tend to be slightly better behaved,
since the server essentially fudges the commit and the async cluster count is
generally higher (it won't switch on every operation like Solaris will by
default).
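
For reference, that "fudging" is just the async export option on the Linux server; a rough sketch of what the /etc/exports lines look like (the share paths and network are hypothetical):

    # /etc/exports on a Linux NFS server
    # async: reply to writes before data reaches stable storage (fast, but unsafe)
    /export/data   192.168.1.0/24(rw,async,no_subtree_check)
    # sync: wait for stable storage, which is what Solaris does by default
    /export/data2  192.168.1.0/24(rw,sync,no_subtree_check)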

Additionally, there's a VM issue in the page-writeback code that seems to
affect write performance and RPC socket performance when there's a high
dirty page count. Essentially, as pages are flushed there's a higher number
of NFS commit operations, which will tend to slow down the Solaris NFS
server (and probably the txgs or ZIL as well, with the increase in synchronous
behaviour). On the Linux 2.6 VM, the number of commits has been seen to rise
dramatically when the dirty page count is between 40-90% of overall system
memory. By tuning the dirty page ratio (vm.dirty_ratio) back down to 10%,
there's typically less time spent in page-writeback and the overall async
throughput should rise. This wasn't really addressed until 2.6.15 or 2.6.16,
so you might also get better results on a later kernel.

Watching performance between a Linux client and a Linux server, the Linux
server seems to buffer the NFS commit operations. Of course the clients will
also buffer as much as they can, so you can end up with some unbelievable
performance numbers both on the filesystem layers (before you do a sync) and
on the NFS client layers as well (until you unmount/remount).
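
A minimal sketch of that tuning, assuming a 2.6 kernel with the standard vm sysctls (the values are just a starting point):

    # start writeback earlier so less time is spent in bulk page-writeback
    sysctl -w vm.dirty_ratio=10
    sysctl -w vm.dirty_background_ratio=5

    # or persist the settings in /etc/sysctl.conf:
    #   vm.dirty_ratio = 10
    #   vm.dirty_background_ratio = 5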


Overall, I find that the Linux VM suffers from many of the same sorts of
large-memory performance problems that Solaris used to face before priority
paging in 2.6 and the subsequent page coloring schemes. Based on my
unscientific Mac PowerBook performance observations, I suspect that there
could be similar issues with various iterations of the BSD or Darwin kernels,
but I haven't taken the initiative to really study any of this.

So to wrap up:

When doing Linux client / Solaris server NFS, I'll typically tune the client
for 32KB async TCP transfers (you have to dig into the kernel source to go
beyond 32KB, and it's not really worth it), tune the VM to reduce time spent
in the kludgy page-writeback (typically a sysctl setting for the dirty page
ratio, as above), and then increase nfs:nfs3_async_clusters and
nfs:nfs4_async_clusters to something higher than 1 .. say 32 x 32KB transfers
to get you to 1MB. You can also increase the number of threads and the read
ahead on the server to eke out some more performance.
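
Roughly, those settings look like this (the server name, share, and mount point are hypothetical, and the values are only a starting point .. check the Solaris Tunable Parameters manual before setting them):

    # Linux client: 32KB read/write sizes over TCP
    mount -t nfs -o rsize=32768,wsize=32768,tcp solaris-server:/export/data /mnt/data

    # Solaris: /etc/system entries (reboot to apply)
    set nfs:nfs3_async_clusters = 32
    set nfs:nfs4_async_clusters = 32

    # Solaris 10: more NFS server threads via NFSD_SERVERS in /etc/default/nfs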

I'd also look at tuning the volblocksize and recordsize, as well as the
stripe width on your array, to 32K or reasonable multiples .. but I'm not
sure how much of the issue is misaligned I/O block sizes between the various
elements vs mandatory pauses or improper behaviour incurred from
miscommunication.
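
Something along these lines, assuming a hypothetical pool called "tank" (recordsize only affects newly written files, and volblocksize can only be set when the zvol is created):

    # match the ZFS record size to the 32KB NFS transfer size
    zfs set recordsize=32K tank/export/data

    # for a zvol, the block size is fixed at creation time
    zfs create -V 10G -o volblocksize=32K tank/vol1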

---
.je