Sorry for my earlier post; I responded prematurely.


On Sep 6, 2009, at 9:15 AM, James Lever <j...@jamver.id.au> wrote:

I’m experiencing occasional slow responsiveness on an OpenSolaris b118 system, typically noticed when running an ‘ls’ (no extra flags, so no directory service lookups). There is a delay of between 2 and 30 seconds, but no correlation has been noticed between load on the server and the slow return. This problem has only been noticed via NFS (v3; we are migrating to NFSv4 once the O_EXCL/mtime bug fix has been integrated - anticipated for snv_124). The problem has been observed both locally on the primary filesystem, in a locally automounted reference (/home/foo) and remotely via NFS.

Have you tried snoop/tcpdump/wireshark on the client side and server side to figure out what is being sent and exactly how long it is taking to get a response?
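On the OpenSolaris side, snoop with delta timestamps is often enough to see where the time goes. Something like this (interface name and client host are placeholders):

  # capture NFS traffic between the server and one client
  snoop -d e1000g0 -o /tmp/nfs.cap host CLIENT port 2049
  # replay the capture with per-packet delta times to spot the long gaps
  snoop -i /tmp/nfs.cap -t d | less

If the gap sits between a GETATTR/READDIR call and its reply, the problem is on the server; if the call itself arrives late, look at the client.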

zpool is RAIDZ2 comprised of 10 * 15kRPM SAS drives behind an LSI 1078 w/ 512MB BBWC exposed as RAID0 LUNs (Dell MD1000 behind PERC 6/E) with 2x SSDs each partitioned as 10GB slog and 36GB remainder as l2arc behind another LSI 1078 w/ 256MB BBWC (Dell R710 server with PERC 6/i).

This config might lead to heavy sync writes (NFS) starving reads, since the whole RAIDZ2 behaves as a single disk on writes. How about two 5-disk RAIDZ2s, or three 4-disk RAIDZs?

Just one or two other vdevs to spread the load can make a world of difference.
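For example, something like this (disk names are placeholders, and it obviously means rebuilding the pool):

  # two 5-disk raidz2 top-level vdevs instead of one 10-disk raidz2
  zpool create tank \
      raidz2 c1t0d0 c1t1d0 c1t2d0 c1t3d0 c1t4d0 \
      raidz2 c1t5d0 c1t6d0 c1t7d0 c1t8d0 c1t9d0

Each top-level vdev gives you roughly one disk's worth of write IOPS, so going from one vdev to two or three roughly doubles or triples what the pool can absorb before reads start queueing.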

The system is configured as an NFS (currently serving NFSv3), iSCSI (COMSTAR) and CIFS server (using the SUN SFW package running Samba 3.0.34), with authentication taking place against a remote OpenLDAP server.

There are a lot of services here, all off one pool? You might be trying to bite off more than the config can chew.

Automount is in use both locally and remotely (Linux clients). Locally, /home/* is remounted from the zpool; remotely, /home and another filesystem (and children) are mounted using autofs. There was some suspicion that automount is the problem, but no definitive evidence as of yet.

Try taking a particularly bad problem station and configuring it with a static mount for a bit, to see if automount really is the culprit.
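On a Linux client that would just mean disabling the autofs map for that path and adding a static entry in /etc/fstab, e.g. (server name and options are only an example):

  server:/zpool/home/foo  /home/foo  nfs  vers=3,hard,intr  0 0

If the stalls disappear with the static mount, automount is worth a closer look; if they don't, you can rule it out.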

The problem has definitely been observed with stats (of some form, typically ‘/usr/bin/ls’ output) both remotely, locally in /home/* and locally in /zpool/home/* (the true source location). There is a clear correlation between recency of reads of the directories in question and reoccurrence of the fault, in that one user has scripted a regular (15m/30m/hourly tests so far) ‘ls’ of the filesystems of interest, and this has reduced the fault to having minimal noted impact since starting down this path (just for themselves).

Sounds like the user is pre-fetching his attribute cache to overcome poor performance.

I have removed the l2arc(s) (cache devices) from the pool and the same behaviour has been observed. My suspicion here was that there was perhaps occasional high synchronous load causing heavy writes to the slog devices, and that when a stat was requested it may have been faulting from ARC to L2ARC prior to going to the primary data store. The slowness has still been reported since removing the extra cache devices.

That doesn't make a lot of sense to me; the L2ARC is a secondary read cache, so if writes are starving reads then the L2ARC would only help here.

Another thought I had was along the lines of filesystem caching and heavy writes causing read blocking. I have no evidence that this is the case, but there have been some suggestions on the list recently about limiting ZFS memory usage for write caching. Can anybody comment on the effectiveness of this? (I have 256MB of write cache in front of the slog SSDs and 512MB in front of the primary storage devices.)

It may just be that the pool configuration can't handle the write IOPS needed and reads are starving.
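Watching the pool while a stall happens would confirm it. Something like (pool name is a placeholder):

  zpool iostat -v tank 1     # per-vdev read/write ops and bandwidth
  iostat -xn 1               # per-device service times (asvc_t) and %b

If the data disks sit near 100% busy on writes while read ops queue up during the slow 'ls', the vdev layout is the first thing I'd revisit.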

My DTrace is very poor, but I’m suspicious that this is the best way to root-cause this problem. If somebody has any code that may assist in debugging this problem and was able to share it, that would be much appreciated.

DTrace would tell you, but I wish the learning curve weren't so steep to get it going.
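As a starting point, something along these lines should work if your build has the nfsv3 DTrace provider (untested sketch; it times every NFSv3 operation server-side and prints latency distributions, in microseconds, every 30 seconds):

  #!/usr/sbin/dtrace -s

  /* stamp the start of each NFSv3 operation */
  nfsv3:::op-*-start
  {
          self->ts = timestamp;
  }

  /* on completion, aggregate latency per operation name */
  nfsv3:::op-*-done
  /self->ts/
  {
          @lat[probename] = quantize((timestamp - self->ts) / 1000);
          self->ts = 0;
  }

  tick-30s
  {
          printa(@lat);
          trunc(@lat);
  }

If the GETATTR/LOOKUP/READDIRPLUS distributions show multi-second outliers, the delay is inside the server rather than the network or the client's attribute cache.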

Any other suggestions for how to identify this fault and work around it would be greatly appreciated.

I hope I gave some good pointers. I'd first look at the pool configuration.

-Ross


_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss
