On Feb 12, 2007, at 7:52 AM, Robert Milkowski wrote:
Hello Roch,
Monday, February 12, 2007, 3:54:30 PM, you wrote:
RP> Duh!.
RP> Long sync (which delays the next sync) are also possible on
RP> a write intensive workloads. Throttling heavy writters, I
RP> think, is the key to fixing this.
Well, then maybe it's not the cause to our problems.
Nevertheless 60-90s for unlink() is just plain wrong especially when
you've got <10ms IOs to array, almost zero writes, plenty of CPU free,
etc.
Definitely something is wrong here.
Looks like spa_sync() via the txg_sync_thread thread is taking way
too long, which is causing new (NFS) requests to be delayed (such as
unlink).
Is this just a NFS server, or is there local activity as well?
A complete threadlist would be interesting, as would memory usage.
Have you increased the load on this machine? I have seen a similar
situation (new requests being blocked waiting for the sync thread to
finish), but that's only been when either 1) the hardware is broken
and taking too long or 2) the server is way overloaded.
eric
_______________________________________________
zfs-discuss mailing list
zfs-discuss@opensolaris.org
http://mail.opensolaris.org/mailman/listinfo/zfs-discuss