> Hi all,
> 
> I have five front end web servers that all mount their content from the same 
> server via NFS.  If I stress the link on any one of the machines (eg: copy a 
> large directory with a lot of files to/from the mounted file system) the 
> client will pause.  That is, all processes trying to access that mount will 
> freeze.  The log files with hundreds or thousands of nfs server not 
> responding / is alive again messages. After 60 seconds it returns to normal, 
> unless the load is still there in which case it continues to pause.
> 

The 60sec delay suggests that the client is doing a TCP reconnect. I'd suggest 
that you
look at a packet trace in wireshark (it knows how to decode NFS packets) and 
see if
there are new TCP connections (SYN, SYN-ACK,...) being made. If that is what is
happening, I suspect it is NIC driver related, but it is really hard to say.

If you can try a network interface of a different type (not em) that will check 
to
see if it is an em(4) issue.

Alternately, you could try turning off the TSO and checksum offload stuff for 
the
em(4) and see if that helps.

> This has only started happening since I upgraded the client machines to 
> 8.1-STABLE (previously four of them were 8.0 and one was 7.3).  The server is 
> 7.1-RELEASE-p11.  No other changes have taken place in terms of hardware or 
> software or mount options, etc.
> 

There were some client side fixes between 8.0 and 8.1, but I don't think any
of those have caused a regression w.r.t. connections. There is a problem w.r.t.
the nfsd getting in a loop, but that wouldn't recover after 60sec. (If it 
happens,
the server has to be rebooted. There is a fix for this at:
   http://people.freebsd.org/~rmacklem/freebsd8.1-patches/replay.patch
but I don't think it is what you are seeing.)

rick
_______________________________________________
freebsd-stable@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-stable
To unsubscribe, send any mail to "freebsd-stable-unsubscr...@freebsd.org"

Reply via email to