On 21.08.2013 21:11, Navdeep Parhar wrote:
I'd like to add a last-active timestamp to the structure that tracks the
LRO state in a NIC's rx handler.  This is r254336 in user/np/cxl_tuning
that will be merged to head if there are no objections.

No objections.  This is good thing.

The last time I looked some time back there were a couple of additional
issues with the soft-LRO code.  One was that it would cause packet re-
ordering when a new segment is out-of-order or can't be merged.  There
the new packet would be delivered first and the merged LRO chain after
it.  I think there were a couple of other similar issues.  However I
don't know if these have been fixed in the mean time.

BTW an excellent resource on the correct LRO behavior is this detailed
flow chart.  It probably would be good to audit our soft-LRO against it.
 
http://msdn.microsoft.com/en-us/library/windows/hardware/jj853325%28v=vs.85%29.aspx

--
Andre

http://svnweb.freebsd.org/base?view=revision&revision=254336
http://svnweb.freebsd.org/base/user/np/cxl_tuning/sys/netinet/tcp_lro.c?r1=254336&r2=254335&pathrev=254336
http://svnweb.freebsd.org/base/user/np/cxl_tuning/sys/netinet/tcp_lro.h?r1=254336&r2=254335&pathrev=254336


-----
Add a last-modified timestamp to each LRO entry and provide an interface
to flush all inactive entries.  Drivers decide when to flush and what
the inactivity threshold should be.

Network drivers that process an rx queue to completion can enter a
livelock type situation when the rate at which packets are received
reaches equilibrium with the rate at which the rx thread is processing
them.  When this happens the final LRO flush (normally when the rx
routine is done) does not occur.  Pure ACKs and segments with total
payload < 64K can get stuck in an LRO entry.  Symptoms are that TCP
tx-mostly connections' performance falls off a cliff during heavy,
unrelated rx on the interface.

Flushing only inactive LRO entries works better than any of these
alternates that I tried:
- don't LRO pure ACKs
- flush _all_ LRO entries periodically (every 'x' microseconds or every
   'y' descriptors)
- stop rx processing in the driver periodically and schedule remaining
   work for later.
-----


Regards,
Navdeep
_______________________________________________
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"



_______________________________________________
freebsd-net@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-net
To unsubscribe, send any mail to "freebsd-net-unsubscr...@freebsd.org"

Reply via email to