Thanks for the reply Stefan. That looks to be exactly the same bug I'm dealing with here. The one difference is that I can reproduce it easily on test hardware. So I have two separate boxes that can go down(and run test code) without interfering with anything in production.
My test setup has a local torrent server, and 5 other machines on the network seeding a very large(10GB) random data file that I created. When the guest tries to download this file from the other servers this aggravates the issue. To the point that I could cause it to fail within minutes, repeatedly. I created this test after someone on irc said that the issue could have something to do with udp traffic rather than tcp traffic. I am running latest ubuntu 10.04 on the hosts and the guests, and I can wipe(and reinstall) the hosts quickly and easily. On 06/28/2010 01:28 AM, Stefan Hajnoczi wrote: > On Mon, Jun 28, 2010 at 9:08 AM, William King <quentus...@gmail.com> wrote: >> Is this the wrong list to get help tracking down why the network stalls >> under heavy load? > > This list is the right one. There is at least one known network stall bug: > > http://sourceforge.net/tracker/?func=detail&atid=893831&aid=2506814&group_id=180599 > http://www.mail-archive.com/k...@vger.kernel.org/msg06774.html > > I sounds like you have the same issue? > > Stefan