[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-18 Thread Launchpad Bug Tracker
This bug was fixed in the package linux - 2.6.35-28.49 --- linux (2.6.35-28.49) maverick-proposed; urgency=low [ Brad Figg ] * Release Tracking Bug - LP: #726796 [ Colin Ian King ] * SAUCE: Dell All-In-One: Remove need for Dell module alias [ Manoj Iyer ] * SAUCE:

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-17 Thread nutznboltz
Never mind, I found it here: https://wiki.ubuntu.com/KernelTeam/KernelUpdates ``For normal 18-month releases, we will only accept updates to the kernel for 3-4 months after release. At this point we consider the in-development release to be stable enough for testing, and the primary target for fi

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-17 Thread nutznboltz
What really bothers me is that I can't find anything in writing about why this patch did not make it into Karmic. If someone would be so kind as to point out a link that explains why this patch did not make it into Karmic I would be grateful. -- You received this bug notification because you are

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-17 Thread Launchpad Bug Tracker
This bug was fixed in the package linux - 2.6.32-30.59 --- linux (2.6.32-30.59) lucid-proposed; urgency=low [ Steve Conklin ] * Release Tracking Bug - LP: #727336 [ Tim Gardner ] * [Config] CONFIG_IRQ_TIME_ACCOUNTING=n - LP: #723819 [ Upstream Kernel Changes ]

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-09 Thread nutznboltz
Lucid proposed kernel with virtio-net napi patch passed all of the QA Team's regression testing https://wiki.ubuntu.com/QATeam/KernelSRU-lucid-2.6.32-30.59 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bug

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-08 Thread nutznboltz
@AvaCam Note also that the file system buffer cache uses up RAM so activity that reads in many files while performing network I/O triggers this bug. That is why my test case of running two concurrent recursive "scp -r ..." jobs causes an unpatched virtio-net driver to lock up. -- You received th

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-08 Thread nutznboltz
@AvaCam use enough RAM until you start getting "page allocation failure." messages in the system logs. What is happening is that the network driver needs to have a free page of RAM. It also cannot wait around for a page to become free. It can however, try again later. So if there are no free pa

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-08 Thread AvaCam
I've installed the proposed kernel onto both a 10.10 and 10.04 VMs. I've just set their MTU's to 9000. What would be a good way to stress test them? -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/57927

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-08 Thread nutznboltz
@Divinsa did you test proposed? https://wiki.ubuntu.com/Testing/EnableProposed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocation failure

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-07 Thread Divinsa Development
Would love to see a fix for this as well - running over 10 10.04 instances on ec2 and having crashes + hangs often with this bug. Finding that MTU increase to 9000 speeds up time to failure. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubu

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-04 Thread nutznboltz
Well, I rebuilt Dubnium as Maverick and ran the test: $ w 15:12:16 up 1:25, 3 users, load average: 1.94, 1.89, 1.90 USER TTY FROM LOGIN@ IDLE JCPU PCPU WHAT nutz pts/3140.147.245.89:S 13:511:20m 17:04 15:53 /usr/bin/ssh -x -oForwardAgent no -oPermitLo

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-04 Thread nutznboltz
In bash sudo -i cat >> /etc/apt/preferences << EOF Package: * Pin: release a=maverick-security Pin-Priority: 990 Package: * Pin: release a=maverick-updates Pin-Priority: 900 Package: * Pin: release a=maverick-proposed Pin-Priority: 400 EOF echo "deb http://archive.ubuntu.com/ubuntu/ maverick-pr

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-04 Thread nutznboltz
16 people (17 including me) checked the "affects me too" button on this bug report? Will any of them test proposed on Maverick? The instructions are right here: https://wiki.ubuntu.com/Testing/EnableProposed -- You received this bug notification because you are a member of Ubuntu Bugs, which is

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-03 Thread nutznboltz
I don't use Maverick in my environment and I still use Karmic. I don't know who you are going to get to do the Maverick testing but it ain't me. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 T

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-03 Thread nutznboltz
110 GB copied in three hours with no problems. ** Tags added: verification-done-lucid ** Tags removed: verification-needed-lucid -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost ne

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-03 Thread nutznboltz
Will the patch be added to 2.6.31? Meanwhile $ uname -a Linux dubnium 2.6.32-30-server #59-Ubuntu SMP Tue Mar 1 22:46:09 UTC 2011 x86_64 GNU/Linux 7765 7764 1 17:14 pts/300:01:33 scp -r /vol/ndnp/ndnp_staging/batches/kyu oxygen:/storage/scratch/virtio-net-test/2 x 77

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-03 Thread Steve Conklin
This bug is awaiting verification that the kernel in -proposed solves the problem. Please test the kernel and update this bug with the results. If the problem is solved, change the tag 'verification- needed-' to 'verification-done-'. If verification is not done by one week from today, this fix wil

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-03 Thread nutznboltz
** Patch added: "debdiff of virtio-net napi patch for Karmic" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/579276/+attachment/1883450/+files/karmic.debdiff -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launch

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-03 Thread Launchpad Bug Tracker
** Branch linked: lp:ubuntu/lucid-proposed/linux-ec2 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocation failure -- ubuntu-bugs mailing

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-03 Thread Brad Figg
** Tags added: verification-needed-lucid verification-needed-maverick -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocation failure -- ubu

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-02 Thread nutznboltz
Patch is now in Lucid https://launchpad.net/ubuntu/lucid/+source/linux/2.6.32-30.59 -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocation fa

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-02 Thread nutznboltz
I had my first Karmic KVM guest encounter this issue today. I'm going to add support for Karmic to my PPA. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virti

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-03-01 Thread nutznboltz
The 2.6.32-29.58 kernel update that recently was pushed out lacks the virtio-net napi patch. I updated my PPA against the updated kernel. https://launchpad.net/~nutznboltz/+archive/lucid-virtio-napi The updated PPA is still compiling; ETA Mar 2, 2011 02:40:00 UTC To test with my PPA run: sudo ap

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-24 Thread nutznboltz
Yesterday a new ticket and branch stable-v2.6.32.29 was created with this patch https://bugs.launchpad.net/ubuntu/lucid/+source/linux/+bug/723819 http://kernel.ubuntu.com/git?p=rtg/ubuntu- lucid.git;a=blob;f=drivers/net/virtio_net.c;h=fb09effbfb63f5e080a87bfc80a823f83c363810;hb=refs/heads/stable-

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-23 Thread nutznboltz
This patch is in 2.6.37.1 as of Thu, 10 Feb 2011 19:03:31 + -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocation failure -- ubuntu-bu

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-16 Thread Launchpad Bug Tracker
This bug was fixed in the package linux - 2.6.38-4.31 --- linux (2.6.38-4.31) natty; urgency=low [ Andy Whitcroft ] * add in bugs closed by upstream patches pulled in by rebases * rebase to 795abaf1e4e188c4171e3cd3dbb11a9fcacaf505 * [Config] enable CONFIG_VSX to allow use of

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-16 Thread nutznboltz
Actually, it's not in proposed yet. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocation failure -- ubuntu-bugs mailing list ubuntu-bugs@

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-16 Thread nutznboltz
In proposed now: http://www.ubuntuupdates.org/packages/show/199704 Please test if you can: https://wiki.ubuntu.com/Testing/EnableProposed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title:

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-16 Thread Andy Whitcroft
This is now Fix Committed for Natty as we have just rebased to v2.6.38-rc5 mainline which contains this fix. ** Changed in: linux (Ubuntu) Status: Triaged => Fix Committed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https:/

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-15 Thread nutznboltz
Tested Ok on my VM: $ uname -a Linux dubnium 2.6.32-29-server #58pre201102150902-Ubuntu SMP Tue Feb 15 10:16:07 UTC 2011 x86_64 GNU/Linux -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title:

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-15 Thread nutznboltz
Patch is now in pre-proposed. https://launchpad.net/~kernel-ppa/+archive/pre-proposed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocation

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-11 Thread Andy Whitcroft
This is now officially in linus' tree but not yet tagged. Will be in the next Natty upload. ** Changed in: linux (Ubuntu) Assignee: (unassigned) => Andy Whitcroft (apw) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bug

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-11 Thread Stefan Bader
** Changed in: linux (Ubuntu Lucid) Status: In Progress => Fix Committed ** Changed in: linux (Ubuntu Maverick) Status: In Progress => Fix Committed -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-11 Thread Stefan Bader
** Also affects: linux (Ubuntu Lucid) Importance: Undecided Status: New ** Also affects: linux (Ubuntu Maverick) Importance: Undecided Status: New ** Changed in: linux (Ubuntu Lucid) Importance: Undecided => Medium ** Changed in: linux (Ubuntu Lucid) Status: New =>

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-10 Thread nutznboltz
$ git show-branch [master] Merge branch 'usb-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/gregkh/usb-2.6 $ git log drivers/net/virtio_net.c commit 3e9d08ec0a68f6faf718d5a7e050fe5ca0ba004f Author: Bruce Rogers Date: Thu Feb 10 11:03:31 2011 -0800 virtio_net: Add schedule check

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-10 Thread nutznboltz
http://www.spinics.net/lists/linux-virtualization/msg12364.html http://www.spinics.net/lists/linux-virtualization/msg12365.html -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost netwo

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-10 Thread nutznboltz
I started following @Linux_Kernel http://twitter.com/Linux_Kernel -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocation failure -- ubuntu-

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-10 Thread Stefan Bader
Having Rusty pick it up should bring it (usually first to linux-next) to Linus tree. As soon as it hits there we can go on with adding it to 10.04 and 10.10. Sorry about the procedure being somewhat tedious, but this makes sure that relevant maintainers have looked at the change and it is being int

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-09 Thread nutznboltz
http://goo.gl/FQqS0 https://lists.linux-foundation.org/pipermail/virtualization/2011-February/016321.html https://lists.linux-foundation.org/pipermail/virtualization/2011-February/016322.html https://lists.linux-foundation.org/pipermail/virtualization/2011-February/016323.html But still not in 2.6

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-09 Thread nutznboltz
https://lists.linux- foundation.org/pipermail/virtualization/2011-February/016320.html -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocation

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-09 Thread nutznboltz
At this point a representative from the Ubuntu kernel team thanked me for my work in driving this however no evidence exists that the patch has made it into the upstream kernel yet. https://lists.ubuntu.com/archives/kernel-team/2011-February/014433.html -- You received this bug notification bec

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-09 Thread nutznboltz
If there is a problem fill out complaint form and place it in an envelope addressed to... http://www.youtube.com/watch?v=gEyFH-a-XoQ#t=1m -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title:

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-09 Thread Peter Lieven
I have not tested with NFS, but my newsserver test was also reliably crashing without the NAPI patch. I have seen Bruce's response. Will he take care of this patch going upstream? -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https:

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-09 Thread nutznboltz
The patched 2.6.38 kernel is running and has not crashed while copying data overnight. $ uname -a Linux dubnium 2.6.38-2-server #29~lucid3-Ubuntu SMP Tue Feb 8 21:49:57 UTC 2011 x86_64 GNU/Linux $ date;ps -eO lstart | grep "scp -r" | egrep -v 'grep|ssh' Wed Feb 9 05:31:07 EST 2011 1035 Tue Feb

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-08 Thread nutznboltz
The attached debdiff contains my modifications to that with my updated version of the patch from Bruce Rogers of Novell. I had to modify the patch a bit to make it work with 2.6.38 which is what Natty is based on. I used the Ubuntu Kernel Team Daily Build PPA (which isn't really updated daily) as

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-08 Thread nutznboltz
Bruce Rogers of Novell replied to my E-mail saying that the patch should have been accepted upstream and it was an oversight. https://lists.ubuntu.com/archives/kernel-team/2011-February/014428.html -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-08 Thread Sergey Svishchev
I've seen this happen on servers that run java webapps; it seems that high java heap usage (especially when heap size is close to physical memory size) helps trigger one of aforementioned bugs. Unfortunately, I don't have a simple test case. -- You received this bug notification because you are

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-08 Thread nutznboltz
Peter, does your news server use NFS? -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocation failure -- ubuntu-bugs mailing list ubuntu-bug

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-08 Thread nutznboltz
You can test that too. First install the tools apt-get install dpkg-dev python-software-properties After the tools are installed run apt-add-repository ppa:kernel-ppa/ppa" and then edit: /etc/apt/sources.list.d/kernel-ppa-ppa-lucid.list Copy this line deb http://ppa.launchpad.net/kernel-ppa/ppa/u

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-07 Thread Peter Lieven
Can you patch the natty kernel with the napi patch to be absolutely sure? You might also use netcat to transfer files between your boxes or use iperf. This might reduce the time to crash. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-07 Thread nutznboltz
I have reason to believe that the absence of this patch in upstream kernels is a critical oversight. I used "apt-add-repository ppa:kernel-ppa/ppa" to put the "Natty" kernel on my Lucid test VM $ uname -a Linux dubnium 2.6.38-2-server #29~lucid1-Ubuntu SMP Mon Feb 7 15:09:10 UTC 2011 x86_64 GNU/

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-07 Thread nutznboltz
I already sent an E-mail to Bruce Rogers of Novell asking about why his patch didn't get into the upstream kernel, see: https://lists.ubuntu.com/archives/kernel-team/2011-February/014414.html You are welcome to try reaching out to anyone who might have the answer as to why this patch never made i

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-07 Thread Peter Lieven
I tested with kernels that include both of these patches, but they still crashed. I also think that the both patches you mentioned have been backported to Ubuntu LTS. How should we proceed? Contact the virtio developers and the developers from Suse why this patch never went upstream? -- You r

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-07 Thread nutznboltz
This patch was posted via E-mail on June 3, 2010  * [PATCH] KVM: add schedule check to napi_enable call - http://kerneltrap.org/mailarchive/linux-netdev/2010/6/4/6278660 and it never made it into the upstream kernel. Shortly after that these two were added to the upstream kernel: http://kern

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-07 Thread Peter Lieven
just for the records. i have a binary newsfeed testserver with the napi patch running stable for 52 days. it really seems that this was the missing piece! more than 66TB data transferred. root@ubuntu-newsfeed:~# uptime 13:00:49 up 52 days, 14:47, 2 users, load average: 2.32, 2.61, 2.69 root@u

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-06 Thread nutznboltz
486.5 GB transmitted without locking up. $ ifconfig eth0 eth0 Link encap:Ethernet HWaddr 00:16:36:1c:fe:1a inet addr:192.168.1.105 Bcast:192.168.1.255 Mask:255.255.255.0 inet6 addr: fe80::216:36ff:fe1c:fe1a/64 Scope:Link UP BROADCAST RUNNING MULTICAST MTU:1

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-06 Thread nutznboltz
I'm actively testing two concurrent "scp -r" of > 200 GB from NFS directory to remote host which crashed the VM on the stock kernel. $ uname -a Linux dubnium 2.6.32-28-server #55ubuntu1~ppa3~lucid1-Ubuntu SMP Sun Feb 6 01:03:25 UTC 2011 x86_64 GNU/Linux To test with the PPA run sudo apt-get ins

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-05 Thread nutznboltz
I did the debdiff, the update to the bug report description for the SRU and subscribed https://launchpad.net/~ubuntu-sru Now this needs sponsorship. https://wiki.ubuntu.com/SponsorshipProcess In the meantime I started this PPA with the patch https://launchpad.net/~nutznboltz/+archive/lucid-virtio

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-04 Thread nutznboltz
To get any further with this bug report read https://wiki.ubuntu.com/StableReleaseUpdates and try to do what it says. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-04 Thread nutznboltz
How to install the debdiff: https://wiki.ubuntu.com/UbuntuPackagingGuide/BuildFromDebdiff -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page allocat

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-04 Thread nutznboltz
Oh, I see what you mean by "can't integrate backlog for every protocol". I rebuilt the debdiff with only the virtio driver patch and deleted the old debdiff with two patches. Debdiff containing patch:  * [PATCH] KVM: add schedule check to napi_enable call - http://kerneltrap.org/mailarchive

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-04 Thread nutznboltz
** Patch removed: "linux_2.6.32-28.55ubuntu1.debdiff" https://bugs.launchpad.net/ubuntu/+source/linux/+bug/579276/+attachment/1828276/+files/linux_2.6.32-28.55ubuntu1.debdiff -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-03 Thread nutznboltz
Debdiff containing two patches:  * [PATCH] KVM: add schedule check to napi_enable call - http://kerneltrap.org/mailarchive/linux-netdev/2010/6/4/6278660 * [PATCH 4/8] udp: use limited socket backlog - http://kerneltrap.org/mailarchive/linux-netdev/2010/3/3/6271096 ** Patch added: "li

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-03 Thread nutznboltz
Thinking about this a bit more, the "udp: use limited socket backlog" is still necessary since the "page allocation failures" from nic drivers problem affects real hardware with e100, e1000 and e1000e nic drivers. -- You received this bug notification because you are a member of Ubuntu Bugs, whic

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-03 Thread nutznboltz
Oh, sorry, I hadn't seen this yet: KVM: add schedule check to napi_enable call http://kerneltrap.org/mailarchive/linux-netdev/2010/6/4/6278660 If you want to get this into Lucid make a debdiff https://wiki.ubuntu.com/PackagingGuide/Recipes/Debdiff -- You received this bug notification because yo

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-02-03 Thread nutznboltz
This was fixed, but not in Ubuntu yet. See: "udp: use limited socket backlog" http://kerneltrap.org/mailarchive/linux-netdev/2010/3/3/6271096 Bug#576838: virtio network crashes again starting with comment 184 http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=592187#184 LP#661212 "crash after ksw

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2011-01-07 Thread Peter Lieven
It seems that "virtio_net: Add schedule check to napi_enable call" is the final solution to the virtio_net crashes. I have a newsserver (constantly 300-500mbit throughput) running a modified kernel with this patch for almost one month now. Who can help getting this patch into ubuntu-lucid offici

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-12-16 Thread Peter Lieven
It took about 5 days this time to crash an unpatches Ubuntu LTS 10.04.1 64-bit server. I will now try my home built kernel with the "virtio_net: Add schedule check to napi_enable call" patch included. If this patch is the final solution can someone help with: - getting this patch in the vanilla

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-12-03 Thread lhotari
Sergey, I'm still running 9.04 (2.6.28-19.24) in production because of this virtio_net stability problem. I'd like to upgrade the VMs to 10.04 LTS after this problem is resolved. I haven't done retesting lately. -- You received this bug notification because you are a member of Ubuntu Bugs, which

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-12-03 Thread Sergey Svishchev
Right, Lucid kernel package 2.6.32-25.44 includes the "more reliable" fix (see bug 607824). lhotari, do you use this kernel? -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-12-03 Thread Peter Lieven
This is weird. I remember when I last experiment with a vanilla 2.6.34 some time ago, it still crashed under heavy load. 2.6.34 should have all the patches mentioned applied except for the yet unpublished napi fix from SuSe. I'm currently rebuilding my test setup (binary news spool server) and see

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-12-03 Thread Sergey Svishchev
It's not in any kernel tree, either. Probably needs someone to bring it to lkml@'s attention. -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/579276 Title: Lost network in KVM VM / virtio_net page al

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-12-02 Thread Peter Lieven
Sergey, thank you very much for the summary. From what I can see all patches made it to the stable kernel since 2.6.34, but not the third patch "virtio_net: Add schedule check to napi_enable call". Am I right with that? That might be a reason why I still see the issue with recent kernels (2.6.34+).

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-12-02 Thread Sergey Svishchev
SuSE added another two patches to their 2.6.32 tree. Quoting first message: "These are patches which we have found useful for our 2.6.32 based SLES 11 SP1 release. The first patch ["Make delayed refill more reliable"] is already upstream, but should be included in stable. The second patch ["If

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-12-02 Thread Sergey Svishchev
To summarize: * This is a "RX lockup" condition -- VM still runs, can send traffic (ARP requests, mostly), but cannot receive replies. * On Lucid, two backports to 2.6.32 are needed, "virtio_net: Make delayed refill more reliable" and "Add limit for socket backlog". I didn't research which mainl

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-11-05 Thread lhotari
I hope this fix gets included in 10.04.1 LTS as soon as possible. We haven't been able to upgrade our Ubuntu VMs (running on Linux KVM) to 10.04 because of this bug. I think the priority should be much higher. Could someone assign this bug to someone in the Ubuntu Server team? -- Lost network in

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-11-05 Thread lhotari
Original patch: http://git.kernel.org/linus/8eae939f1400326b06d0c9afe53d2a484a326871 -- Lost network in KVM VM / virtio_net page allocation failure https://bugs.launchpad.net/bugs/579276 You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. --

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-11-05 Thread lhotari
Changelog of Debian kernel 2.6.32-22 : * net/{tcp,udp,llc,sctp,tipc,x25}: Add limit for socket backlog (Closes: #592187) -- Lost network in KVM VM / virtio_net page allocation failure https://bugs.launchpad.net/bugs/579276 You received this bug notification because you are a member of Ubu

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-11-05 Thread lhotari
Another upstream bug report: http://bugs.debian.org/cgi- bin/bugreport.cgi?bug=592187 ** Bug watch added: Debian Bug tracker #592187 http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=592187 -- Lost network in KVM VM / virtio_net page allocation failure https://bugs.launchpad.net/bugs/579276 Yo

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-11-05 Thread CvB
Something must have changed, though. While earlier, I saw this bug whenever I increased network load on the virtio device, with Linux server64 2.6.32-25-server #45-Ubuntu the situation has improved, i.e. the system has not crashed again so far, despite some load tests. -- Lost network in KVM VM /

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-11-01 Thread Joe Kislo
No, I am completely wrong. It still happens all the time. There was a period when it seemed stable. Here is my kernel panic fwiw on lucid linux-image-2.6.32-25-server 2.6.32-25.45: [747393.713739] swapper: page allocation failure. order:0, mode:0x4020 [747393.713743] Pid: 0, comm: swappe

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-10-23 Thread Joe Kislo
We use vmware ESXi, and we were crippled by this bug (we had to rebuild several systems back to karmic because they were so unusable). Somewhere in the past month a kernel upgrade seems to have resolved this issue for us. We could reproduce this problem easily: Remote System: cat /dev/zero | nc -

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-10-03 Thread Peter Lieven
I can confirm this bug in Ubuntu Lucis LTS 10.04.1 64-bit Server. This bug seems not to exists in an older kernel from opensuse 11.1 which I use with heavy network i/o load. Version is: Linux 2.6.27.48-0.2-default So the bug seems to have been added somewhere in between. -- Lost network in KV

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-09-26 Thread Rupert Hair
Switching to the 'e1000' emulation seems to have worked for us, but it's far from a nice solution. Rupert -- Lost network in KVM VM / virtio_net page allocation failure https://bugs.launchpad.net/bugs/579276 You received this bug notification because you are a member of Ubuntu Bugs, which is sub

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-09-26 Thread CvB
I think I'm seeing this or a similar bug on my lucid VM (on a lucid host), too. (2.6.32-24-server Kernel) Is there a workaround? Does it help, e.g., to switch from virtio to some other network device? -- Lost network in KVM VM / virtio_net page allocation failure https://bugs.launchpad.net/bugs/

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-08-10 Thread Sergey Svishchev
Another variant of this bug occurs even with both patches applied -- see recent comments in debbug 592187. Reporter says: "With 2.6.35-1~experimental.1 and virtio in the guest, 2.6.32-18 in the host I cannot (yet) trigger the bug." -- Lost network in KVM VM / virtio_net page allocation failure

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-06-24 Thread William King
I can confirm this same issue. -- Lost network in KVM VM / virtio_net page allocation failure https://bugs.launchpad.net/bugs/579276 You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.co

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-06-21 Thread Jeremy Foshee
** Tags added: kernel-needs-review -- Lost network in KVM VM / virtio_net page allocation failure https://bugs.launchpad.net/bugs/579276 You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubunt

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-06-13 Thread RoyK
Please note that this bug eventually kills my VM. It loses network, logs nfs timeouts and won't let anyone login to the console, nor do anything useful. A reboot of the guest fixes this, but since the error occurs after such a short time, this is not even a workaround. Reversing the client/server

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-06-13 Thread RoyK
I can confirm this on a Lucid VM running in KVM with a Lucid host. This mainly happens if the VM is copying data to/from an NFS share (guest as the NFS client, host as the NFS server). IMHO this should be prioritised higher than 'medium' since it doesn't take more than just minutes on full network

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-06-12 Thread Robert Sheets
Regarding my last comment: I tried it, using the 2010-05-31-lucid mainline kernel build. The issue seemed to take longer to come about, but it still happened eventually. -- Lost network in KVM VM / virtio_net page allocation failure https://bugs.launchpad.net/bugs/579276 You received this bug not

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-06-11 Thread Robert Sheets
So is installation of a recent upstream kernel thought to be a workaround for this? -- Lost network in KVM VM / virtio_net page allocation failure https://bugs.launchpad.net/bugs/579276 You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. --

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-06-02 Thread Jeremy Foshee
** Changed in: linux (Ubuntu) Status: Incomplete => Triaged ** Changed in: linux (Ubuntu) Importance: Undecided => Medium ** Tags added: cherry-pick kernel-net ** Tags removed: needs-kernel-logs needs-upstream-testing -- Lost network in KVM VM / virtio_net page allocation failure http

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-06-02 Thread Sergey Svishchev
Quoting Debian bug report: "> It seems as if Redhat encountered and fixed this bug back in January: > > https://bugzilla.redhat.com/show_bug.cgi?id=554078 The original upstream bug fix was: commit 3161e453e496eb5643faad30fff5a5ab183da0fe Author: Rusty Russell Date: Wed Aug 26 12:22:32 2009 -

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-05-18 Thread lhotari
apport information ** Tags added: apport-collected ** Description changed: Binary package hint: 2.6.32-21-server I'm seeing similar bugs in a VM running Lucid as there is reported in these Redhat bug reports: https://bugzilla.redhat.com/show_bug.cgi?id=520119 https://bugzilla.redhat.

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-05-17 Thread Jeremy Foshee
Hi lhotari, Please be sure to confirm this issue exists with the latest development release of Ubuntu. ISO CD images are available from http://cdimage.ubuntu.com/releases/ . If the issue remains, please run the following command from a Terminal (Applications->Accessories->Terminal). It will

[Bug 579276] Re: Lost network in KVM VM / virtio_net page allocation failure

2010-05-12 Thread lhotari
upstream bug report: http://bugs.debian.org/cgi- bin/bugreport.cgi?bug=576838 ** Bug watch added: Debian Bug tracker #576838 http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=576838 -- Lost network in KVM VM / virtio_net page allocation failure https://bugs.launchpad.net/bugs/579276 You receiv