Re: reboot after panic: vm_page_unwire: invalid wire count: 0

2007-12-13 Thread Vivek Khera
On Nov 14, 2007, at 10:13 AM, Vivek Khera wrote: I'm running 6.2-REL. The old kernel was -p5, now without the zero copy sockets, i'm running -p8. I'll know in a couple of days if this is our solution. For the archives: Removing zero copy sockets seems to have fixed the issue. Not a s

Re: reboot after panic: vm_page_unwire: invalid wire count: 0

2007-11-14 Thread Kris Kennaway
Vivek Khera wrote: On Nov 13, 2007, at 7:49 PM, Kris Kennaway wrote: notification. In the meantime, your best bet is to disable ZERO_COPY_SOCKETS. There is a chance this was a recent regression, previously in 7.0 they were believed to work. I'm running 6.2-REL. The old kernel was -p5,

Re: reboot after panic: vm_page_unwire: invalid wire count: 0

2007-11-14 Thread Vivek Khera
On Nov 13, 2007, at 7:49 PM, Kris Kennaway wrote: notification. In the meantime, your best bet is to disable ZERO_COPY_SOCKETS. There is a chance this was a recent regression, previously in 7.0 they were believed to work. I'm running 6.2-REL. The old kernel was -p5, now without the zer

Re: reboot after panic: vm_page_unwire: invalid wire count: 0

2007-11-13 Thread Kip Macy
Various calls that downgrade permissions or virtually copy a pmap in pmap.c now remove PG_W (and did not 6 months ago). This may be the cause of the regression. It would probably be better (and faster) if the pages were "held" instead of wired. -Kip On Nov 13, 2007 4:49 PM, Kris Kennaway <[EM

Re: reboot after panic: vm_page_unwire: invalid wire count: 0

2007-11-13 Thread Kris Kennaway
Kip Macy wrote: Unfortunately, ZERO_COPY_SOCKETs have long been a known source of problems. I think also, when a page is copied as part of COW the new page is unwired (see pmap_copy et al.), this could lead to socow_iodone unwiring after send a page that was not wired. An added issue is that part

Re: reboot after panic: vm_page_unwire: invalid wire count: 0

2007-11-13 Thread Vivek Khera
On Nov 13, 2007, at 5:13 PM, Kip Macy wrote: In the meantime, your best bet is to disable ZERO_COPY_SOCKETS. Thanks for the info. I'm putting the new kernel in place and will see what happens and report back. ___ freebsd-stable@freebsd.org mail

Re: reboot after panic: vm_page_unwire: invalid wire count: 0

2007-11-13 Thread Kip Macy
Unfortunately, ZERO_COPY_SOCKETs have long been a known source of problems. I think also, when a page is copied as part of COW the new page is unwired (see pmap_copy et al.), this could lead to socow_iodone unwiring after send a page that was not wired. An added issue is that parts of the VM assume

Re: reboot after panic: vm_page_unwire: invalid wire count: 0

2007-11-13 Thread Vlad GALU
On 11/13/07, Vivek Khera <[EMAIL PROTECTED]> wrote: > > On Nov 13, 2007, at 4:50 PM, Vlad GALU wrote: > > >>vmio = 1 > >>offset = Unhandled dwarf expression opcode 0x93 > >> (kgdb) > >> > > > >Do you happen to have ZERO_COPY_SOCKETS in your kernel config? > > > > > > Yes, I do.

Re: reboot after panic: vm_page_unwire: invalid wire count: 0

2007-11-13 Thread Vivek Khera
On Nov 13, 2007, at 4:50 PM, Vlad GALU wrote: vmio = 1 offset = Unhandled dwarf expression opcode 0x93 (kgdb) Do you happen to have ZERO_COPY_SOCKETS in your kernel config? Yes, I do. Are they known to be bad under certain loads or just in general. I don't have this

Re: reboot after panic: vm_page_unwire: invalid wire count: 0

2007-11-13 Thread Vlad GALU
/lorax1/usr6/src/sys/ > KCI32SMP i386 > > > The last week or so, it has been crashing regularly. Sometimes twice > per day, and sometimes it runs for two days without a problem. I > finally managed to make it dump a crashlog and core, and discovered > that the panic was: > >

reboot after panic: vm_page_unwire: invalid wire count: 0

2007-11-13 Thread Vivek Khera
larly. Sometimes twice per day, and sometimes it runs for two days without a problem. I finally managed to make it dump a crashlog and core, and discovered that the panic was: reboot after panic: vm_page_unwire: invalid wire count: 0 I google around and found one old PR #33637 which had a patc