Re: kernel hangs by many connections (reproducable)

2010-10-05 Thread Jonathan Gray
On Tue, Oct 05, 2010 at 10:35:28AM +0200, Mike Belopuhov wrote: > > there's a forgotten splx in the driver. that explains why system > appears to be hung. OK? While this is a bug and should be fixed, we never actually call that codepath as only re_diag sets testmode and we never call that :) >

Re: kernel hangs by many connections (reproducable)

2010-10-05 Thread Mike Belopuhov
On Sun, Sep 12, 2010 at 11:26 +0600, Anton Maksimenkov wrote: > Hello. > I use my OBSD machine to test some server on another machine. They are > connected by pathcord, 1Gbit network cards are used. > Test program (uses kqueue) do many (I want thousands) connections to > server. Write query, read a

Re: kernel hangs by many connections (reproducable)

2010-09-19 Thread Anton Maksimenkov
2010/9/13 Anton Maksimenkov : > 2010/9/13 Claudio Jeker : >> When running with that many sockets a prominent warning about increasing >> kern.maxclusters shows up. This is not just dmesg spam, running >> out of mbuf clusters will stop your network stack. > > I've not seen any message neither on con

Re: kernel hangs by many connections (reproducable)

2010-09-13 Thread Anton Maksimenkov
2010/9/13 Claudio Jeker : > When running with that many sockets a prominent warning about increasing > kern.maxclusters shows up. This is not just dmesg spam, running > out of mbuf clusters will stop your network stack. I've not seen any message neither on console nor in logs. I tried to set kern

Re: kernel hangs by many connections (reproducable)

2010-09-13 Thread Henning Brauer
* Claudio Jeker [2010-09-13 08:12]: > Oh, that pool_get succeds since mbufs don't have a limit but the > allocation of the cluster fails so the driver will reuse the old buffer on > the queue. well, the trace shows a pool_get from m_gethdr which is certainly not the cluster. > By breaking into d

Re: kernel hangs by many connections (reproducable)

2010-09-12 Thread Claudio Jeker
On Mon, Sep 13, 2010 at 06:35:10AM +0200, Bret S. Lambert wrote: > On Mon, Sep 13, 2010 at 10:12:44AM +0600, Anton Maksimenkov wrote: > > 2010/9/13 Henning Brauer : > > >> hangs. 1-2 sec after start. > > >> --- interrupt --- > > >> pool_do_get(d0a10b60,0,0,0,60) at pool_do_get+0x2c2 > > >> pool_get

Re: kernel hangs by many connections (reproducable)

2010-09-12 Thread Anton Maksimenkov
2010/9/13 Bret S. Lambert : > yes, because you've soaked up all the memory that's available for > handling incoming/outgoing network traffic; you've got a bunch of > processes that try to grab a limited number of resources, fail to > get all they need, and sleep while holding already-allocated mbuf

Re: kernel hangs by many connections (reproducable)

2010-09-12 Thread Bret S. Lambert
On Mon, Sep 13, 2010 at 10:12:44AM +0600, Anton Maksimenkov wrote: > 2010/9/13 Henning Brauer : > >> hangs. 1-2 sec after start. > >> --- interrupt --- > >> pool_do_get(d0a10b60,0,0,0,60) at pool_do_get+0x2c2 > >> pool_get(d0a10b60,0,8000,0,0) at pool_get+0x54 > >> m_gethdr(1,1,8000,369e99,

Re: kernel hangs by many connections (reproducable)

2010-09-12 Thread Anton Maksimenkov
2010/9/13 Henning Brauer : >> hangs. 1-2 sec after start. >> --- interrupt --- >> pool_do_get(d0a10b60,0,0,0,60) at pool_do_get+0x2c2 >> pool_get(d0a10b60,0,8000,0,0) at pool_get+0x54 >> m_gethdr(1,1,8000,369e99,0) at m_gethdr+0x39 > too me that simply looks like you are running out of memo

Re: kernel hangs by many connections (reproducable)

2010-09-12 Thread Henning Brauer
* Anton Maksimenkov [2010-09-12 07:35]: > I use my OBSD machine to test some server on another machine. They are > connected by pathcord, 1Gbit network cards are used. > Test program (uses kqueue) do many (I want thousands) connections to > server. Write query, read answer. > And it tries to keep