Hi. I am confused at the following strange FreeBSD reboot case.
I have a Dell PowerEdge 2300 dual proc machine with Intel 100/10 netcard.
I put 3.2 SMP kernel with options MAXUSERS=256 and NMBCLUSTERS=32768.
Otherwise kernel is almost the same as the stock FreeBSD kernel.
When I started some live Web tests with my custom Apache module,
I found that the FreeBSD panics every 45 minutes. First, I thought
it was some FIN_WAIT_2 problem. So I applied julian(@whistle.com)'s
patch. But it still reboots after 45 minutes of testing. Then
I thought it might be running out of mbufs due to load. However, the
number of allocated mbufs is no where near 32768 just seconds before the reboot.
One additional piece of data is that in my simluated environment, I was able
to pound the machine with FreeBSD 3.2 SMP for more than 12 hours straight
with 600 running Apache child processes.
I haven't tried to dump kernel core to see the call stack since the machine
is actually far away at a hosting site.
I wonder if anyone is aware of SMP kernel issues with Dell PowerEdge machines.
I will try single processor mode when I could arrange more live time.
However I am kind of suspect somewhere in net driver or TCP/IP stack
is screwed up.
Any help on the possible reboot cause is greatly appreciated. Otherwise I might
have to try BSDI which is something I would rather not to do.
Thanks,
--min