high-activity web server spotaneous crashes

Rahul Dhesi Thu, 10 Feb 2000 03:40:26 -0800
I have a high-activity Apache web server on a 3.4-STABLE box (cvsup'd a
few weeks ago) which is spontaneously crashing several times a day
without logging anything about the crash in the messages file.  I
started a cron job periodically saving current status, and I have the
statistics shown below from just before and just after the crash.

There is no keyboard and monitor attached, and I am only running some
very basic software including named, postfix (email, very low traffic)
and apache (80 processes or so, 30-40 active and the rest usually idle).
Web transaction rate is 50-80 hits per second.  I see some apache (httpd)
processes exit sometimes with signal 10 or signal 11.

ASUS P2BS motherboard with onboard SCSI.  256 M memory.

Tulip NICs are in use:
    de0: <Digital 21140A Fast Ethernet> rev 0x22 int a irq 10 on pci0.11.0
    de0: 21140A [10-100Mb/s] pass 2.2
    de0: address 00:40:05:a1:bb:1d
    de1: <Digital 21140A Fast Ethernet> rev 0x22 int a irq 11 on pci0.12.0
    de1: 21140A [10-100Mb/s] pass 2.2

I am suspecting either a hardware problem or an OS bug.  It's quite hard
to completely eliminate the possibility of a hardware problem but the
machine does have ECC memory, so quite likely the problem is not due to
bad memory.  The machine was stable until subjected to high web hit
load.

kern.ipc.nmbclusters shows 8192.

Are there any other kernel variables that should be adjusted that might
avoid the crashes?

I can supply more deails (e.g. boot-time messages and kernel config) but
I don't want to make this email too long.

The data below are as follows:

a. output from 'date'
b. output from 'netstat -n -f inet' parsed to count number of lines
   showing each state.
c. output from 'netstat -m'


=====
Wed Feb  9 19:35:01 PST 2000
CLOSE_WAIT              8
CLOSING                 1
ESTABLISHED            42
FIN_WAIT_1             39
FIN_WAIT_2            104
LAST_ACK                1
LISTEN                  9
SYN_RCVD               13
TIME_WAIT            1125
2024/3264 mbufs in use:
        571 mbufs allocated to data
        1453 mbufs allocated to packet headers
512/1360/8192 mbuf clusters in use (current/peak/max)
3128 Kbytes allocated to network (40% in use)
0 requests for memory denied
0 requests for memory delayed
0 calls to protocol drain routines

*** crashed and came back up at 19:39 ***

Wed Feb  9 19:40:01 PST 2000
CLOSE_WAIT              6
ESTABLISHED            22
FIN_WAIT_1             39
FIN_WAIT_2             12
LISTEN                  9
SYN_RCVD               10
TIME_WAIT             922
1493/1856 mbufs in use:
        423 mbufs allocated to data
        1070 mbufs allocated to packet headers
379/688/8192 mbuf clusters in use (current/peak/max)
1608 Kbytes allocated to network (58% in use)
0 requests for memory denied
0 requests for memory delayed
0 calls to protocol drain routines

== END ==


To Unsubscribe: send mail to [EMAIL PROTECTED]
with "unsubscribe freebsd-stable" in the body of the message
high-activity web server spotaneous crashes

Reply via email to