Nope I have not tweaked any kernel variables other thatn the ones i tried temporarily and put back to normal afterwards. Ok i will recompile the kernel with 256 maxuser setting and recompile included is dmesg.boot from one of the machines.....not sure if that helps as much as info i found in dmesg...as this is just hardware info. Anyways here it is....should be back in a couple hours after basketball game to recompile the kernels on those 2 boxes 2 new settings. Thx for help as of this point. On Mon, 5 Feb 2001, Matt Dillon wrote: > Date: Mon, 5 Feb 2001 18:31:21 -0800 (PST) > From: Matt Dillon <[EMAIL PROTECTED]> > To: Dan Phoenix <[EMAIL PROTECTED]> > Cc: Alfred Perlstein <[EMAIL PROTECTED]>, Jos Backus <[EMAIL PROTECTED]>, > [EMAIL PROTECTED] > Subject: Re: qmail IO problems > > :ok of those commands some interesting info was from dmesg... > :on one machine i had > :file: table is full > :file: table is full > :file: table is full > :file: table is full > :file: table is full > :file: table is full > :file: table is full > :file: table is full > :file: table is full > :file: table is full > :file: table is full > :file: table is full > :file: table is full > : > :from dmesg > : > :on the other machine > :looutput: mbuf allocation failed > :nfs server 172.16.0.101:/bravenet1/home: not responding > :looutput: mbuf allocation failed > :looutput: mbuf allocation failed > :looutput: mbuf allocation failed > :nfs server 172.16.0.101:/bravenet1/home: is alive again > :looutput: mbuf allocation failed > :looutput: mbuf allocation failed > : > :i doubt that mbuf allocation failed was from the nfs server timeout that > :one time....but cannot be certain......this help you at all? > > This sheds a considerable amount of light on the problems... > methinks you may have a low 'maxusers' setting in the kernel > config. Read on. > > I still need the complete 'dmesg' output, or if it all scrolled off > due to the above errors, cat the '/var/run/dmesg.boot' file. > > You had systat -vm 1 output in the earlier emails, but not > 'vmstat 1' output for 20 seconds. That isn't as big a deal with > all the other info we have now, but still useful. > > > :[root@arwen qmail-1.03]# pstat -s > :Device 1K-blocks Used Avail Capacity Type > :/dev/ad0s1b 1048448 0 1048448 0% Interleaved > :[root@arwen qmail-1.03]# > : > :[root@elrond dphoenix]# pstat -s > :Device 1K-blocks Used Avail Capacity Type > :/dev/ad0s1b 528696 2032 526664 0% Interleaved > :[root@elrond dphoenix]# > > This indicates that you are not swapping or paging significantly, > which is good. We can cross that off the list of possible problems. > > :ps axlww > : > :included is ps.txt..... > :2 perl scripts running only on that machine at moment yet qmail queue keep > :getting larger....seems to be getting abit better but not that great > :either. > : > : (ps output not included in reply) > > The ps output indicates that you are running a relatively light process > load. The prime suspects are thus the file table and mbuf errors. > > These errors normally occur when you configure a much too low > 'maxusers' setting in the kernel config. Since you didn't provide > the complete dmesg output (cat /var/run/dmesg.boot), I can't tell > but I am guessing that you are either using the GENERIC kernel > directly, or you created a custom kernel but didn't tune the > 'maxusers' entry. > > For a machine doing the work this machine is doing, I recommend > a maxusers setting in the kernel config of 256. You need to rebuild > your kernel in that case. Have you ever built a kernel before? > I think all you may need to do is up 'maxusers' in the kernel > config and perhaps mess around with the number of mbuf clusters, > but I suspect increasing maxusers will do the trick. These > changes require recompiling the kernel. > > Also, to make sure... you haven't tweaked any other sysctl's, have > you? > > -Matt >
Copyright (c) 1992-2001 The FreeBSD Project. Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994 The Regents of the University of California. All rights reserved. FreeBSD 4.2-STABLE #0: Fri Feb 2 14:43:07 PST 2001 [EMAIL PROTECTED]:/usr/obj/usr/src/sys/GENERIC Timecounter "i8254" frequency 1193182 Hz CPU: Pentium III/Pentium III Xeon/Celeron (498.75-MHz 686-class CPU) Origin = "GenuineIntel" Id = 0x672 Stepping = 2 Features=0x383fbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,MMX,FXSR,SSE> real memory = 268369920 (262080K bytes) avail memory = 256974848 (250952K bytes) Preloaded elf kernel "kernel" at 0xc043f000. Pentium Pro MTRR support enabled md0: Malloc disk npx0: <math processor> on motherboard npx0: INT 16 interface pcib0: <Intel 82443BX host to PCI bridge (AGP disabled)> on motherboard pci0: <PCI bus> on pcib0 sym0: <875> port 0x1400-0x14ff mem 0xfa200000-0xfa200fff,0xfa204000-0xfa2040ff irq 11 at device 13.0 on pci0 sym0: No NVRAM, ID 7, Fast-20, SE, parity checking sym1: <875> port 0x1800-0x18ff mem 0xfa201000-0xfa201fff,0xfa204400-0xfa2044ff irq 11 at device 13.1 on pci0 sym1: No NVRAM, ID 7, Fast-20, SE, parity checking fxp0: <Intel Pro 10/100B/100+ Ethernet> port 0x1060-0x107f mem 0xfa000000-0xfa0fffff,0xfa205000-0xfa205fff irq 10 at device 15.0 on pci0 fxp0: Ethernet address 00:90:27:73:c0:c0 fxp1: <Intel Pro 10/100B/100+ Ethernet> port 0x1080-0x10bf mem 0xfa100000-0xfa1fffff,0xfa202000-0xfa202fff irq 10 at device 16.0 on pci0 fxp1: Ethernet address 00:90:27:af:30:13 isab0: <Intel 82371AB PCI to ISA bridge> at device 18.0 on pci0 isa0: <ISA bus> on isab0 atapci0: <Intel PIIX4 ATA33 controller> port 0x1050-0x105f at device 18.1 on pci0 ata0: at 0x1f0 irq 14 on atapci0 ata1: at 0x170 irq 15 on atapci0 uhci0: <Intel 82371AB/EB (PIIX4) USB controller> irq 11 at device 18.2 on pci0 uhci0: Could not map ports device_probe_and_attach: uhci0 attach returned 6 chip1: <Intel 82371AB Power management controller> port 0x1040-0x104f at device 18.3 on pci0 pci0: <Cirrus Logic GD5480 SVGA controller> at 20.0 fdc0: <NEC 72065B or clone> at port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on isa0 fdc0: FIFO enabled, 8 bytes threshold fd0: <1440-KB 3.5" drive> on fdc0 drive 0 atkbdc0: <Keyboard controller (i8042)> at port 0x60,0x64 on isa0 atkbd0: <AT Keyboard> flags 0x1 irq 1 on atkbdc0 kbd0 at atkbd0 vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0 sc0: <System console> at flags 0x100 on isa0 sc0: VGA <16 virtual consoles, flags=0x300> sio0 at port 0x3f8-0x3ff irq 4 flags 0x10 on isa0 sio0: type 16550A sio1: configured irq 3 not in bitmap of probed irqs 0 ppc0: <Parallel port> at port 0x378-0x37f irq 7 on isa0 ppc0: Generic chipset (ECP/PS2/NIBBLE) in COMPATIBLE mode ppc0: FIFO with 16/16/8 bytes threshold plip0: <PLIP network interface> on ppbus0 lpt0: <Printer> on ppbus0 lpt0: Interrupt-driven port ppi0: <Parallel I/O> on ppbus0 ad0: 6149MB <QUANTUM FIREBALL CX6.4A> [13328/15/63] at ata0-master UDMA33 acd0: CDROM <IDE/ATAPI CD-ROM 40X> at ata1-master using PIO4 Waiting 15 seconds for SCSI devices to settle Mounting root from ufs:/dev/ad0s1a