Hello everybody,
I updated my OpenBSD firewalls (two CARP-ed firewalls) to the 4.9 release
last month (May 24th). I don't know if this is related, but I now see a
significant number of "watchdog timeout" errors in the logs (the master
becomes slave when the error appears).
Before the update, I had seen this error only twice (May 17th and April
14th). xx-1 is normally the master and xx-2 the slave (their dmesg output
is exactly the same). The watchdog timeouts appear in the logs while the
firewall is master (bandwidth ~ 20Mb/s).
xx-1:~# zgrep watchdog /var/log/messages*
/var/log/messages.0.gz:Jun 14 13:59:41 xx-1 /bsd: msk2: watchdog timeout
/var/log/messages.0.gz:Jun 15 15:05:55 xx-1 /bsd: msk0: watchdog timeout
/var/log/messages.0.gz:Jun 15 16:56:32 xx-1 /bsd: msk0: watchdog timeout
/var/log/messages.0.gz:Jun 15 17:23:05 xx-1 /bsd: msk1: watchdog timeout
/var/log/messages.1.gz:Jun 6 10:32:16 xx-1 /bsd: msk1: watchdog timeout
/var/log/messages.1.gz:Jun 6 17:30:55 xx-1 /bsd: msk2: watchdog timeout
/var/log/messages.1.gz:Jun 9 10:17:31 xx-1 /bsd: msk0: watchdog timeout
/var/log/messages.2.gz:May 30 17:10:45 xx-1 /bsd: msk0: watchdog timeout
/var/log/messages.2.gz:May 31 18:02:13 xx-1 /bsd: msk0: watchdog timeout
/var/log/messages.4.gz:May 17 16:18:45 xx-1 /bsd: msk0: watchdog timeout
/var/log/messages.9.gz:Apr 14 16:00:26 xx-1 /bsd: msk0: watchdog timeout
xx-2:~# zgrep watchdog /var/log/messages*
/var/log/messages:Jun 20 10:14:20 xx-2 /bsd: msk0: watchdog timeout
/var/log/messages:Jun 21 15:48:32 xx-2 /bsd: msk2: watchdog timeout
/var/log/messages.0.gz:Jun 17 10:16:12 xx-2 /bsd: msk0: watchdog timeout
The 4.9 upgrade and the increase in watchdog timeouts seem to coincide,
but I have no evidence that the two are actually related.
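To make the before/after comparison less anecdotal, the zgrep output above could be tallied per interface and per period. This is only a minimal sketch: the helper name summarize, the regular expression, and the year 2011 (syslog lines carry no year, so it is inferred from the 4.9 release) are assumptions for illustration, not something taken from the logs themselves.

```python
import re
from collections import Counter
from datetime import datetime

# Assumption: syslog lines have no year; 2011 is inferred from the 4.9 release.
YEAR = 2011
UPGRADE = datetime(YEAR, 5, 24)  # upgrade date given in the post

MONTHS = {"Jan": 1, "Feb": 2, "Mar": 3, "Apr": 4, "May": 5, "Jun": 6,
          "Jul": 7, "Aug": 8, "Sep": 9, "Oct": 10, "Nov": 11, "Dec": 12}

# Matches e.g. "Jun 14 13:59:41 xx-1 /bsd: msk2: watchdog timeout",
# with or without the "/var/log/messages.0.gz:" prefix added by zgrep.
LINE_RE = re.compile(
    r"(?P<month>[A-Z][a-z]{2})\s+(?P<day>\d{1,2})\s+\d{2}:\d{2}:\d{2}\s+"
    r"(?P<host>\S+)\s+/bsd:\s+(?P<ifname>\w+):\s+watchdog timeout"
)

def summarize(lines):
    """Count watchdog timeouts per interface and before/after the upgrade."""
    per_if = Counter()
    per_period = Counter()
    for line in lines:
        m = LINE_RE.search(line)
        if m is None or m.group("month") not in MONTHS:
            continue  # not a watchdog line in the expected format
        when = datetime(YEAR, MONTHS[m.group("month")], int(m.group("day")))
        per_if[m.group("ifname")] += 1
        per_period["after" if when >= UPGRADE else "before"] += 1
    return per_if, per_period

if __name__ == "__main__":
    import sys
    # Usage (hypothetical file name): zgrep watchdog /var/log/messages* | python3 tally.py
    per_if, per_period = summarize(sys.stdin)
    print(dict(per_if), dict(per_period))
```

A count like this only shows that the timing coincides; it does not show that the upgrade is the cause, since traffic levels or hardware state may also have changed over the same weeks.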
Does anybody have the same type of network card and the same problem?
I found some old mails (2007), but I can't tell whether they could help me:
http://gnats.netbsd.org/36454
http://kerneltrap.org/mailarchive/openbsd-misc/2007/5/11/149534/thread
Some mails/forum threads are more recent (2010) but concern FreeBSD:
http://www.freebsd.org/cgi/query-pr.cgi?pr=116853
http://forums.freebsd.org/showthread.php?t=10183
Thank you in advance!
Here is the dmesg of one of the firewalls (they are strictly identical).
OpenBSD 4.9 (GENERIC.MP) #794: Wed Mar 2 07:19:02 MST 2011
dera...@i386.openbsd.org:/usr/src/sys/arch/i386/compile/GENERIC.MP
cpu0: Intel(R) Pentium(R) 4 CPU 3.40GHz ("GenuineIntel" 686-class) 3.41 GHz
cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,DS-CPL,EST,TM2,CNXT-ID,CX16,xTPR,PDCM
real mem = 2145939456 (2046MB)
avail mem = 2100670464 (2003MB)
mainbus0 at root
bios0 at mainbus0: AT/286+ BIOS, date 08/29/05, BIOS32 rev. 0 @ 0xf9680, SMBIOS rev. 2.2 @ 0xf0800 (39 entries)
bios0: vendor Phoenix Technologies, LTD version "6.00 PG" date 08/29/2005
acpi0 at bios0: rev 0
acpi0: sleep states S0 S1
acpi0: tables DSDT FACP MCFG APIC
acpi0: wakeup devices PEX0(S5) PEX1(S5) PEX2(S5) PEX3(S5) HUB0(S5) UAR1(S5) UAR2(S5) USB0(S1) USB1(S1) USB2(S1) USBE(S1) AC97(S5) AZAL(S5) PCI0(S5)
acpitimer0 at acpi0: 3579545 Hz, 24 bits
acpimcfg0 at acpi0 addr 0xe0000000, bus 0-0
acpimadt0 at acpi0 addr 0xfee00000: PC-AT compat
cpu0 at mainbus0: apid 0 (boot processor)
cpu0: apic clock running at 200MHz
cpu1 at mainbus0: apid 1 (application processor)
cpu1: Intel(R) Pentium(R) 4 CPU 3.40GHz ("GenuineIntel" 686-class) 3.41 GHz
cpu1: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPI,MMX,FXSR,SSE,SSE2,SS,HTT,TM,SBF,SSE3,MWAIT,DS-CPL,EST,TM2,CNXT-ID,CX16,xTPR,PDCM
ioapic0 at mainbus0: apid 2 pa 0xfec00000, version 20, 24 pins
ioapic0: misconfigured as apic 0, remapped to apid 2
acpiprt0 at acpi0: bus 0 (PCI0)
acpiprt1 at acpi0: bus 1 (PEX0)
acpiprt2 at acpi0: bus 2 (PEX1)
acpiprt3 at acpi0: bus 3 (PEX2)
acpiprt4 at acpi0: bus 4 (PEX3)
acpiprt5 at acpi0: bus 5 (HUB0)
acpicpu0 at acpi0
acpicpu1 at acpi0
acpitz0 at acpi0: critical temperature 75 degC
acpibtn0 at acpi0: PWRB
bios0: ROM list: 0xc0000/0xa800! 0xcc000/0x8000! 0xef000/0x1000!
cpu0: Enhanced SpeedStep disabled by BIOS
pci0 at mainbus0 bus 0: configuration mode 1 (bios)
pchb0 at pci0 dev 0 function 0 "Intel 82915G Host" rev 0x0e
vga1 at pci0 dev 2 function 0 "Intel 82915G Video" rev 0x0e
wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
intagp0 at vga1
agp0 at intagp0: aperture at 0xc0000000, size 0x10000000
inteldrm0 at vga1: apic 2 int 16 (irq 5)
drm0 at inteldrm0
ppb0 at pci0 dev 28 function 0 "Intel 82801FB PCIE" rev 0x04: apic 2 int 16 (irq 5)
pci1 at ppb0 bus 1
mskc0 at pci1 dev 0 function 0 "Marvell Yukon 88E8053" rev 0x15, Yukon-2 EC rev. A3 (0x2): apic 2 int 16 (irq 5)
msk0 at mskc0 port A: address 00:10:f3:13:c6:98
eephy0 at msk0 phy 0: 88E1111 Gigabit PHY, rev. 2
ppb1 at pci0 dev 28 function 1 "Intel 82801FB PCIE" rev 0x04: apic 2 int 17 (irq 10)
pci2 at ppb1 bus 2
mskc1 at pci2 dev 0 function 0 "Marvell Yukon 88E8053" rev 0x15, Yukon-2 EC rev. A3 (0x2): apic 2 int 17 (irq 10)
msk1 at mskc1 port A: address 00:10:f3:13:c6:99
eephy1 at msk1 phy 0: 88E1111 Gigabit PHY, rev. 2
ppb2 at pci0 dev 28 function 2 "Intel 82801FB PCIE" rev 0x04: apic 2 int 18 (irq 12)
pci3 at ppb2 bus 3
mskc2 at pci3 dev 0 function 0 "Marvell Yukon 88E8053" rev 0x15, Yukon-2 EC rev. A3 (0x2): apic 2 int 18 (irq 12)
msk2 at mskc2 port A: address 00:10:f3:13:c6:9a
eephy2 at msk2 phy 0: 88E1111 Gigabit PHY, rev. 2
ppb3 at pci0 dev 28 function 3 "Intel 82801FB PCIE" rev 0x04: apic 2 int 19 (irq 11)
pci4 at ppb3 bus 4
mskc3 at pci4 dev 0 function 0 "Marvell Yukon 88E8053" rev 0x15, Yukon-2 EC rev. A3 (0x2): apic 2 int 19 (irq 11)
msk3 at mskc3 port A: address 00:10:f3:13:c6:9b
eephy3 at msk3 phy 0: 88E1111 Gigabit PHY, rev. 2
uhci0 at pci0 dev 29 function 0 "Intel 82801FB USB" rev 0x04: apic 2 int 23 (irq 9)
uhci1 at pci0 dev 29 function 1 "Intel 82801FB USB" rev 0x04: apic 2 int 19 (irq 11)
uhci2 at pci0 dev 29 function 2 "Intel 82801FB USB" rev 0x04: apic 2 int 18 (irq 12)
ehci0 at pci0 dev 29 function 7 "Intel 82801FB USB" rev 0x04: apic 2 int 23 (irq 9)
usb0 at ehci0: USB revision 2.0
uhub0 at usb0 "Intel EHCI root hub" rev 2.00/1.00 addr 1
ppb4 at pci0 dev 30 function 0 "Intel 82801BA Hub-to-PCI" rev 0xd4
pci5 at ppb4 bus 5
skc0 at pci5 dev 4 function 0 "Marvell Yukon 88E8001/8003/8010" rev 0x13, Yukon Lite (0x9): apic 2 int 16 (irq 5)
sk0 at skc0 port A: address 00:10:f3:13:c6:94
eephy4 at sk0 phy 0: 88E1011 Gigabit PHY, rev. 5
skc1 at pci5 dev 5 function 0 "Marvell Yukon 88E8001/8003/8010" rev 0x13, Yukon Lite (0x9): apic 2 int 19 (irq 11)
sk1 at skc1 port A: address 00:10:f3:13:c6:95
eephy5 at sk1 phy 0: 88E1011 Gigabit PHY, rev. 5
skc2 at pci5 dev 6 function 0 "Marvell Yukon 88E8001/8003/8010" rev 0x13, Yukon Lite (0x9): apic 2 int 18 (irq 12)
sk2 at skc2 port A: address 00:10:f3:13:c6:96
eephy6 at sk2 phy 0: 88E1011 Gigabit PHY, rev. 5
skc3 at pci5 dev 7 function 0 "Marvell Yukon 88E8001/8003/8010" rev 0x13, Yukon Lite (0x9): apic 2 int 17 (irq 10)
sk3 at skc3 port A: address 00:10:f3:13:c6:97
eephy7 at sk3 phy 0: 88E1011 Gigabit PHY, rev. 5
ichpcib0 at pci0 dev 31 function 0 "Intel 82801FB LPC" rev 0x04: PM disabled
pciide0 at pci0 dev 31 function 1 "Intel 82801FB IDE" rev 0x04: DMA, channel 0 configured to compatibility, channel 1 configured to compatibility
wd0 at pciide0 channel 0 drive 0: <SILICONSYSTEMS INC 2GB>
wd0: 1-sector PIO, LBA, 2001MB, 4098528 sectors
wd0(pciide0:0:0): using PIO mode 4, DMA mode 2
pciide0: channel 1 disabled (no drives)
ichiic0 at pci0 dev 31 function 3 "Intel 82801FB SMBus" rev 0x04: apic 2 int 19 (irq 11)
iic0 at ichiic0
spdmem0 at iic0 addr 0x50: 2GB DDR2 SDRAM non-parity PC2-6400CL5
usb1 at uhci0: USB revision 1.0
uhub1 at usb1 "Intel UHCI root hub" rev 1.00/1.00 addr 1
usb2 at uhci1: USB revision 1.0
uhub2 at usb2 "Intel UHCI root hub" rev 1.00/1.00 addr 1
usb3 at uhci2: USB revision 1.0
uhub3 at usb3 "Intel UHCI root hub" rev 1.00/1.00 addr 1
isa0 at ichpcib0
isadma0 at isa0
com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
com0: console
com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
pckbc0 at isa0 port 0x60/5
pckbd0 at pckbc0 (kbd slot)
pckbc0: using irq 1 for kbd slot
wskbd0 at pckbd0: console keyboard, using wsdisplay0
pcppi0 at isa0 port 0x61
spkr0 at pcppi0
lpt0 at isa0 port 0x378/4 irq 7
it0 at isa0 port 0x2e/2: IT8712F rev 7, EC port 0x290
npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16
mtrr: Pentium Pro MTRR support
vscsi0 at root
scsibus0 at vscsi0: 256 targets
softraid0 at root
root on wd0a swap on wd0b dump on wd0b
Mathieu.