I just upgraded one of our Dell Poweredge 1750 servers from OpenBSD 4.2 to 4.4, and am having a problem with watchdogd, which we have enabled in our configuration.
I figured out that ipmi is disabled by default in the 4.4 GENERIC kernel, so enabled it. After this I can read sensor values, etc. But running watchdogd results in an almost immediate panic. I'm not certain at this point if I am doing something dumb, or if I should be submitting a bug report. I assume there must be some problems (on some hardware) with ipmi support, for it to be disabled by default, but I'm afraid I haven't been able to find many details (I saw some reference to issues on IBM servers). Here is my dmesg output, and below that is some ddb output from the panic ("trace" and "ps"). I'd be grateful if someone had any ideas as to what I might try next. (I had also disabled acpi in the boot below to see if it made any difference - it didn't). OpenBSD 4.4 (GENERIC) #1021: Tue Aug 12 17:16:55 MDT 2008 [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC cpu0: Intel(R) Xeon(TM) CPU 2.40GHz ("GenuineIntel" 686-class) 2.39 GHz cpu0: FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPIR real mem = 1073147904 (1023MB) avail mem = 1029259264 (981MB) mainbus0 at root bios0 at mainbus0: AT/286+ BIOS, date 09/29/06, BIOS32 rev. 0 @ 0xffe90, SMBIOS rev. 2.3 @ 0) bios0: vendor Dell Computer Corporation version "A12" date 09/29/2006 bios0: Dell Computer Corporation PowerEdge 1750 acpi at bios0 function 0x0 not configured pcibios0 at bios0: rev 2.1 @ 0xf0000/0x10000 pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfc4a0/144 (7 entries) pcibios0: PCI Interrupt Router at 000:15:0 ("ServerWorks CSB5" rev 0x00) pcibios0: PCI bus #0 is the last bus bios0: ROM list: 0xc0000/0x8000 0xc8000/0x4000 0xcc000/0x600 0xec000/0x4000! ipmi0 at mainbus0: version 1.0 interface BT iobase 0xe4/3 spacing 1 irq 10 cpu0 at mainbus0 pci0 at mainbus0 bus 0: configuration mode 1 (no bios) pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20-HE Host (GC-LE)" rev 0x33 pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20-HE Host (GC-LE)" rev 0x00 pci1 at pchb1 bus 1 em0 at pci1 dev 4 function 0 "Intel PRO/1000MT (82546GB)" rev 0x03: irq 7, address 00:04:23:c em1 at pci1 dev 4 function 1 "Intel PRO/1000MT (82546GB)" rev 0x03: irq 11, address 00:04:23d pchb2 at pci0 dev 0 function 2 "ServerWorks CNB20-HE Host (GC-LE)" rev 0x00 pci2 at pchb2 bus 3 "Dell DRAC 4 Embedded/Optional" rev 0x00 at pci0 dev 8 function 0 not configured puc0 at pci0 dev 8 function 1 "Dell DRAC 3 Virtual UART" rev 0x00: ports: 1 com com3 at puc0 port 0 irq 7: ns16550a, 16 byte fifo com3: probed fifo depth: 0 bytes "Dell DRAC 3 Embedded/Optional" rev 0x00 at pci0 dev 8 function 2 not configured vga1 at pci0 dev 14 function 0 "ATI Rage XL" rev 0x27 wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation) wsdisplay0: screen 1-5 added (80x25, vt100 emulation) drm at vga1 unsupported piixpm0 at pci0 dev 15 function 0 "ServerWorks CSB5" rev 0x93: SMBus disabled pciide0 at pci0 dev 15 function 1 "ServerWorks CSB5 IDE" rev 0x93: DMA atapiscsi0 at pciide0 channel 1 drive 0 scsibus0 at atapiscsi0: 2 targets, initiator 7 cd0 at scsibus0 targ 0 lun 0: <SAMSUNG, CD-ROM SN-124, N103> ATAPI 5/cdrom removable cd0(pciide0:1:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 2 ohci0 at pci0 dev 15 function 2 "ServerWorks OSB4/CSB5 USB" rev 0x05: irq 5, version 1.0, let pcib0 at pci0 dev 15 function 3 "ServerWorks CSB5 LPC" rev 0x00 pchb3 at pci0 dev 16 function 0 "ServerWorks CIOB-E" rev 0x12 pchb4 at pci0 dev 16 function 2 "ServerWorks CIOB-E" rev 0x12 pci3 at pchb4 bus 2 bge0 at pci3 dev 0 function 0 "Broadcom BCM5704C" rev 0x02, BCM5704 A2 (0x2002): irq 7, addr2 brgphy0 at bge0 phy 1: BCM5704 10/100/1000baseT PHY, rev. 0 bge1 at pci3 dev 0 function 1 "Broadcom BCM5704C" rev 0x02, BCM5704 A2 (0x2002): irq 11, add3 brgphy1 at bge1 phy 1: BCM5704 10/100/1000baseT PHY, rev. 0 pchb5 at pci0 dev 17 function 0 "ServerWorks CIOB-X2 PCIX" rev 0x05 pchb6 at pci0 dev 17 function 2 "ServerWorks CIOB-X2 PCIX" rev 0x05 pci4 at pchb6 bus 4 mpi0 at pci4 dev 5 function 0 "Symbios Logic 53c1030" rev 0x07: irq 7 scsibus1 at mpi0: 16 targets, initiator 7 sd0 at scsibus1 targ 0 lun 0: <IBM, IC35L036UCDY10-0, S27T> SCSI3 0/direct fixed sd0: 34732MB, 36703 cyl, 6 head, 323 sec, 512 bytes/sec, 71132959 sec total safte0 at scsibus1 targ 6 lun 0: <PE/PV, 1x3 SCSI BP, 1.1> SCSI2 3/processor fixed mpi0: target 0 Sync at 160MHz width 16bit offset 127 QAS 0 DT 1 IU 1 mpi1 at pci4 dev 5 function 1 "Symbios Logic 53c1030" rev 0x07: irq 11 scsibus2 at mpi1: 16 targets, initiator 7 usb0 at ohci0: USB revision 1.0 uhub0 at usb0 "ServerWorks OHCI root hub" rev 1.00/1.00 addr 1 isa0 at pcib0 isadma0 at isa0 com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo com0: console com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo pckbc0 at isa0 port 0x60/5 pckbd0 at pckbc0 (kbd slot) pckbc0: using irq 1 for kbd slot wskbd0 at pckbd0: console keyboard, using wsdisplay0 pmsi0 at pckbc0 (aux slot) pckbc0: using irq 12 for aux slot wsmouse0 at pmsi0 mux 0 pcppi0 at isa0 port 0x61 midi0 at pcppi0: <PC speaker> spkr0 at pcppi0 npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16 fdc0 at isa0 port 0x3f0/6 irq 6 drq 2 fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec biomask efe5 netmask efe5 ttymask ffff mtrr: Pentium Pro MTRR support softraid0 at root root on sd0a swap on sd0b dump on sd0b and from the panic: panic: ohci_add_done: addr 0xfaa0039e not found Stopped at Debugger+0x4: leave RUN AT LEAST 'trace' AND 'ps' AND INCLUDE OUTPUT WHEN REPORTING THIS PANIC! DO NOT EVEN BOTHER REPORTING THIS WITHOUT INCLUDING THAT INFORMATION! ddb> trace Debugger(3c,0,d7a78100,faa0039e,0) at Debugger+0x4 panic(d06ed9a0,faa0039e,d7a78100,14,0) at panic+0x55 ohci_softintr(d19db000,faa0039e,d0201f42,dd750e4c,d04b0605) at ohci_softintr ohci_intr1(d19db000) at ohci_intr1+0x1d5 Xrecurse_legacy5() at Xrecurse_legacy5+0xad --- interrupt --- bcopy(ddd75,30000,0,eb40000,ddd75) at bcopy+0x41 ddb> ps PID PPID PGRP UID S FLAGS WAIT COMMAND 5109 8345 5109 0 2 0x4402 watchdogd 8345 16814 8345 0 3 0x4082 wait bash 16814 9948 16814 5000 3 0x4082 wait bash 9948 11938 11938 5000 3 0x180 select sshd 11938 32278 11938 0 3 0x4180 netio sshd 30145 1 30145 0 3 0x4082 ttyin getty 20700 1 20700 0 3 0x4082 ttyin getty 17058 1 17058 0 3 0x4082 ttyin getty 10975 1 10975 0 3 0x4082 ttyin getty 19585 1 19585 0 3 0x4082 ttyin getty 28157 1 28157 0 3 0x4082 ttyin getty 3016 1 3016 0 3 0x80 select cron 31990 1 31990 565 2 0x180 pfflowd 6319 1 6319 77 3 0x180 poll dhcrelay 20820 1 20820 77 3 0x180 poll dhcrelay 21763 1 21763 77 3 0x180 poll dhcrelay 7056 1 7056 77 3 0x180 poll dhcrelay 2446 1 2446 77 3 0x180 poll dhcrelay 18761 1 18761 77 3 0x180 poll dhcrelay 20703 1 20703 77 3 0x180 poll dhcrelay 10615 1 10615 77 3 0x180 poll dhcrelay 10075 1 10075 77 3 0x180 poll dhcrelay 8326 1 8326 77 3 0x180 poll dhcrelay 428 1 428 77 3 0x180 poll dhcrelay 28507 1 28507 77 3 0x180 poll dhcrelay 10244 1 10244 0 3 0x80 poll cfexecd 26052 1 13348 0 3 0x80 select snmpd 22575 1 22575 0 3 0x40180 select sendmail 32278 1 32278 0 3 0x80 select sshd 95 1 95 0 3 0x80 poll ntpd 20128 1 20128 83 3 0x180 poll ntpd 24418 24328 24328 74 3 0x180 bpf pflogd 24328 1 24328 0 3 0x80 netio pflogd 31386 13097 13097 73 3 0x180 poll syslogd 13097 1 13097 0 3 0x88 netio syslogd 14 0 0 0 3 0x100200 bored crypto 13 0 0 0 3 0x100200 aiodoned aiodoned 12 0 0 0 3 0x100200 syncer update 11 0 0 0 3 0x100200 cleaner cleaner 10 0 0 0 3 0x100200 reaper reaper 9 0 0 0 3 0x100200 pgdaemon pagedaemon 8 0 0 0 3 0x100200 pftm pfpurge 7 0 0 0 3 0x100200 usbtsk usbtask 6 0 0 0 3 0x100200 usbevt usb0 * 5 0 0 0 7 0x100200 ipmi0 4 0 0 0 3 0x100200 bored syswq 3 0 0 0 3 0x100200 idle0 2 0 0 0 3 0x100200 kmalloc kmthread 1 0 1 0 3 0x4080 wait init 0 -1 0 0 3 0x80200 scheduler swapper Thanks for any ideas, Graham -- ------------------------------------------------------------------------- Graham Allan School of Physics and Astronomy - University of Minnesota -------------------------------------------------------------------------