I just upgraded one of our Dell Poweredge 1750 servers from OpenBSD 4.2
to 4.4, and am having a problem with watchdogd, which we have enabled
in our configuration.

I figured out that ipmi is disabled by default in the 4.4 GENERIC
kernel, so enabled it. After this I can read sensor values, etc. But
running watchdogd results in an almost immediate panic.

I'm not certain at this point if I am doing something dumb, or if I
should be submitting a bug report. I assume there must be some problems
(on some hardware) with ipmi support, for it to be disabled by default,
but I'm afraid I haven't been able to find many details (I saw some
reference to issues on IBM servers).

Here is my dmesg output, and below that is some ddb output from the
panic ("trace" and "ps"). I'd be grateful if someone had any ideas as to
what I might try next.

(I had also disabled acpi in the boot below to see if it made any
difference - it didn't).

    OpenBSD 4.4 (GENERIC) #1021: Tue Aug 12 17:16:55 MDT 2008
    [EMAIL PROTECTED]:/usr/src/sys/arch/i386/compile/GENERIC
    cpu0: Intel(R) Xeon(TM) CPU 2.40GHz ("GenuineIntel" 686-class) 2.39
    GHz
    cpu0:
    
FPU,V86,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CFLUSH,DS,ACPIR
    real mem  = 1073147904 (1023MB)
    avail mem = 1029259264 (981MB)
    mainbus0 at root
    bios0 at mainbus0: AT/286+ BIOS, date 09/29/06, BIOS32 rev. 0 @
    0xffe90, SMBIOS rev. 2.3 @ 0)
    bios0: vendor Dell Computer Corporation version "A12" date
    09/29/2006
    bios0: Dell Computer Corporation PowerEdge 1750
    acpi at bios0 function 0x0 not configured
    pcibios0 at bios0: rev 2.1 @ 0xf0000/0x10000
    pcibios0: PCI IRQ Routing Table rev 1.0 @ 0xfc4a0/144 (7 entries)
    pcibios0: PCI Interrupt Router at 000:15:0 ("ServerWorks CSB5" rev
    0x00)
    pcibios0: PCI bus #0 is the last bus
    bios0: ROM list: 0xc0000/0x8000 0xc8000/0x4000 0xcc000/0x600
    0xec000/0x4000!
    ipmi0 at mainbus0: version 1.0 interface BT iobase 0xe4/3 spacing 1
    irq 10
    cpu0 at mainbus0
    pci0 at mainbus0 bus 0: configuration mode 1 (no bios)
    pchb0 at pci0 dev 0 function 0 "ServerWorks CNB20-HE Host (GC-LE)"
    rev 0x33
    pchb1 at pci0 dev 0 function 1 "ServerWorks CNB20-HE Host (GC-LE)"
    rev 0x00
    pci1 at pchb1 bus 1
    em0 at pci1 dev 4 function 0 "Intel PRO/1000MT (82546GB)" rev 0x03:
    irq 7, address 00:04:23:c
    em1 at pci1 dev 4 function 1 "Intel PRO/1000MT (82546GB)" rev 0x03:
    irq 11, address 00:04:23d
    pchb2 at pci0 dev 0 function 2 "ServerWorks CNB20-HE Host (GC-LE)"
    rev 0x00
    pci2 at pchb2 bus 3
    "Dell DRAC 4 Embedded/Optional" rev 0x00 at pci0 dev 8 function 0
    not configured
    puc0 at pci0 dev 8 function 1 "Dell DRAC 3 Virtual UART" rev 0x00:
    ports: 1 com
    com3 at puc0 port 0 irq 7: ns16550a, 16 byte fifo
    com3: probed fifo depth: 0 bytes
    "Dell DRAC 3 Embedded/Optional" rev 0x00 at pci0 dev 8 function 2
    not configured
    vga1 at pci0 dev 14 function 0 "ATI Rage XL" rev 0x27
    wsdisplay0 at vga1 mux 1: console (80x25, vt100 emulation)
    wsdisplay0: screen 1-5 added (80x25, vt100 emulation)
    drm at vga1 unsupported
    piixpm0 at pci0 dev 15 function 0 "ServerWorks CSB5" rev 0x93: SMBus
    disabled
    pciide0 at pci0 dev 15 function 1 "ServerWorks CSB5 IDE" rev 0x93:
    DMA
    atapiscsi0 at pciide0 channel 1 drive 0
    scsibus0 at atapiscsi0: 2 targets, initiator 7
    cd0 at scsibus0 targ 0 lun 0: <SAMSUNG, CD-ROM SN-124, N103> ATAPI
    5/cdrom removable
    cd0(pciide0:1:0): using PIO mode 4, DMA mode 2, Ultra-DMA mode 2
    ohci0 at pci0 dev 15 function 2 "ServerWorks OSB4/CSB5 USB" rev
    0x05: irq 5, version 1.0, let
    pcib0 at pci0 dev 15 function 3 "ServerWorks CSB5 LPC" rev 0x00
    pchb3 at pci0 dev 16 function 0 "ServerWorks CIOB-E" rev 0x12
    pchb4 at pci0 dev 16 function 2 "ServerWorks CIOB-E" rev 0x12
    pci3 at pchb4 bus 2
    bge0 at pci3 dev 0 function 0 "Broadcom BCM5704C" rev 0x02, BCM5704
    A2 (0x2002): irq 7, addr2
    brgphy0 at bge0 phy 1: BCM5704 10/100/1000baseT PHY, rev. 0
    bge1 at pci3 dev 0 function 1 "Broadcom BCM5704C" rev 0x02, BCM5704
    A2 (0x2002): irq 11, add3
    brgphy1 at bge1 phy 1: BCM5704 10/100/1000baseT PHY, rev. 0
    pchb5 at pci0 dev 17 function 0 "ServerWorks CIOB-X2 PCIX" rev 0x05
    pchb6 at pci0 dev 17 function 2 "ServerWorks CIOB-X2 PCIX" rev 0x05
    pci4 at pchb6 bus 4
    mpi0 at pci4 dev 5 function 0 "Symbios Logic 53c1030" rev 0x07: irq
    7
    scsibus1 at mpi0: 16 targets, initiator 7
    sd0 at scsibus1 targ 0 lun 0: <IBM, IC35L036UCDY10-0, S27T> SCSI3
    0/direct fixed
    sd0: 34732MB, 36703 cyl, 6 head, 323 sec, 512 bytes/sec, 71132959
    sec total
    safte0 at scsibus1 targ 6 lun 0: <PE/PV, 1x3 SCSI BP, 1.1> SCSI2
    3/processor fixed
    mpi0: target 0 Sync at 160MHz width 16bit offset 127 QAS 0 DT 1 IU 1
    mpi1 at pci4 dev 5 function 1 "Symbios Logic 53c1030" rev 0x07: irq
    11
    scsibus2 at mpi1: 16 targets, initiator 7
    usb0 at ohci0: USB revision 1.0
    uhub0 at usb0 "ServerWorks OHCI root hub" rev 1.00/1.00 addr 1
    isa0 at pcib0
    isadma0 at isa0
    com0 at isa0 port 0x3f8/8 irq 4: ns16550a, 16 byte fifo
    com0: console
    com1 at isa0 port 0x2f8/8 irq 3: ns16550a, 16 byte fifo
    pckbc0 at isa0 port 0x60/5
    pckbd0 at pckbc0 (kbd slot)
    pckbc0: using irq 1 for kbd slot
    wskbd0 at pckbd0: console keyboard, using wsdisplay0
    pmsi0 at pckbc0 (aux slot)
    pckbc0: using irq 12 for aux slot
    wsmouse0 at pmsi0 mux 0
    pcppi0 at isa0 port 0x61
    midi0 at pcppi0: <PC speaker>
    spkr0 at pcppi0
    npx0 at isa0 port 0xf0/16: reported by CPUID; using exception 16
    fdc0 at isa0 port 0x3f0/6 irq 6 drq 2
    fd0 at fdc0 drive 0: 1.44MB 80 cyl, 2 head, 18 sec
    biomask efe5 netmask efe5 ttymask ffff
    mtrr: Pentium Pro MTRR support
    softraid0 at root
    root on sd0a swap on sd0b dump on sd0b

and from the panic:

panic: ohci_add_done: addr 0xfaa0039e not found
Stopped at Debugger+0x4: leave
RUN AT LEAST 'trace' AND 'ps' AND INCLUDE OUTPUT WHEN REPORTING THIS
PANIC!
DO NOT EVEN BOTHER REPORTING THIS WITHOUT INCLUDING THAT INFORMATION!
ddb> trace
Debugger(3c,0,d7a78100,faa0039e,0) at Debugger+0x4
panic(d06ed9a0,faa0039e,d7a78100,14,0) at panic+0x55
ohci_softintr(d19db000,faa0039e,d0201f42,dd750e4c,d04b0605) at
ohci_softintr
ohci_intr1(d19db000) at ohci_intr1+0x1d5
Xrecurse_legacy5() at Xrecurse_legacy5+0xad
--- interrupt ---
bcopy(ddd75,30000,0,eb40000,ddd75) at bcopy+0x41
ddb> ps
PID PPID PGRP UID S FLAGS WAIT COMMAND
5109 8345 5109 0 2 0x4402 watchdogd
8345 16814 8345 0 3 0x4082 wait bash
16814 9948 16814 5000 3 0x4082 wait bash
9948 11938 11938 5000 3 0x180 select sshd
11938 32278 11938 0 3 0x4180 netio sshd
30145 1 30145 0 3 0x4082 ttyin getty
20700 1 20700 0 3 0x4082 ttyin getty
17058 1 17058 0 3 0x4082 ttyin getty
10975 1 10975 0 3 0x4082 ttyin getty
19585 1 19585 0 3 0x4082 ttyin getty
28157 1 28157 0 3 0x4082 ttyin getty
3016 1 3016 0 3 0x80 select cron
31990 1 31990 565 2 0x180 pfflowd
6319 1 6319 77 3 0x180 poll dhcrelay
20820 1 20820 77 3 0x180 poll dhcrelay
21763 1 21763 77 3 0x180 poll dhcrelay
7056 1 7056 77 3 0x180 poll dhcrelay
2446 1 2446 77 3 0x180 poll dhcrelay
18761 1 18761 77 3 0x180 poll dhcrelay
20703 1 20703 77 3 0x180 poll dhcrelay
10615 1 10615 77 3 0x180 poll dhcrelay
10075 1 10075 77 3 0x180 poll dhcrelay
8326 1 8326 77 3 0x180 poll dhcrelay
428 1 428 77 3 0x180 poll dhcrelay
28507 1 28507 77 3 0x180 poll dhcrelay
10244 1 10244 0 3 0x80 poll cfexecd
26052 1 13348 0 3 0x80 select snmpd
22575 1 22575 0 3 0x40180 select sendmail
32278 1 32278 0 3 0x80 select sshd
95 1 95 0 3 0x80 poll ntpd
20128 1 20128 83 3 0x180 poll ntpd
24418 24328 24328 74 3 0x180 bpf pflogd
24328 1 24328 0 3 0x80 netio pflogd
31386 13097 13097 73 3 0x180 poll syslogd
13097 1 13097 0 3 0x88 netio syslogd
14 0 0 0 3 0x100200 bored crypto
13 0 0 0 3 0x100200 aiodoned aiodoned
12 0 0 0 3 0x100200 syncer update
11 0 0 0 3 0x100200 cleaner cleaner
10 0 0 0 3 0x100200 reaper reaper
9 0 0 0 3 0x100200 pgdaemon pagedaemon
8 0 0 0 3 0x100200 pftm pfpurge
7 0 0 0 3 0x100200 usbtsk usbtask
6 0 0 0 3 0x100200 usbevt usb0
* 5 0 0 0 7 0x100200 ipmi0
4 0 0 0 3 0x100200 bored syswq
3 0 0 0 3 0x100200 idle0
2 0 0 0 3 0x100200 kmalloc kmthread
1 0 1 0 3 0x4080 wait init
0 -1 0 0 3 0x80200 scheduler swapper

Thanks for any ideas,

Graham
-- 
-------------------------------------------------------------------------
Graham Allan
School of Physics and Astronomy - University of Minnesota
-------------------------------------------------------------------------

Reply via email to