Robert,

On Tue, 27 Sep 2005, Robert Watson wrote:

> Great. As mentioned I'll be offline for about the next 48 hours, but back
> after then. If we can get a nice clean crash out of this, would really be
> best. If it's top panicking, it could well be due to a bug in the process
> monitoring code, in kern_proc. We've run into bugs a few times there in
> the past, generally associated with threading or races in process
> creation/teardown, in which partially initialized (or torn down) processes
> are accessed by another thread and are in an unexpected state.

We recompiled the kernel with 'options KDB_STOP_NMI' and were able to get a
much fuller analysis of what was happening in the 6-BETA5 crash. We crashed
in top again, and it does look like we may have hit a kern_proc bug.

In the attached file type3-core.txt you can see that it hits an exception
in:

0xffffffff802b897a is in fill_kinfo_thread (/usr/src/sys/kern/kern_proc.c:736).
731             }
732
733             kg = td->td_ksegrp;
734
735             /* things in the KSE GROUP */
736             kp->ki_estcpu = kg->kg_estcpu;
737             kp->ki_slptime = kg->kg_slptime;
738             kp->ki_pri.pri_user = kg->kg_user_pri;
739             kp->ki_pri.pri_class = kg->kg_pri_class;
740

(kgdb) frame 8
#8  0xffffffff802b897a in fill_kinfo_thread (td=0xffffff0063311260, kp=0xffffffffb62d8510) at /usr/src/sys/kern/kern_proc.c:733
733             kg = td->td_ksegrp;
(kgdb) p kg->kg_estcpu
Cannot access memory at address 0x173
(kgdb) p td->td_ksegrp
$1 = (struct ksegrp *) 0x0
(kgdb) p kp->ki_estcpu
$2 = 0
(kgdb) p kg
$4 = (struct ksegrp *) 0x12b

It seems that kg is an invalid pointer: gdb reports it as 0x12b, while
td->td_ksegrp reads as NULL in the dump.

We have started our tests again without running top.

Hope you have a great vacation.

- Rob Watt
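For anyone following along, here is a minimal user-space sketch of the failure
mode described above. It is not the kernel code and not a proposed fix. The
structures are cut-down stand-ins whose layouts are assumptions, and the names
fill_kinfo_thread_sketch, kinfo_proc_sketch and implied_offset are made up for
illustration. It shows two things: reading td_ksegrp once and refusing to
dereference it when it is NULL, and the arithmetic behind the faulting address
(0x173 - 0x12b = 0x48, which would be the offset gdb applied for kg_estcpu).

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/*
 * Simplified stand-ins for the kernel structures. Field names follow the
 * kern_proc.c listing; the layouts are illustrative assumptions only.
 */
struct ksegrp {
	unsigned int kg_estcpu;
	unsigned int kg_slptime;
};

struct thread {
	struct ksegrp *td_ksegrp;	/* may be NULL during creation/teardown */
};

struct kinfo_proc_sketch {
	unsigned int ki_estcpu;
	unsigned int ki_slptime;
};

/*
 * Hypothetical guarded variant: returns 0 on success, or -1 if the thread
 * has no KSE group attached at the moment it is sampled.
 */
int
fill_kinfo_thread_sketch(const struct thread *td, struct kinfo_proc_sketch *kp)
{
	struct ksegrp *kg;

	assert(td != NULL && kp != NULL);

	kg = td->td_ksegrp;		/* read the pointer exactly once */
	if (kg == NULL)
		return (-1);

	kp->ki_estcpu = kg->kg_estcpu;
	kp->ki_slptime = kg->kg_slptime;
	return (0);
}

/* Field offset implied by a faulting address and the base pointer value. */
uintptr_t
implied_offset(uintptr_t fault_addr, uintptr_t base)
{
	return (fault_addr - base);
}
```

Note that a NULL check like this would not have caught the value in the dump,
since kg was 0x12b rather than NULL, and a check alone does not close a race
with thread teardown. Holding the appropriate lock across the read is what
would be needed for that; which lock that is in 6-BETA5 is not something this
sketch tries to answer.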
type3-core.txt
Description: Binary data
Copyright (c) 1992-2005 The FreeBSD Project.
Copyright (c) 1979, 1980, 1983, 1986, 1988, 1989, 1991, 1992, 1993, 1994
        The Regents of the University of California. All rights reserved.
FreeBSD 6.0-BETA5 #1: Tue Sep 27 17:38:32 EDT 2005
    [EMAIL PROTECTED]:/usr/obj/usr/src/sys/LOCAL-DEBUG-NMI
WARNING: WITNESS option enabled, expect reduced performance.
Timecounter "i8254" frequency 1193182 Hz quality 0
CPU: Dual Core AMD Opteron(tm) Processor 275 (2190.05-MHz K8-class CPU)
  Origin = "AuthenticAMD"  Id = 0x20f12  Stepping = 2
  Features=0x178bfbff<FPU,VME,DE,PSE,TSC,MSR,PAE,MCE,CX8,APIC,SEP,MTRR,PGE,MCA,CMOV,PAT,PSE36,CLFLUSH,MMX,FXSR,SSE,SSE2,HTT>
  Features2=0x1<SSE3>
  AMD Features=0xe2500800<SYSCALL,NX,MMX+,<b25>,LM,3DNow+,3DNow>
  Hyperthreading: 2 logical CPUs
real memory = 3942580224 (3759 MB)
avail memory = 3807399936 (3631 MB)
ACPI APIC Table: <A M I OEMAPIC >
FreeBSD/SMP: Multiprocessor System Detected: 4 CPUs
 cpu0 (BSP): APIC ID: 0
 cpu1 (AP): APIC ID: 1
 cpu2 (AP): APIC ID: 2
 cpu3 (AP): APIC ID: 3
MADT: Forcing active-low polarity and level trigger for SCI
ioapic0 <Version 1.1> irqs 0-23 on motherboard
ioapic1 <Version 1.1> irqs 24-27 on motherboard
ioapic2 <Version 1.1> irqs 28-31 on motherboard
acpi0: <A M I OEMRSDT> on motherboard
acpi0: Power Button (fixed)
pci_link0: <ACPI PCI Link LNKA> irq 10 on acpi0
pci_link1: <ACPI PCI Link LNKB> irq 5 on acpi0
pci_link2: <ACPI PCI Link LNKC> irq 11 on acpi0
pci_link3: <ACPI PCI Link LNKD> irq 9 on acpi0
Timecounter "ACPI-fast" frequency 3579545 Hz quality 1000
acpi_timer0: <24-bit timer at 3.579545MHz> port 0x1008-0x100b on acpi0
cpu0: <ACPI CPU> on acpi0
acpi_throttle0: <ACPI CPU Throttling> on cpu0
cpu1: <ACPI CPU> on acpi0
cpu2: <ACPI CPU> on acpi0
cpu3: <ACPI CPU> on acpi0
pcib0: <ACPI Host-PCI bridge> port 0xcf8-0xcff on acpi0
pci0: <ACPI PCI bus> on pcib0
pcib1: <ACPI PCI-PCI bridge> at device 6.0 on pci0
pci3: <ACPI PCI bus> on pcib1
ohci0: <OHCI (generic) USB controller> mem 0xfeafc000-0xfeafcfff irq 19 at device 0.0 on pci3
ohci0: [GIANT-LOCKED]
usb0: OHCI version 1.0, legacy support
usb0: <OHCI (generic) USB controller> on ohci0
usb0: USB revision 1.0
uhub0: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub0: 3 ports with 3 removable, self powered
ohci1: <OHCI (generic) USB controller> mem 0xfeafd000-0xfeafdfff irq 19 at device 0.1 on pci3
ohci1: [GIANT-LOCKED]
usb1: OHCI version 1.0, legacy support
usb1: <OHCI (generic) USB controller> on ohci1
usb1: USB revision 1.0
uhub1: AMD OHCI root hub, class 9/0, rev 1.00/1.00, addr 1
uhub1: 3 ports with 3 removable, self powered
pci3: <display, VGA> at device 6.0 (no driver attached)
fxp0: <Intel 82551 Pro/100 Ethernet> port 0xbc00-0xbc3f mem 0xfeafb000-0xfeafbfff,0xfeaa0000-0xfeabffff irq 18 at device 8.0 on pci3
miibus0: <MII bus> on fxp0
inphy0: <i82555 10/100 media interface> on miibus0
inphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, auto
fxp0: Ethernet address: 00:e0:81:31:89:1c
isab0: <PCI-ISA bridge> at device 7.0 on pci0
isa0: <ISA bus> on isab0
atapci0: <AMD 8111 UDMA133 controller> port 0x1f0-0x1f7,0x3f6,0x170-0x177,0x376,0xffa0-0xffaf at device 7.1 on pci0
ata0: <ATA channel 0> on atapci0
ata1: <ATA channel 1> on atapci0
pci0: <serial bus, SMBus> at device 7.2 (no driver attached)
pci0: <bridge> at device 7.3 (no driver attached)
pcib2: <ACPI PCI-PCI bridge> at device 10.0 on pci0
pci2: <ACPI PCI bus> on pcib2
em0: <Intel(R) PRO/1000 Network Connection, Version - 2.1.7> port 0x8880-0x88bf mem 0xfc8c0000-0xfc8dffff,0xfc800000-0xfc83ffff irq 26 at device 2.0 on pci2
em0: Ethernet address: 00:04:23:ba:d0:42
em0: Speed:N/A Duplex:N/A
em1: <Intel(R) PRO/1000 Network Connection, Version - 2.1.7> port 0x8c00-0x8c3f mem 0xfc8e0000-0xfc8fffff,0xfc880000-0xfc8bffff irq 27 at device 2.1 on pci2
em1: Ethernet address: 00:04:23:ba:d0:43
em1: Speed:N/A Duplex:N/A
em2: <Intel(R) PRO/1000 Network Connection, Version - 2.1.7> port 0x8480-0x84bf mem 0xfc780000-0xfc79ffff,0xfc740000-0xfc77ffff irq 27 at device 3.0 on pci2
em2: Ethernet address: 00:04:23:ae:60:0a
em2: Speed:1000 Mbps Duplex:Full
em3: <Intel(R) PRO/1000 Network Connection, Version - 2.1.7> port 0x8800-0x883f mem 0xfc7a0000-0xfc7bffff irq 24 at device 3.1 on pci2
em3: Ethernet address: 00:04:23:ae:60:0b
em3: Speed:1000 Mbps Duplex:Full
bge0: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2003> mem 0xfc6c0000-0xfc6cffff,0xfc6b0000-0xfc6bffff irq 24 at device 9.0 on pci2
miibus1: <MII bus> on bge0
brgphy0: <BCM5704 10/100/1000baseTX PHY> on miibus1
brgphy0: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
bge0: Ethernet address: 00:e0:81:31:8f:80
bge1: <Broadcom BCM5704C Dual Gigabit Ethernet, ASIC rev. 0x2003> mem 0xfc6f0000-0xfc6fffff,0xfc6e0000-0xfc6effff irq 25 at device 9.1 on pci2
miibus2: <MII bus> on bge1
brgphy1: <BCM5704 10/100/1000baseTX PHY> on miibus2
brgphy1: 10baseT, 10baseT-FDX, 100baseTX, 100baseTX-FDX, 1000baseTX, 1000baseTX-FDX, auto
bge1: Ethernet address: 00:e0:81:31:8f:81
pci0: <base peripheral, interrupt controller> at device 10.1 (no driver attached)
pcib3: <ACPI PCI-PCI bridge> at device 11.0 on pci0
pci1: <ACPI PCI bus> on pcib3
aac0: <Adaptec SCSI RAID 2230S> mem 0xfb800000-0xfbffffff,0xf0000000-0xf7ffffff irq 28 at device 3.0 on pci1
aac0: [FAST]
aac0: Enabling 64-bit address support
aacp0: <SCSI Passthrough Bus> on aac0
aacp1: <SCSI Passthrough Bus> on aac0
pci0: <base peripheral, interrupt controller> at device 11.1 (no driver attached)
acpi_button0: <Power Button> on acpi0
atkbdc0: <Keyboard controller (i8042)> port 0x60,0x64 irq 1 on acpi0
atkbd0: <AT Keyboard> irq 1 on atkbdc0
kbd0 at atkbd0
atkbd0: [GIANT-LOCKED]
sio0: <16550A-compatible COM port> port 0x3f8-0x3ff irq 4 flags 0x10 on acpi0
sio0: type 16550A
sio1: <16550A-compatible COM port> port 0x2f8-0x2ff irq 3 on acpi0
sio1: type 16550A
fdc0: <floppy drive controller (FDE)> port 0x3f0-0x3f5,0x3f7 irq 6 drq 2 on acpi0
fdc0: [FAST]
fd0: <1440-KB 3.5" drive> on fdc0 drive 0
ppc0: <Standard parallel printer port> port 0x378-0x37f irq 7 on acpi0
ppc0: Generic chipset (NIBBLE-only) in COMPATIBLE mode
ppbus0: <Parallel port bus> on ppc0
lpt0: <Printer> on ppbus0
lpt0: Interrupt-driven port
ppi0: <Parallel I/O> on ppbus0
orm0: <ISA Option ROMs> at iomem 0xc0000-0xc7fff,0xc8000-0xcc7ff,0xcc800-0xcd7ff,0xcd800-0xcefff,0xcf000-0xd07ff,0xd0800-0xd17ff on isa0
sc0: <System console> at flags 0x100 on isa0
sc0: VGA <16 virtual consoles, flags=0x300>
vga0: <Generic ISA VGA> at port 0x3c0-0x3df iomem 0xa0000-0xbffff on isa0
Timecounters tick every 1.000 msec
ipfw2 (+ipv6) initialized, divert loadable, rule-based forwarding disabled, default to deny, logging unlimited
acd0: CDROM <SONY CD-ROM CDU5215/7YS1> at ata1-master UDMA33
aacd0: <RAID 5> on aac0
aacd0: 209922MB (429920256 sectors)
(probe9:aacp0:0:10:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe9:aacp0:0:10:0): ILLEGAL REQUEST asc:55,3
(probe9:aacp0:0:10:0): Reserved ASC/ASCQ pair
(probe0:aacp0:0:0:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe0:aacp0:0:0:0): ILLEGAL REQUEST asc:55,3
(probe0:aacp0:0:0:0): Reserved ASC/ASCQ pair
(probe1:aacp0:0:1:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe1:aacp0:0:1:0): ILLEGAL REQUEST asc:55,3
(probe1:aacp0:0:1:0): Reserved ASC/ASCQ pair
(probe2:aacp0:0:2:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe2:aacp0:0:2:0): ILLEGAL REQUEST asc:55,3
(probe2:aacp0:0:2:0): Reserved ASC/ASCQ pair
(probe3:aacp0:0:3:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe3:aacp0:0:3:0): ILLEGAL REQUEST asc:55,3
(probe3:aacp0:0:3:0): Reserved ASC/ASCQ pair
(probe4:aacp0:0:4:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe4:aacp0:0:4:0): ILLEGAL REQUEST asc:55,3
(probe4:aacp0:0:4:0): Reserved ASC/ASCQ pair
(probe5:aacp0:0:5:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe5:aacp0:0:5:0): ILLEGAL REQUEST asc:55,3
(probe5:aacp0:0:5:0): Reserved ASC/ASCQ pair
(probe6:aacp0:0:6:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe6:aacp0:0:6:0): ILLEGAL REQUEST asc:55,3
(probe6:aacp0:0:6:0): Reserved ASC/ASCQ pair
(probe7:aacp0:0:8:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe7:aacp0:0:8:0): ILLEGAL REQUEST asc:55,3
(probe7:aacp0:0:8:0): Reserved ASC/ASCQ pair
(probe8:aacp0:0:9:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe8:aacp0:0:9:0): ILLEGAL REQUEST asc:55,3
(probe8:aacp0:0:9:0): Reserved ASC/ASCQ pair
(probe10:aacp0:0:11:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe10:aacp0:0:11:0): ILLEGAL REQUEST asc:55,3
(probe10:aacp0:0:11:0): Reserved ASC/ASCQ pair
(probe11:aacp0:0:12:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe11:aacp0:0:12:0): ILLEGAL REQUEST asc:55,3
(probe11:aacp0:0:12:0): Reserved ASC/ASCQ pair
(probe12:aacp0:0:13:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe12:aacp0:0:13:0): ILLEGAL REQUEST asc:55,3
(probe12:aacp0:0:13:0): Reserved ASC/ASCQ pair
(probe13:aacp0:0:14:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe13:aacp0:0:14:0): ILLEGAL REQUEST asc:55,3
(probe13:aacp0:0:14:0): Reserved ASC/ASCQ pair
(probe14:aacp0:0:15:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe14:aacp0:0:15:0): ILLEGAL REQUEST asc:55,3
(probe14:aacp0:0:15:0): Reserved ASC/ASCQ pair
(probe15:aacp1:0:0:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe15:aacp1:0:0:0): ILLEGAL REQUEST asc:55,3
(probe15:aacp1:0:0:0): Reserved ASC/ASCQ pair
(probe16:aacp1:0:1:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe16:aacp1:0:1:0): ILLEGAL REQUEST asc:55,3
(probe16:aacp1:0:1:0): Reserved ASC/ASCQ pair
(probe17:aacp1:0:2:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe17:aacp1:0:2:0): ILLEGAL REQUEST asc:55,3
(probe17:aacp1:0:2:0): Reserved ASC/ASCQ pair
(probe18:aacp1:0:3:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe18:aacp1:0:3:0): ILLEGAL REQUEST asc:55,3
(probe18:aacp1:0:3:0): Reserved ASC/ASCQ pair
(probe19:aacp1:0:4:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe19:aacp1:0:4:0): ILLEGAL REQUEST asc:55,3
(probe19:aacp1:0:4:0): Reserved ASC/ASCQ pair
(probe20:aacp1:0:5:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe20:aacp1:0:5:0): ILLEGAL REQUEST asc:55,3
(probe20:aacp1:0:5:0): Reserved ASC/ASCQ pair
(probe21:aacp1:0:6:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe21:aacp1:0:6:0): ILLEGAL REQUEST asc:55,3
(probe21:aacp1:0:6:0): Reserved ASC/ASCQ pair
(probe22:aacp1:0:8:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe22:aacp1:0:8:0): ILLEGAL REQUEST asc:55,3
(probe22:aacp1:0:8:0): Reserved ASC/ASCQ pair
(probe23:aacp1:0:9:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe23:aacp1:0:9:0): ILLEGAL REQUEST asc:55,3
(probe23:aacp1:0:9:0): Reserved ASC/ASCQ pair
(probe24:aacp1:0:10:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe24:aacp1:0:10:0): ILLEGAL REQUEST asc:55,3
(probe24:aacp1:0:10:0): Reserved ASC/ASCQ pair
(probe25:aacp1:0:11:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe25:aacp1:0:11:0): ILLEGAL REQUEST asc:55,3
(probe25:aacp1:0:11:0): Reserved ASC/ASCQ pair
(probe26:aacp1:0:12:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe26:aacp1:0:12:0): ILLEGAL REQUEST asc:55,3
(probe26:aacp1:0:12:0): Reserved ASC/ASCQ pair
(probe27:aacp1:0:13:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe27:aacp1:0:13:0): ILLEGAL REQUEST asc:55,3
(probe27:aacp1:0:13:0): Reserved ASC/ASCQ pair
(probe28:aacp1:0:14:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe28:aacp1:0:14:0): ILLEGAL REQUEST asc:55,3
(probe28:aacp1:0:14:0): Reserved ASC/ASCQ pair
(probe29:aacp1:0:15:0): INQUIRY. CDB: 12 0 0 0 24 0
(probe29:aacp1:0:15:0): ILLEGAL REQUEST asc:55,3
(probe29:aacp1:0:15:0): Reserved ASC/ASCQ pair
SMP: AP CPU #1 Launched!
SMP: AP CPU #2 Launched!
SMP: AP CPU #3 Launched!
Trying to mount root from ufs:/dev/aacd0s1a
_______________________________________________
freebsd-hackers@freebsd.org mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"