Package: kernel-image-2.6.8-1-k7 Version: 2.6.8-3 Severity: normal I'm having problems with Debian's 2.6.8 k7 kernel (from kernel-image-2.6.8-1-k7 version 2.6.8-3 from unstable) on my Athlon 800. The system is running Debian sarge, with a very few packages from unstable. One repeatable way to trigger a kernel bug is with bonnie++. The problem happens every time, even on a freshly booted system. My filesystems are reiserfs, except /usr/src which is ext3. I've seen some funny things with reiserfs, but not as repeatable as this. I recently backed off my BIOS settings from fast memory speed to normal, but this doesn't seem to have changed stability any. The system already passed memtest86+ many times, so it's not likely a hardware problem (I have a really solid power supply and good cooling, too.)
The bug: After just rebooting the system and logging in on tty1, in /usr/local/src/tmp (a newly created empty directory), I ran: bonnie++ -f -s 0 -n 20:70000:1024 ... Delete files in sequential order...Can't delete file 0004984MLu This time, I got an oops. Other times, bonnie tries to clean up, but gets an IO error. The disk wasn't full; Even with 20*1024*70000bytes, that's only 1.3GB. The filesystem has 5.5GB free. What's left is: ls -lR /usr/local/src/tmp /usr/local/src/tmp/: Bonnie.3594 /usr/local/src/tmp/Bonnie.3594: 0009041S6y9Qz It left just one file the other times as well. rm -rf has no problem with it... However, I do have a bash process stuck in disk sleep trying to echo to a file in /usr/local/src/tmp. (I wanted to record the error message, but I'm not running gpm, so I was going to echo it to a file...) Anyway, if anyone wants to send me vmlinux image for 2.6.8-1-k7 debian revision 3, I can spare a couple hours to poke around with gdb vmlinux /proc/kcore. This machine is at home, so I don't need to put it into "production" too soon. I've stopped cron, so it won't kill itself overnight. /dev/mapper/big-src 7224824 837628 5653196 13% /usr/src /usr/src/local 7224824 837628 5653196 13% /usr/local/src My entire kernel log (oops at the bottom) is: Linux version 2.6.8-1-k7 ([EMAIL PROTECTED]) (gcc version 3.3.4 (Debian 1:3.3.4-11)) #1 Mon Sep 13 23:56:00 EDT 2004 BIOS-provided physical RAM map: BIOS-e820: 0000000000000000 - 000000000009fc00 (usable) BIOS-e820: 000000000009fc00 - 00000000000a0000 (reserved) BIOS-e820: 00000000000ec000 - 0000000000100000 (reserved) BIOS-e820: 0000000000100000 - 000000002fff0000 (usable) BIOS-e820: 000000002fff0000 - 000000002fff8000 (ACPI data) BIOS-e820: 000000002fff8000 - 0000000030000000 (ACPI NVS) BIOS-e820: 00000000ffff0000 - 0000000100000000 (reserved) 0MB HIGHMEM available. 767MB LOWMEM available. On node 0 totalpages: 196592 DMA zone: 4096 pages, LIFO batch:1 Normal zone: 192496 pages, LIFO batch:16 HighMem zone: 0 pages, LIFO batch:1 DMI 2.3 present. ACPI: RSDP (v000 AMI ) @ 0x000fac10 ACPI: RSDT (v001 AMIINT VIA_K7 0x00000010 MSFT 0x00000097) @ 0x2fff0000 ACPI: FADT (v001 AMIINT VIA_K7 0x00000011 MSFT 0x00000097) @ 0x2fff0030 ACPI: DSDT (v001 VIA VT8371 0x00001000 MSFT 0x0100000b) @ 0x00000000 ACPI: PM-Timer IO Port: 0x808 Built 1 zonelists Kernel command line: root=/dev/md0 ro Local APIC disabled by BIOS -- reenabling. Found and enabled local APIC! Initializing CPU#0 PID hash table entries: 4096 (order 12: 32768 bytes) Detected 802.274 MHz processor. Using pmtmr for high-res timesource Console: colour VGA+ 80x25 Dentry cache hash table entries: 131072 (order: 7, 524288 bytes) Inode-cache hash table entries: 65536 (order: 6, 262144 bytes) Memory: 771268k/786368k available (1527k kernel code, 14356k reserved, 694k data, 148k init, 0k highmem) Checking if this processor honours the WP bit even in supervisor mode... Ok. Calibrating delay loop... 1585.15 BogoMIPS Security Scaffold v1.0.0 initialized Mount-cache hash table entries: 512 (order: 0, 4096 bytes) CPU: After generic identify, caps: 0183fbff c1c7fbff 00000000 00000000 CPU: After vendor identify, caps: 0183fbff c1c7fbff 00000000 00000000 CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line) CPU: L2 Cache: 256K (64 bytes/line) CPU: After all inits, caps: 0183fbff c1c7fbff 00000000 00000020 Intel machine check architecture supported. Intel machine check reporting enabled on CPU#0. CPU: AMD Athlon(tm) Processor stepping 02 Enabling fast FPU save and restore... done. Checking 'hlt' instruction... OK. ACPI: IRQ9 SCI: Level Trigger. enabled ExtINT on CPU#0 ESR value before enabling vector: 00000000 ESR value after enabling vector: 00000000 Using local APIC timer interrupts. calibrating APIC timer ... ..... CPU clock speed is 801.0734 MHz. ..... host bus clock speed is 200.0433 MHz. checking if image is initramfs...it isn't (ungzip failed); looks like an initrd Freeing initrd memory: 4680k freed NET: Registered protocol family 16 PCI: PCI BIOS revision 2.10 entry at 0xfdb61, last bus=1 PCI: Using configuration type 1 mtrr: v2.0 (20020519) ACPI: Subsystem revision 20040326 spurious 8259A interrupt: IRQ7. ACPI: Interpreter enabled ACPI: Using PIC for interrupt routing ACPI: PCI Root Bridge [PCI0] (00:00) PCI: Probing PCI hardware (bus 00) ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT] ACPI: Power Resource [URP1] (off) ACPI: Power Resource [URP2] (off) ACPI: Power Resource [FDDP] (off) ACPI: Power Resource [LPTP] (off) ACPI: PCI Interrupt Link [LNKA] (IRQs 3 4 5 6 7 9 10 *11 12 14 15) ACPI: PCI Interrupt Link [LNKB] (IRQs 3 4 *5 6 7 9 10 11 12 14 15) ACPI: PCI Interrupt Link [LNKC] (IRQs 3 4 5 6 7 9 *10 11 12 14 15) ACPI: PCI Interrupt Link [LNKD] (IRQs 3 4 5 6 7 *9 10 11 12 14 15) Linux Plug and Play Support v0.97 (c) Adam Belay PnPBIOS: Scanning system for PnP BIOS support... PnPBIOS: Found PnP BIOS installation structure at 0xc00f7520 PnPBIOS: PnP BIOS version 1.0, entry 0xf0000:0x62d4, dseg 0xf0000 PnPBIOS: 12 nodes reported by PnP BIOS; 12 recorded by driver PCI: Using ACPI for IRQ routing ACPI: PCI Interrupt Link [LNKD] enabled at IRQ 9 ACPI: PCI interrupt 0000:00:07.2[D] -> GSI 9 (level, low) -> IRQ 9 ACPI: PCI interrupt 0000:00:07.3[D] -> GSI 9 (level, low) -> IRQ 9 ACPI: PCI Interrupt Link [LNKC] enabled at IRQ 10 ACPI: PCI interrupt 0000:00:07.5[C] -> GSI 10 (level, low) -> IRQ 10 ACPI: PCI interrupt 0000:00:0a.0[A] -> GSI 10 (level, low) -> IRQ 10 ACPI: PCI Interrupt Link [LNKB] enabled at IRQ 5 ACPI: PCI interrupt 0000:00:0d.0[A] -> GSI 5 (level, low) -> IRQ 5 ACPI: PCI Interrupt Link [LNKA] enabled at IRQ 11 ACPI: PCI interrupt 0000:01:00.0[A] -> GSI 11 (level, low) -> IRQ 11 VFS: Disk quotas dquot_6.5.1 Dquot-cache hash table entries: 1024 (order 0, 4096 bytes) devfs: 2004-01-31 Richard Gooch ([EMAIL PROTECTED]) devfs: boot_options: 0x0 Initializing Cryptographic API PCI: Disabling Via external APIC routing isapnp: Scanning for PnP cards... isapnp: No Plug & Play device found Serial: 8250/16550 driver $Revision: 1.90 $ 48 ports, IRQ sharing enabled ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A RAMDISK driver initialized: 16 RAM disks of 8192K size 1024 blocksize serio: i8042 AUX port at 0x60,0x64 irq 12 serio: i8042 KBD port at 0x60,0x64 irq 1 input: AT Translated Set 2 keyboard on isa0060/serio0 NET: Registered protocol family 2 IP: routing cache hash table of 8192 buckets, 64Kbytes TCP: Hash tables configured (established 262144 bind 65536) NET: Registered protocol family 8 NET: Registered protocol family 20 ACPI: (supports S0 S1 S4 S5) RAMDISK: cramfs filesystem found at block 0 RAMDISK: Loading 4680 blocks [1 disk] into ram disk... |/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|/-\|done. VFS: Mounted root (cramfs filesystem) readonly. Freeing unused kernel memory: 148k freed vesafb: probe of vesafb0 failed with error -6 NET: Registered protocol family 1 md: md driver 0.90.0 MAX_MD_DEVS=256, MD_SB_DISKS=27 md: raid1 personality registered as nr 3 Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2 ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx VP_IDE: IDE controller at PCI slot 0000:00:07.1 VP_IDE: chipset revision 16 VP_IDE: not 100% native mode: will probe irqs later VP_IDE: VIA vt82c686a (rev 22) IDE UDMA66 controller on pci0000:00:07.1 ide0: BM-DMA at 0xffa0-0xffa7, BIOS settings: hda:pio, hdb:pio ide1: BM-DMA at 0xffa8-0xffaf, BIOS settings: hdc:DMA, hdd:pio hdc: ATAPI CD-ROM DRIVE 40X MAXIMUM, ATAPI CD/DVD-ROM drive Using anticipatory io scheduler ide1 at 0x170-0x177,0x376 on irq 15 PDC20268: IDE controller at PCI slot 0000:00:0a.0 ACPI: PCI interrupt 0000:00:0a.0[A] -> GSI 10 (level, low) -> IRQ 10 PDC20268: chipset revision 2 PDC20268: ROM enabled at 0xdffe0000 PDC20268: 100% native mode on irq 10 ide2: BM-DMA at 0xcc00-0xcc07, BIOS settings: hde:pio, hdf:pio ide3: BM-DMA at 0xcc08-0xcc0f, BIOS settings: hdg:pio, hdh:pio hde: WDC WD1600JB-00FUA0, ATA DISK drive ide2 at 0xdc00-0xdc07,0xd802 on irq 10 hde: max request size: 1024KiB hde: 312581808 sectors (160041 MB) w/8192KiB Cache, CHS=19457/255/63, UDMA(100) /dev/ide/host2/bus0/target0/lun0: p1 p2 p3 < p5 p6 > p4 hdg: WDC WD1600JB-00EVA0, ATA DISK drive ide3 at 0xd400-0xd407,0xd002 on irq 10 hdg: max request size: 1024KiB hdg: 312581808 sectors (160041 MB) w/8192KiB Cache, CHS=19457/255/63, UDMA(100) /dev/ide/host2/bus1/target0/lun0: p1 p2 p3 < p5 p6 > md: md1 stopped. md: bind<hdg2> md: bind<hde2> raid1: raid set md1 active with 2 out of 2 mirrors md: md0 stopped. md: bind<hdg1> md: bind<hde1> raid1: raid set md0 active with 2 out of 2 mirrors VFS: Can't find ext3 filesystem on dev md0. ReiserFS: md0: found reiserfs format "3.6" with standard journal ReiserFS: md0: using ordered data mode ReiserFS: md0: journal params: device md0, size 8192, journal first block 18, max trans len 1024, max batch 900, max commit age 30, max trans age 30 ReiserFS: md0: checking transaction log (md0) ReiserFS: md0: Using r5 hash to sort names Adding 979832k swap on /dev/md1. Priority:-1 extents:1 Real Time Clock Driver v1.12 Capability LSM initialized device-mapper: 4.1.0-ioctl (2003-12-10) initialised: [EMAIL PROTECTED] md: md2 stopped. md: bind<hdg5> md: bind<hde5> raid1: raid set md2 active with 2 out of 2 mirrors ReiserFS: dm-3: found reiserfs format "3.6" with standard journal ReiserFS: dm-3: using ordered data mode ReiserFS: dm-3: journal params: device dm-3, size 8192, journal first block 18, max trans len 1024, max batch 900, max commit age 30, max trans age 30 ReiserFS: dm-3: checking transaction log (dm-3) ReiserFS: dm-3: Using r5 hash to sort names ReiserFS: dm-4: found reiserfs format "3.6" with standard journal ReiserFS: dm-4: using ordered data mode ReiserFS: dm-4: journal params: device dm-4, size 8192, journal first block 18, max trans len 1024, max batch 900, max commit age 30, max trans age 30 ReiserFS: dm-4: checking transaction log (dm-4) ReiserFS: dm-4: Using r5 hash to sort names ReiserFS: dm-0: found reiserfs format "3.6" with standard journal ReiserFS: dm-0: using ordered data mode ReiserFS: dm-0: journal params: device dm-0, size 8192, journal first block 18, max trans len 1024, max batch 900, max commit age 30, max trans age 30 ReiserFS: dm-0: checking transaction log (dm-0) ReiserFS: dm-0: Using r5 hash to sort names kjournald starting. Commit interval 5 seconds EXT3 FS on dm-1, internal journal EXT3-fs: mounted filesystem with writeback data mode. kjournald starting. Commit interval 5 seconds EXT3 FS on dm-2, internal journal EXT3-fs: mounted filesystem with ordered data mode. input: PC Speaker parport0: PC-style at 0x378, irq 7 [PCSPP,TRISTATE,EPP] input: PS/2 Generic Mouse on isa0060/serio1 ts: Compaq touchscreen protocol output mice: PS/2 mouse device common for all mice Linux agpgart interface v0.100 (c) Dave Jones agpgart: Detected VIA Twister-K/KT133x/KM133 chipset agpgart: Maximum main memory to use for agp memory: 690M agpgart: AGP aperture is 64M @ 0xe0000000 cpci_hotplug: CompactPCI Hot Plug Core version: 0.2 pci_hotplug: PCI Hot Plug PCI Core version: 0.5 shpchp: shpc_init : shpc_cap_offset == 0 shpchp: Standard Hot Plug PCI Controller Driver version: 0.4 pciehp: PCI Express Hot Plug Controller Driver version: 0.4 usbcore: registered new driver usbfs usbcore: registered new driver hub USB Universal Host Controller Interface driver v2.2 ACPI: PCI interrupt 0000:00:07.2[D] -> GSI 9 (level, low) -> IRQ 9 uhci_hcd 0000:00:07.2: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller uhci_hcd 0000:00:07.2: irq 9, io base 0000b800 uhci_hcd 0000:00:07.2: new USB bus registered, assigned bus number 1 hub 1-0:1.0: USB hub found hub 1-0:1.0: 2 ports detected ACPI: PCI interrupt 0000:00:07.3[D] -> GSI 9 (level, low) -> IRQ 9 uhci_hcd 0000:00:07.3: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (#2) uhci_hcd 0000:00:07.3: irq 9, io base 0000bc00 uhci_hcd 0000:00:07.3: new USB bus registered, assigned bus number 2 hub 2-0:1.0: USB hub found hub 2-0:1.0: 2 ports detected ACPI: PCI interrupt 0000:00:07.5[C] -> GSI 10 (level, low) -> IRQ 10 PCI: Setting latency timer of device 0000:00:07.5 to 64 Linux Tulip driver version 1.1.13 (May 11, 2002) ACPI: PCI interrupt 0000:00:0d.0[A] -> GSI 5 (level, low) -> IRQ 5 tulip0: MII transceiver #1 config 1000 status 786d advertising 01e1. eth0: ADMtek Comet rev 17 at 0xb400, 00:03:6D:11:34:4B, IRQ 5. NET: Registered protocol family 17 eth0: Setting full-duplex based on MII#1 link partner capability of 45e1. nfs warning: mount version older than kernel nfs warning: mount version older than kernel nfs warning: mount version older than kernel nfs warning: mount version older than kernel nfs warning: mount version older than kernel nfs warning: mount version older than kernel NET: Registered protocol family 10 Disabled Privacy Extensions on device c02f8de0(lo) IPv6 over IPv4 tunneling driver Installing knfsd (copyright (C) 1996 [EMAIL PROTECTED]). hdc: ATAPI 40X CD-ROM drive, 128kB Cache, UDMA(33) Uniform CD-ROM driver Revision: 3.20 hdc: drive_cmd: status=0x58 { DriveReady SeekComplete DataRequest } hdc: drive_cmd: error=0x00 hdc: drive_cmd: status=0x51 { DriveReady SeekComplete Error } hdc: drive_cmd: error=0x04Aborted Command SCSI subsystem initialized Unable to handle kernel NULL pointer dereference at virtual address 00000004 printing eip: c016ba40 *pde = 00000000 Oops: 0002 [#1] PREEMPT Modules linked in: sd_mod scsi_mod ide_cd cdrom nfsd exportfs ipv6 nfs lockd sunrpc af_packet tulip crc32 snd_via82xx snd_ac97_codec snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd_page_alloc gameport snd_mpu401_uart snd_rawmidi snd_seq_device snd soundcore uhci_hcd usbcore pciehp shpchp pci_hotplug via_agp agpgart mousedev tsdev psmouse parport_pc parport pcspkr evdev dm_mod capability commoncap rtc reiserfs ext3 jbd mbcache ide_generic ide_disk pdc202xx_new via82cxxx ide_core raid1 md unix font vesafb cfbcopyarea cfbimgblt cfbfillrect CPU: 0 EIP: 0060:[<c016ba40>] Not tainted EFLAGS: 00010207 (2.6.8-1-k7) EIP is at select_parent+0x40/0xb0 eax: 00000000 ebx: d820ce04 ecx: d820ce24 edx: 00000000 esi: 00000000 edi: d829cdfc ebp: d829ce2c esp: eade5f18 ds: 007b es: 007b ss: 0068 Process bonnie++ (pid: 3066, threadinfo=eade4000 task=eb2fe290) Stack: 00003216 eab93234 eab93234 ed3120d4 eade5f70 c016bac0 eab93234 eade4000 c01637b8 eab93234 eab93234 eab93234 c01638bb eab93234 00000003 00000000 eab93234 eade2000 eab93234 c0163b16 ed3120d4 eab93234 eae0b674 efb2b680 Call Trace: [<c016bac0>] shrink_dcache_parent+0x10/0x30 [<c01637b8>] d_unhash+0x38/0xa0 [<c01638bb>] vfs_rmdir+0x9b/0x210 [<c0163b16>] sys_rmdir+0xe6/0x100 [<c011cf3d>] profile_hook+0x2d/0x4b [<c01060b5>] sysenter_past_esp+0x52/0x71 Code: 89 50 04 89 02 89 49 04 89 4b 20 8b 03 85 c0 75 1b a1 40 b1 <6>note: bonnie++[3066] exited with preempt_count 1 bad: scheduling while atomic! [<c027c67c>] schedule+0x47c/0x490 [<c01437eb>] unmap_page_range+0x4b/0x80 [<c01439cc>] unmap_vmas+0x1ac/0x1c0 [<c0147c2b>] exit_mmap+0x8b/0x170 [<c011a044>] mmput+0x64/0x90 [<c011e67c>] do_exit+0x15c/0x420 [<c0116cb0>] do_page_fault+0x0/0x549 [<c01072a8>] die+0xf8/0x100 [<c0116e8e>] do_page_fault+0x1de/0x549 [<c01187d9>] scheduler_tick+0x159/0x420 [<f0869c3c>] do_get_write_access+0x25c/0x630 [jbd] [<c0123f06>] update_process_times+0x46/0x60 [<c0123d6b>] update_wall_time+0xb/0x40 [<c01241df>] do_timer+0xdf/0xf0 [<c011fffd>] __do_softirq+0x7d/0x80 [<c01087ed>] do_IRQ+0xfd/0x130 [<c0116cb0>] do_page_fault+0x0/0x549 [<c0106ba9>] error_code+0x2d/0x38 [<c016ba40>] select_parent+0x40/0xb0 [<c016bac0>] shrink_dcache_parent+0x10/0x30 [<c01637b8>] d_unhash+0x38/0xa0 [<c01638bb>] vfs_rmdir+0x9b/0x210 [<c0163b16>] sys_rmdir+0xe6/0x100 [<c011cf3d>] profile_hook+0x2d/0x4b [<c01060b5>] sysenter_past_esp+0x52/0x71 /usr/src is an ext3 filesystem mounted with data=writeback on top of a striped logical volume (striped between two physical volumes on hde6 and hdg6). My two hard drives are connected to separate cables of a Promise IDE PCI card. (The controller on the mobo doesn't do ATA100). /usr/local/src is a bind mount to /usr/src/local. $ mount /dev/md0 on / type reiserfs (rw) proc on /proc type proc (rw) sysfs on /sys type sysfs (rw) devpts on /dev/pts type devpts (rw,gid=5,mode=620) tmpfs on /dev/shm type tmpfs (rw) /dev/mapper/redundant-usr on /usr type reiserfs (rw,noatime) /dev/mapper/redundant-home on /home type reiserfs (rw) /dev/mapper/big-cache on /var/cache type reiserfs (rw,noatime,notail) /dev/mapper/big-src on /usr/src type ext3 (rw,noatime,data=writeback) /usr/src/local on /usr/local/src type none (rw,bind) /dev/mapper/big-stuff on /stuff type ext3 (rw,noatime,data=ordered) usbfs on /proc/bus/usb type usbfs (rw) 10.0.0.1:/home on /net/llama/home type nfs (rw,nfsvers=3,intr,addr=10.0.0.1) 10.0.0.1:/var/cache/apt/archives on /var/cache/apt/archives type nfs (rw,nfsvers=3,intr,addr=10.0.0.1) 10.0.0.1:/pub on /mnt/pub type nfs (rw,nfsvers=3,intr,addr=10.0.0.1) 10.0.0.1:/stuff on /mnt/stuff type nfs (rw,nfsvers=3,intr,addr=10.0.0.1) 10.0.0.1:/mnt/raid0 on /mnt/raid0 type nfs (rw,nfsvers=3,intr,addr=10.0.0.1) 10.0.0.1:/mnt/large on /mnt/large type nfs (rw,nfsvers=3,intr,addr=10.0.0.1) (This machine will replace my fileserver at 10.0.0.1 once I get it stable.) Running bonnie++ in /var/cache/tmp worked. It didn't have problems deleting its files. I have seen other oopses such as: (with kernel-image-2.6.8-1-k7 2.6.8-2) While running debsums to verify some packages: (The kernel might have already oopsed at this point, so things might already be messed up...). Sep 19 20:44:46 holly kernel: Unable to handle kernel paging request at virtual address f2888510 Sep 19 20:44:46 holly kernel: printing eip: Sep 19 20:44:46 holly kernel: c016c1f0 Sep 19 20:44:46 holly kernel: *pde = 00000000 Sep 19 20:44:46 holly kernel: Oops: 0000 [#1] Sep 19 20:44:46 holly kernel: PREEMPT Sep 19 20:44:46 holly kernel: Modules linked in: snd_seq_midi snd_seq_oss snd_seq_midi_event snd_seq mga sd_mod scsi_mod ide_cd cdrom nfsd exportfs ipv6 nfs lockd sunrpc af_packet tulip crc32 snd_via82xx snd_ac97_codec snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd_page_alloc gameport snd_mpu401_uart snd_rawmidi snd_seq_device snd soundcore uhci_hcd usbcore pciehp shpchp pci_hotplug via_agp agpgart mousedev tsdev psmouse parport_pc parport pcspkr evdev dm_mod capability commoncap rtc ext3 jbd mbcache reiserfs ide_generic ide_disk pdc202xx_new via82cxxx ide_core raid1 md unix font vesafb cfbcopyarea cfbimgblt cfbfillrect Sep 19 20:44:46 holly kernel: CPU: 0 Sep 19 20:44:46 holly kernel: EIP: 0060:[__d_lookup+128/368] Not tainted Sep 19 20:44:46 holly kernel: EFLAGS: 00010282 (2.6.8-1-k7) Sep 19 20:44:46 holly kernel: EIP is at __d_lookup+0x80/0x170 Sep 19 20:44:46 holly kernel: eax: f2888510 ebx: f2888510 ecx: 00000011 edx: c162a140 Sep 19 20:44:46 holly kernel: esi: c1691a90 edi: 2f2f2963 ebp: daa145c8 esp: d0cfde40 Sep 19 20:44:46 holly kernel: ds: 007b es: 007b ss: 0068 Sep 19 20:44:46 holly kernel: Process debsums (pid: 30278, threadinfo=d0cfc000 task=eb03e7a0) Sep 19 20:44:46 holly kernel: Stack: 00000001 eca53b00 00000001 d0cfc000 2f2f2963 000017a1 00000000 c1691a90 Sep 19 20:44:46 holly kernel: cfee0031 0000000a d0cfded4 d0cfdf60 efb2cc20 d0cfded4 c0161380 da9f030c Sep 19 20:44:46 holly kernel: d0cfdedc da9f0654 d0cfded4 cfee0031 da9f524c d0cfdf60 c0161acd d0cfdf60 Sep 19 20:44:46 holly kernel: Call Trace: Sep 19 20:44:46 holly kernel: [do_lookup+48/176] do_lookup+0x30/0xb0 Sep 19 20:44:46 holly kernel: [link_path_walk+1741/3488] link_path_walk+0x6cd/0xda0 Sep 19 20:44:46 holly kernel: [file_read_actor+0/240] file_read_actor+0x0/0xf0 Sep 19 20:44:46 holly kernel: [copy_to_user+62/80] copy_to_user+0x3e/0x50 Sep 19 20:44:46 holly kernel: [path_lookup+126/320] path_lookup+0x7e/0x140 Sep 19 20:44:46 holly kernel: [open_namei+133/1520] open_namei+0x85/0x5f0 Sep 19 20:44:46 holly kernel: [filp_open+62/112] filp_open+0x3e/0x70 Sep 19 20:44:46 holly kernel: [__crc_lookup_hash+9758196/10153217] reiserfs_file_release+0x0/0x470 [reiserfs] Sep 19 20:44:46 holly kernel: [get_unused_fd+57/224] get_unused_fd+0x39/0xe0 Sep 19 20:44:46 holly kernel: [sys_open+91/144] sys_open+0x5b/0x90 Sep 19 20:44:46 holly kernel: [sysenter_past_esp+82/113] sysenter_past_esp+0x52/0x71 Sep 19 20:44:46 holly kernel: Code: 8b 03 8d 44 20 00 8d 6b a0 8b 7c 24 10 39 7d 14 74 2a 8b 1b Sep 19 20:44:46 holly kernel: <6>note: debsums[30278] exited with preempt_count 1 Sep 19 20:44:46 holly kernel: bad: scheduling while atomic! Sep 19 20:44:46 holly kernel: [schedule+1148/1168] schedule+0x47c/0x490 Sep 19 20:44:46 holly kernel: [unmap_page_range+75/128] unmap_page_range+0x4b/0x80 Sep 19 20:44:46 holly kernel: [unmap_vmas+428/448] unmap_vmas+0x1ac/0x1c0 Sep 19 20:44:46 holly kernel: [exit_mmap+139/368] exit_mmap+0x8b/0x170 Sep 19 20:44:46 holly kernel: [mmput+100/144] mmput+0x64/0x90 Sep 19 20:44:46 holly kernel: [do_exit+348/1056] do_exit+0x15c/0x420 Sep 19 20:44:46 holly kernel: [do_page_fault+0/1353] do_page_fault+0x0/0x549 Sep 19 20:44:46 holly kernel: [die+248/256] die+0xf8/0x100 Sep 19 20:44:46 holly kernel: [do_page_fault+478/1353] do_page_fault+0x1de/0x549 Sep 19 20:44:46 holly kernel: [schedule+654/1168] schedule+0x28e/0x490 Sep 19 20:44:46 holly kernel: [__crc_lookup_hash+9512908/10153217] dm_table_unplug_all+0x48/0x4a [dm_mod] Sep 19 20:44:46 holly kernel: [__lock_page+217/240] __lock_page+0xd9/0xf0 Sep 19 20:44:46 holly kernel: [do_page_fault+0/1353] do_page_fault+0x0/0x549 Sep 19 20:44:46 holly kernel: [error_code+45/56] error_code+0x2d/0x38 Sep 19 20:44:46 holly kernel: [__d_lookup+128/368] __d_lookup+0x80/0x170 Sep 19 20:44:46 holly kernel: [do_lookup+48/176] do_lookup+0x30/0xb0 Sep 19 20:44:46 holly kernel: [link_path_walk+1741/3488] link_path_walk+0x6cd/0xda0 Sep 19 20:44:46 holly kernel: [file_read_actor+0/240] file_read_actor+0x0/0xf0 Sep 19 20:44:46 holly kernel: [copy_to_user+62/80] copy_to_user+0x3e/0x50 Sep 19 20:44:46 holly kernel: [path_lookup+126/320] path_lookup+0x7e/0x140 Sep 19 20:44:46 holly kernel: [open_namei+133/1520] open_namei+0x85/0x5f0 Sep 19 20:44:46 holly kernel: [filp_open+62/112] filp_open+0x3e/0x70 Sep 19 20:44:46 holly kernel: [__crc_lookup_hash+9758196/10153217] reiserfs_file_release+0x0/0x470 [reiserfs] Sep 19 20:44:46 holly kernel: [get_unused_fd+57/224] get_unused_fd+0x39/0xe0 Sep 19 20:44:46 holly kernel: [sys_open+91/144] sys_open+0x5b/0x90 Sep 19 20:44:46 holly kernel: [sysenter_past_esp+82/113] sysenter_past_esp+0x52/0x71 Sep 19 20:44:46 holly kernel: bad: scheduling while atomic! Sep 19 20:44:46 holly kernel: [schedule+1148/1168] schedule+0x47c/0x490 Sep 19 20:44:46 holly kernel: [unmap_page_range+75/128] unmap_page_range+0x4b/0x80 Sep 19 20:44:46 holly kernel: [unmap_vmas+428/448] unmap_vmas+0x1ac/0x1c0 Sep 19 20:44:46 holly kernel: [exit_mmap+139/368] exit_mmap+0x8b/0x170 Sep 19 20:44:46 holly kernel: [mmput+100/144] mmput+0x64/0x90 Sep 19 20:44:46 holly kernel: [do_exit+348/1056] do_exit+0x15c/0x420 Sep 19 20:44:46 holly kernel: [do_page_fault+0/1353] do_page_fault+0x0/0x549 Sep 19 20:44:46 holly kernel: [die+248/256] die+0xf8/0x100 Sep 19 20:44:46 holly kernel: [do_page_fault+478/1353] do_page_fault+0x1de/0x549 Sep 19 20:44:46 holly kernel: [schedule+654/1168] schedule+0x28e/0x490 Sep 19 20:44:46 holly kernel: [__crc_lookup_hash+9512908/10153217] dm_table_unplug_all+0x48/0x4a [dm_mod] Sep 19 20:44:46 holly kernel: [__lock_page+217/240] __lock_page+0xd9/0xf0 Sep 19 20:44:46 holly kernel: [do_page_fault+0/1353] do_page_fault+0x0/0x549 Sep 19 20:44:46 holly kernel: [error_code+45/56] error_code+0x2d/0x38 Sep 19 20:44:46 holly kernel: [__d_lookup+128/368] __d_lookup+0x80/0x170 Sep 19 20:44:46 holly kernel: [do_lookup+48/176] do_lookup+0x30/0xb0 Sep 19 20:44:46 holly kernel: [link_path_walk+1741/3488] link_path_walk+0x6cd/0xda0 Sep 19 20:44:46 holly kernel: [file_read_actor+0/240] file_read_actor+0x0/0xf0 Sep 19 20:44:46 holly kernel: [copy_to_user+62/80] copy_to_user+0x3e/0x50 Sep 19 20:44:46 holly kernel: [path_lookup+126/320] path_lookup+0x7e/0x140 Sep 19 20:44:46 holly kernel: [open_namei+133/1520] open_namei+0x85/0x5f0 Sep 19 20:44:46 holly kernel: [filp_open+62/112] filp_open+0x3e/0x70 Sep 19 20:44:46 holly kernel: [__crc_lookup_hash+9758196/10153217] reiserfs_file_release+0x0/0x470 [reiserfs] Sep 19 20:44:46 holly kernel: [get_unused_fd+57/224] get_unused_fd+0x39/0xe0 Sep 19 20:44:46 holly kernel: [sys_open+91/144] sys_open+0x5b/0x90 Sep 19 20:44:46 holly kernel: [sysenter_past_esp+82/113] sysenter_past_esp+0x52/0x71 and Sep 22 21:38:07 holly kernel: Linux version 2.6.8-1-k7 ([EMAIL PROTECTED]) (gcc version 3.3.4 (Debian 1:3.3.4-9)) #1 Sat Aug 28 13:44:52 EDT 2004 ... (while aptitude was downloading some packages via http, to /var/cache/apt/archives, an NFS mount) Sep 22 21:57:41 holly kernel: Unable to handle kernel paging request at virtual address 081b4b90 Sep 22 21:57:41 holly kernel: printing eip: Sep 22 21:57:41 holly kernel: c01543e3 Sep 22 21:57:41 holly kernel: *pde = 118ad067 Sep 22 21:57:41 holly kernel: *pte = 00000000 Sep 22 21:57:41 holly kernel: Oops: 0000 [#1] Sep 22 21:57:41 holly kernel: PREEMPT Sep 22 21:57:41 holly kernel: Modules linked in: mga sd_mod scsi_mod ide_cd cdrom nfsd exportfs ipv6 nfs lockd sunrpc af_packet tulip crc32 snd_via82xx snd_ac97_codec snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd_page_alloc gameport snd_mpu401_uart snd_rawmidi snd_seq_device snd soundcore uhci_hcd usbcore pciehp shpchp pci_hotplug via_agp agpgart mousedev tsdev psmouse parport_pc parport pcspkr evdev dm_mod capability commoncap rtc ext3 jbd mbcache reiserfs ide_generic ide_disk pdc202xx_new via82cxxx ide_core raid1 md unix font vesafb cfbcopyarea cfbimgblt cfbfillrect Sep 22 21:57:41 holly kernel: CPU: 0 Sep 22 21:57:41 holly kernel: EIP: 0060:[__fput+19/304] Not tainted Sep 22 21:57:41 holly kernel: EFLAGS: 00010286 (2.6.8-1-k7) Sep 22 21:57:41 holly kernel: EIP is at __fput+0x13/0x130 Sep 22 21:57:41 holly kernel: eax: 081b4b88 ebx: d2ff51a0 ecx: 00000002 edx: d2ff51a0 Sep 22 21:57:41 holly kernel: esi: ee46a000 edi: 0815a11c ebp: ce2f0620 esp: ee46be90 Sep 22 21:57:41 holly kernel: ds: 007b es: 007b ss: 0068 Sep 22 21:57:41 holly kernel: Process rpciod (pid: 1748, threadinfo=ee46a000 task=eee40bf0) Sep 22 21:57:41 holly kernel: Stack: cd9ed040 ec8241e0 081b4b88 ce2f0620 ee46a000 d20615c4 ce2f0620 f0e4fe5e Sep 22 21:57:41 holly kernel: d20614c0 d20614c0 f0e53dce ce2f0620 0000142e cd9ed044 ce2f0674 cd9ed180 Sep 22 21:57:41 holly kernel: cd9ed040 f0e55c17 ce2f0620 ecffcaa0 ecffca14 01a75582 00000000 00000001 Sep 22 21:57:41 holly kernel: Call Trace: Sep 22 21:57:41 holly kernel: [__crc_acpi_get_timer_duration+120373/3573603] nfs_clear_request+0x7e/0x90 [nfs] Sep 22 21:57:41 holly kernel: [__crc_acpi_get_timer_duration+136613/3573603] nfs_inode_remove_request+0x6e/0xb0 [nfs] Sep 22 21:57:41 holly kernel: [__crc_acpi_get_timer_duration+144366/3573603] nfs_commit_done+0x157/0x260 [nfs] Sep 22 21:57:41 holly kernel: [__crc_do_softirq+3782842/4062834] __rpc_execute+0x301/0x3f0 [sunrpc] Sep 22 21:57:41 holly kernel: [schedule+654/1168] schedule+0x28e/0x490 Sep 22 21:57:41 holly kernel: [__crc_do_softirq+3783302/4062834] __rpc_schedule+0x4d/0xd0 [sunrpc] Sep 22 21:57:41 holly kernel: [__crc_do_softirq+3785662/4062834] rpciod+0xe5/0x330 [sunrpc] Sep 22 21:57:41 holly kernel: [autoremove_wake_function+0/96] autoremove_wake_function+0x0/0x60 Sep 22 21:57:41 holly kernel: [ret_from_fork+6/20] ret_from_fork+0x6/0x14 Sep 22 21:57:41 holly kernel: [autoremove_wake_function+0/96] autoremove_wake_function+0x0/0x60 Sep 22 21:57:41 holly kernel: [__crc_do_softirq+3785433/4062834] rpciod+0x0/0x330 [sunrpc] Sep 22 21:57:41 holly kernel: [kernel_thread_helper+5/20] kernel_thread_helper+0x5/0x14 Sep 22 21:57:41 holly kernel: Code: 8b 68 08 8d 83 80 00 00 00 39 83 80 00 00 00 0f 85 fb 00 00 Running a kernel I compiled from Debian's 2.6.8-5 source package, I've seen some reiserfs oopses, such as: Sep 19 23:12:40 holly kernel: Linux version 2.6.8-holly ([EMAIL PROTECTED]) (gcc version 3.3.4 (Debian 1:3.3.4-6sarge1)) #1 Mon Sep 6 16:39:34 ADT 2004 ... ran ok for a while, then, during nightly cron jobs: Sep 22 06:25:40 holly kernel: Unable to handle kernel paging request at virtual address 6967616d Sep 22 06:25:40 holly kernel: printing eip: Sep 22 06:25:40 holly kernel: c01608d5 Sep 22 06:25:40 holly kernel: *pde = 00000000 Sep 22 06:25:40 holly kernel: Oops: 0000 [#1] Sep 22 06:25:40 holly kernel: PREEMPT Sep 22 06:25:40 holly kernel: Modules linked in: 8250 serial_core mga ide_cd cdrom nfs af_packet snd_via82xx snd_ac97_codec snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd_page_alloc gameport snd_mpu401_uart snd_rawmidi snd_seq_device parport_pc parport via_agp agpgart evdev rtc Sep 22 06:25:40 holly kernel: CPU: 0 Sep 22 06:25:40 holly kernel: EIP: 0060:[find_inode+21/96] Not tainted Sep 22 06:25:40 holly kernel: EFLAGS: 00010202 (2.6.8-holly) Sep 22 06:25:40 holly kernel: EIP is at find_inode+0x15/0x60 Sep 22 06:25:40 holly kernel: eax: 6967616d ebx: 6967616d ecx: 00000000 edx: e442bd90 Sep 22 06:25:40 holly kernel: esi: e442be2c edi: effc4200 ebp: c16b59b0 esp: e442bd48 Sep 22 06:25:41 holly kernel: ds: 007b es: 007b ss: 0068 Sep 22 06:25:41 holly kernel: Process updatedb (pid: 26072, threadinfo=e442a000 task=e96813a0) Sep 22 06:25:41 holly kernel: Stack: c0183670 e442a000 e442be2c effc4200 c16b59b0 c0160e6c e442bd90 00000003 Sep 22 06:25:41 holly kernel: e442bd90 c0183560 c0183670 e442bdb8 e442be2c e442bd90 db99f6e0 c01836c7 Sep 22 06:25:41 holly kernel: c0183560 e442bd90 0000f9aa 0000f99d e442bdb8 00000001 e442bdfc c017ead3 Sep 22 06:25:41 holly kernel: Call Trace: Sep 22 06:25:41 holly kernel: [reiserfs_find_actor+0/32] reiserfs_find_actor+0x0/0x20 Sep 22 06:25:41 holly kernel: [iget5_locked+108/240] iget5_locked+0x6c/0xf0 Sep 22 06:25:41 holly kernel: [reiserfs_init_locked_inode+0/16] reiserfs_init_locked_inode+0x0/0x10 Sep 22 06:25:41 holly kernel: [reiserfs_find_actor+0/32] reiserfs_find_actor+0x0/0x20 Sep 22 06:25:41 holly kernel: [reiserfs_iget+55/160] reiserfs_iget+0x37/0xa0 Sep 22 06:25:41 holly kernel: [reiserfs_init_locked_inode+0/16] reiserfs_init_locked_inode+0x0/0x10 Sep 22 06:25:41 holly kernel: [reiserfs_lookup+323/416] reiserfs_lookup+0x143/0x1a0 Sep 22 06:25:41 holly kernel: [smp_apic_timer_interrupt+40/240] smp_apic_timer_interrupt+0x28/0xf0 Sep 22 06:25:41 holly kernel: [d_lookup+27/64] d_lookup+0x1b/0x40 Sep 22 06:25:41 holly kernel: [real_lookup+189/224] real_lookup+0xbd/0xe0 Sep 22 06:25:41 holly kernel: [do_lookup+126/144] do_lookup+0x7e/0x90 Sep 22 06:25:41 holly kernel: [link_path_walk+1725/3440] link_path_walk+0x6bd/0xd70 Sep 22 06:25:41 holly kernel: [dput+28/512] dput+0x1c/0x200 Sep 22 06:25:41 holly kernel: [copy_to_user+50/80] copy_to_user+0x32/0x50 Sep 22 06:25:41 holly kernel: [path_lookup+123/304] path_lookup+0x7b/0x130 Sep 22 06:25:41 holly kernel: [__user_walk+68/96] __user_walk+0x44/0x60 Sep 22 06:25:41 holly kernel: [vfs_lstat+26/80] vfs_lstat+0x1a/0x50 Sep 22 06:25:41 holly kernel: [sys_lstat64+18/48] sys_lstat64+0x12/0x30 Sep 22 06:25:41 holly kernel: [sysenter_past_esp+82/113] sysenter_past_esp+0x52/0x71 Sep 22 06:25:41 holly kernel: Code: 8b 03 8d 44 20 00 39 bb 8c 00 00 00 74 10 85 c0 89 c3 75 ec Sep 22 06:25:41 holly kernel: <6>note: updatedb[26072] exited with preempt_count 2 Sep 22 06:25:41 holly kernel: bad: scheduling while atomic! Sep 22 06:25:41 holly kernel: [schedule+1123/1136] schedule+0x463/0x470 Sep 22 06:25:41 holly kernel: [zap_pmd_range+63/96] zap_pmd_range+0x3f/0x60 Sep 22 06:25:41 holly kernel: [unmap_page_range+61/112] unmap_page_range+0x3d/0x70 Sep 22 06:25:41 holly kernel: [unmap_vmas+420/448] unmap_vmas+0x1a4/0x1c0 Sep 22 06:25:41 holly kernel: [exit_mmap+124/336] exit_mmap+0x7c/0x150 Sep 22 06:25:41 holly kernel: [mmput+85/112] mmput+0x55/0x70 Sep 22 06:25:41 holly kernel: [do_exit+332/1024] do_exit+0x14c/0x400 Sep 22 06:25:41 holly kernel: [die+241/256] die+0xf1/0x100 Sep 22 06:25:41 holly kernel: [do_page_fault+0/1279] do_page_fault+0x0/0x4ff Sep 22 06:25:41 holly kernel: [do_page_fault+439/1279] do_page_fault+0x1b7/0x4ff Sep 22 06:25:41 holly kernel: [__alloc_pages+705/768] __alloc_pages+0x2c1/0x300 Sep 22 06:25:41 holly kernel: [bh_lru_install+155/208] bh_lru_install+0x9b/0xd0 Sep 22 06:25:41 holly kernel: [pathrelse+32/48] pathrelse+0x20/0x30 Sep 22 06:25:41 holly kernel: [init_inode+423/976] init_inode+0x1a7/0x3d0 Sep 22 06:25:41 holly kernel: [do_page_fault+0/1279] do_page_fault+0x0/0x4ff Sep 22 06:25:41 holly kernel: [error_code+45/56] error_code+0x2d/0x38 Sep 22 06:25:41 holly kernel: [find_inode+21/96] find_inode+0x15/0x60 Sep 22 06:25:41 holly kernel: [reiserfs_find_actor+0/32] reiserfs_find_actor+0x0/0x20 Sep 22 06:25:41 holly kernel: [iget5_locked+108/240] iget5_locked+0x6c/0xf0 Sep 22 06:25:41 holly kernel: [reiserfs_init_locked_inode+0/16] reiserfs_init_locked_inode+0x0/0x10 Sep 22 06:25:41 holly kernel: [reiserfs_find_actor+0/32] reiserfs_find_actor+0x0/0x20 Sep 22 06:25:41 holly kernel: [reiserfs_iget+55/160] reiserfs_iget+0x37/0xa0 Sep 22 06:25:41 holly kernel: [reiserfs_init_locked_inode+0/16] reiserfs_init_locked_inode+0x0/0x10 Sep 22 06:25:41 holly kernel: [reiserfs_lookup+323/416] reiserfs_lookup+0x143/0x1a0 Sep 22 06:25:41 holly kernel: [smp_apic_timer_interrupt+40/240] smp_apic_timer_interrupt+0x28/0xf0 Sep 22 06:25:41 holly kernel: [d_lookup+27/64] d_lookup+0x1b/0x40 Sep 22 06:25:41 holly kernel: [real_lookup+189/224] real_lookup+0xbd/0xe0 Sep 22 06:25:41 holly kernel: [do_lookup+126/144] do_lookup+0x7e/0x90 Sep 22 06:25:41 holly kernel: [link_path_walk+1725/3440] link_path_walk+0x6bd/0xd70 Sep 22 06:25:41 holly kernel: [dput+28/512] dput+0x1c/0x200 Sep 22 06:25:41 holly kernel: [copy_to_user+50/80] copy_to_user+0x32/0x50 Sep 22 06:25:41 holly kernel: [path_lookup+123/304] path_lookup+0x7b/0x130 Sep 22 06:25:41 holly kernel: [__user_walk+68/96] __user_walk+0x44/0x60 Sep 22 06:25:41 holly kernel: [vfs_lstat+26/80] vfs_lstat+0x1a/0x50 Sep 22 06:25:41 holly kernel: [sys_lstat64+18/48] sys_lstat64+0x12/0x30 Sep 22 06:25:41 holly kernel: [sysenter_past_esp+82/113] sysenter_past_esp+0x52/0x71 Other oopses I've seen recently often seem to involve lstat64 and reiserfs. Maybe it's just that that's what updatedb does a lot of... 2.6.8-holly was compiled with CONFIG_REGPARM, the experimental parameter passing in registers option. Sorry about the size of this report, but I've been getting frustrated with all the oopses, and I wanted to get them off my chest. Until recently, Linux hasn't caused me any problems for years. If you would like me to report separate bugs for different oopses, I can do that. But like I said, this machine is in a weird state right now, and I'm willing to poke around with gdb if there's anything anyone wants to know in particular. Thanks for making Debian so good, -- System Information: Debian Release: 3.1 APT prefers testing APT policy: (990, 'testing'), (500, 'unstable') Architecture: i386 (i686) Kernel: Linux 2.6.8-1-k7 Locale: LANG=C, LC_CTYPE=C Versions of packages kernel-image-2.6.8-1-k7 depends on: ii coreutils [fileutils] 5.2.1-2 The GNU core utilities ii initrd-tools 0.1.74 tools to create initrd image for p ii module-init-tools 3.1-pre5-7 tools for managing Linux kernel mo -- no debconf information