Hi Daniel, I don't have much light to shed on your bug, except that I've got something similar without the nvidia kernel taint.
You wrote: > XFS > LVM > dm_crypt > MD (RAID1) > { SATA AHCI, IDE PIIX } I'm running: ext3 loopback > ext3 > LVM > aacraid scsi. The bug was triggered while I was executing "invoke-rc.d vz stop" and simultaneously copying from the underlying ext3 to the ext3-loopback file. In every case the Call Traces end at: system_call_after_swapgs+0x8a/0x8f I've been trying to 'kill -9' all of the blocked processes, and that gives me, for example: [3594730.862500] BUG: soft lockup - CPU#1 stuck for 61s! [apache2:4576] [3594730.862500] Modules linked in: vzethdev vznetdev simfs vzrst vzcpt tun vzmon xt_length ipt_ttl xt_tcpmss xt_TCPMSS iptable_mangle iptable_filter xt_multiport xt_limit xt_dscp ipt_REJECT xt_tcpudp iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack vzdquota vzdev ip_tables x_tables ipv6 loop snd_pcm snd_timer snd soundcore snd_page_alloc parport_pc parport pcspkr psmouse serio_raw k8temp i2c_amd8111 amd_rng rng_core i2c_amd756 i2c_core button shpchp pci_hotplug evdev ext3 jbd mbcache dm_mirror dm_log dm_snapshot dm_mod ide_cd_mod cdrom ide_pci_generic amd74xx ide_core sd_mod floppy ata_generic libata dock ohci_hcd tg3 aacraid scsi_mod thermal processor fan thermal_sys [last unloaded: simfs] [3594730.862500] CPU 1: [3594730.862500] Modules linked in: vzethdev vznetdev simfs vzrst vzcpt tun vzmon xt_length ipt_ttl xt_tcpmss xt_TCPMSS iptable_mangle iptable_filter xt_multiport xt_limit xt_dscp ipt_REJECT xt_tcpudp iptable_nat nf_nat nf_conntrack_ipv4 nf_conntrack vzdquota vzdev ip_tables x_tables ipv6 loop snd_pcm snd_timer snd soundcore snd_page_alloc parport_pc parport pcspkr psmouse serio_raw k8temp i2c_amd8111 amd_rng rng_core i2c_amd756 i2c_core button shpchp pci_hotplug evdev ext3 jbd mbcache dm_mirror dm_log dm_snapshot dm_mod ide_cd_mod cdrom ide_pci_generic amd74xx ide_core sd_mod floppy ata_generic libata dock ohci_hcd tg3 aacraid scsi_mod thermal processor fan thermal_sys [last unloaded: simfs] [3594730.862500] Pid: 4576, comm: apache2 Not tainted 2.6.26-1-openvz-amd64 #1 036test001 [3594730.862500] RIP: 0010:[<ffffffff804238de>] [<ffffffff804238de>] _spin_lock+0xc/0x15 [3594730.862500] RSP: 0018:ffff810003367d10 EFLAGS: 00000293 [3594730.862500] RAX: 0000000000001614 RBX: ffff81007e8e34a8 RCX: 0000000000000000 [3594730.862500] RDX: ffffe200004e2968 RSI: 0000000000000002 RDI: ffff81007f5a67e0 [3594730.862500] RBP: 0000000000000246 R08: 0000000000000008 R09: ffff810001101700 [3594730.862500] R10: 0000000000000002 R11: 0000000000000000 R12: ffff810000010f80 [3594730.862500] R13: ffffe200005612c0 R14: ffffffff8027a4ce R15: 0000000000000004 [3594730.862500] FS: 00007f6e32f0f750(0000) GS:ffff81007f5a6a40(0000) knlGS:0000000000000000 [3594730.862500] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b [3594730.862500] CR2: 000000000183c1c8 CR3: 00000000549b5000 CR4: 00000000000006e0 [3594730.862500] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [3594730.862500] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 [3594730.862500] [3594730.862500] Call Trace: [3594730.862500] [<ffffffff802971d1>] ? shmem_free_blocks+0x27/0x42 [3594730.862500] [<ffffffff80297a41>] ? shmem_truncate_range+0x763/0x808 [3594730.862500] [<ffffffff80299a55>] ? shmem_delete_inode+0x65/0xde [3594730.862500] [<ffffffff802999f0>] ? shmem_delete_inode+0x0/0xde [3594730.862500] [<ffffffff802b398c>] ? generic_delete_inode+0xa3/0x115 [3594730.862500] [<ffffffff802b0777>] ? d_kill+0x38/0x59 [3594730.862500] [<ffffffff802b1964>] ? dput+0x119/0x14f [3594730.862500] [<ffffffff802a1d70>] ? __fput+0x14f/0x178 [3594730.862500] [<ffffffff80288913>] ? remove_vma+0x53/0x88 [3594730.862500] [<ffffffff80289633>] ? do_munmap+0x205/0x227 [3594730.862500] [<ffffffff80423766>] ? __down_write_nested+0x12/0xa1 [3594730.862500] [<ffffffff80289695>] ? sys_munmap+0x40/0x5a [3594730.862500] [<ffffffff8020bffa>] ? system_call_after_swapgs+0x8a/0x8f [3594730.862500] Things are continuing to block, e.g.: [3594580.540114] INFO: task sshd:23585 blocked for more than 120 seconds. [3594580.540182] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message. [3594580.540283] sshd D ffff81013d92d7d0 0 23585 15642 [3594580.540288] ffff8101207a1bb0 0000000000000086 0000000000000000 0000000000000002 [3594580.540294] ffff81013d92d7d0 ffff81007f5b2810 ffff81013d92da58 00000003bd68fcf8 [3594580.540300] 0000000000000002 0000000000000000 00000000ffffffff 0000000000000000 [3594580.540305] Call Trace: [3594580.540329] [<ffffffff80358ae3>] mix_pool_bytes_extract+0x5c/0x155 [3594580.540338] [<ffffffff80422bd7>] __mutex_lock_slowpath+0x64/0x9b [3594580.540346] [<ffffffff80422a3c>] mutex_lock+0xa/0xb [3594580.540352] [<ffffffff803b97b3>] rtnetlink_rcv+0x9/0x1e [3594580.540357] [<ffffffff803c7a46>] netlink_unicast+0x215/0x28d [3594580.540362] [<ffffffff803aaa0b>] __alloc_skb+0x8d/0x153 [3594580.540370] [<ffffffff803c8240>] netlink_sendmsg+0x25b/0x26e [3594580.540382] [<ffffffff803a46ae>] sock_sendmsg+0xcb/0xe3 [3594580.540393] [<ffffffff80247be5>] autoremove_wake_function+0x0/0x2e [3594580.540405] [<ffffffff8022a8b9>] __wake_up+0x38/0x4f [3594580.540412] [<ffffffff803c7141>] netlink_insert+0x118/0x127 [3594580.540420] [<ffffffff803a50aa>] sys_sendto+0xf3/0x127 [3594580.540427] [<ffffffff803a5273>] move_addr_to_user+0x5d/0x78 [3594580.540434] [<ffffffff803a5715>] sys_getsockname+0x72/0xa2 [3594580.540440] [<ffffffff802b154f>] d_instantiate+0x52/0x5d [3594580.540453] [<ffffffff8020bffa>] system_call_after_swapgs+0x8a/0x8f I would be happy to provide any other information, but I'm not sure what would be useful at this point! Regards, Tom -- -- Tom Rathborne <Bootsy> Tommer needs a revision that takes care of itself. -- To UNSUBSCRIBE, email to debian-kernel-requ...@lists.debian.org with a subject of "unsubscribe". Trouble? Contact listmas...@lists.debian.org