2 nights ago something happened. I have a 5th server which doesn't use LVM snapshots, but still it's backed up every night. I didn't install the mainline kernel on this machine, because we thought the problem was due to snapshots. And it crashed two nights ago, in the same way the other machines crashed, without using snapshots. It crashed after finishing the backup. I think the problem lies in the disconnection of the iscsi volume on which the machines do the backup. On the first reboot, I had the following in the kernel log:
[ 209.312097] ------------[ cut here ]------------ [ 209.312106] WARNING: CPU: 0 PID: 1808 at /build/buildd/linux-3.13.0/drivers/pci/pci.c:1444 pci_disable_device+0x9c/0xb0() [ 209.312108] ipmi_si 0000:01:04.6: disabling already-disabled device [ 209.312110] Modules linked in: ib_iser rdma_cm iw_cm ib_cm ib_sa ib_mad ib_core ib_addr iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi sit tunnel4 ip_tunnel dm_crypt gpio_ich coretemp kvm joydev serio_raw hpilo lpc_ich ipmi_si(-) i3200_edac shpchp edac_core mac_hid lp parport reiserfs raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq raid0 multipath linear raid1 hid_generic radeon i2c_algo_bit ttm drm_kms_helper psmouse drm pata_acpi tg3 usbhid hid ptp pps_core [ 209.312151] CPU: 0 PID: 1808 Comm: modprobe Not tainted 3.13.0-53-generic #89-Ubuntu [ 209.312153] Hardware name: HP ProLiant DL320 G5p, BIOS W05 04/03/2008 [ 209.312154] 0000000000000009 ffff88007abb3d40 ffffffff81722e1e ffff88007abb3d88 [ 209.312158] ffff88007abb3d78 ffffffff810677fd ffff88007c311000 ffff88007c2c5580 [ 209.312161] ffff88007c311000 00007f459c5473f0 00007ffd2b319338 ffff88007abb3dd8 [ 209.312164] Call Trace: [ 209.312170] [<ffffffff81722e1e>] dump_stack+0x45/0x56 [ 209.312175] [<ffffffff810677fd>] warn_slowpath_common+0x7d/0xa0 [ 209.312177] [<ffffffff8106786c>] warn_slowpath_fmt+0x4c/0x50 [ 209.312182] [<ffffffff811a259d>] ? kfree+0xfd/0x140 [ 209.312186] [<ffffffff813a9c7c>] pci_disable_device+0x9c/0xb0 [ 209.312192] [<ffffffffa0398059>] ipmi_pci_remove+0x29/0x30 [ipmi_si] [ 209.312195] [<ffffffff813ac68b>] pci_device_remove+0x3b/0xb0 [ 209.312200] [<ffffffff81498c3f>] __device_release_driver+0x7f/0xf0 [ 209.312203] [<ffffffff81499608>] driver_detach+0xb8/0xc0 [ 209.312207] [<ffffffff81498875>] bus_remove_driver+0x55/0xd0 [ 209.312210] [<ffffffff81499c7c>] driver_unregister+0x2c/0x50 [ 209.312213] [<ffffffff813ab179>] pci_unregister_driver+0x29/0x90 [ 209.312218] [<ffffffffa03984c4>] cleanup_ipmi_si+0xd4/0xf0 [ipmi_si] [ 209.312222] [<ffffffff810e05d2>] SyS_delete_module+0x162/0x200 [ 209.312227] [<ffffffff81013ed7>] ? do_notify_resume+0x97/0xb0 [ 209.312231] [<ffffffff8173391d>] system_call_fastpath+0x1a/0x1f [ 209.312233] ---[ end trace f6143eeb3c0e8dba ]--- I don't know if this is related, anyway it happened only on this particular reboot. I now installed the mainline kernel also on this machine. I changed the title of this bug to reflect the additional information I got. ** Summary changed: - System hangs apparently randomly when creating LVM snapshots + System hangs apparently randomly when disconnecting iScsi volumes -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1449910 Title: System hangs apparently randomly when disconnecting iScsi volumes To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1449910/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs