Public bug reported: User reports a kernel panic under load:
[2503476.606215] kernel BUG at /build/linux-hVVhWi/linux-4.4.0/fs/ext4/inode.c:1894! [2503476.606236] invalid opcode: 0000 [#1] SMP [2503476.606249] Modules linked in: veth ipt_MASQUERADE nf_nat_masquerade_ipv4 nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype xt_conntrack nf_nat nf_conntrack br_netfilter bridge stp llc overlay xt_multiport iptable_filter ip_tables x_tables nv_peer_mem(OE) cachefiles fscache msr rdma_ucm(OE) ib_ucm(OE) rdma_cm(OE) iw_cm(OE) configfs ib_ipoib(OE) ib_cm(OE) ib_uverbs(OE) ib_umad(OE) mlx5_ib(OE) mlx4_ib(OE) ib_core(OE) mlx4_en(OE) mlx4_core(OE) ipmi_ssif mxm_wmi intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd joydev input_leds sb_edac edac_core nvidia_uvm(POE) lpc_ich mei_me mei shpchp [2503476.606483] 8250_fintek ipmi_si acpi_power_meter wmi mac_hid ipmi_devintf ipmi_msghandler knem(OE) sunrpc autofs4 nvidia_drm(POE) nvidia_modeset(POE) ses enclosure ast ixgbe i2c_algo_bit dca ttm nvidia(POE) drm_kms_helper syscopyarea hid_generic vxlan sysfillrect sysimgblt ip6_udp_tunnel mlx5_core(OE) usbhid udp_tunnel mlx_compat(OE) megaraid_sas fb_sys_fops hid ahci ptp libahci drm pps_core mdio fjes [2503476.606608] CPU: 23 PID: 27629 Comm: kworker/u162:3 Tainted: P OE 4.4.0-92-generic #115-Ubuntu [2503476.606632] Hardware name: NVIDIA DGX-1 with V100/DGX-1 with V100, BIOS S2W_3A04 08/29/2017 [2503476.606659] Workqueue: writeback wb_workfn (flush-8:0) [2503476.606675] task: ffff884b48124600 ti: ffff887a04a58000 task.ti: ffff887a04a58000 [2503476.606695] RIP: 0010:[<ffffffff8129ef7c>] [<ffffffff8129ef7c>] ext4_writepage+0x2ec/0x540 [2503476.606721] RSP: 0018:ffff887a04a5b858 EFLAGS: 00010246 [2503476.606735] RAX: 0501ef000000016d RBX: 0000000000001000 RCX: 0000000000000038 [2503476.606754] RDX: ffff8827720d3490 RSI: ffff887a04a5bbd8 RDI: ffffea010f9c1c00 [2503476.606772] RBP: ffff887a04a5b8c0 R08: 000000000001a8c0 R09: 0000000000000002 [2503476.606790] R10: ffff88807fff8000 R11: 0000000000000033 R12: ffff8827720d3328 [2503476.606808] R13: ffff887a04a5bbd8 R14: ffffea010f9c1c00 R15: ffffea010f9c1c00 [2503476.606828] FS: 0000000000000000(0000) GS:ffff887f7ecc0000(0000) knlGS:0000000000000000 [2503476.606848] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [2503476.606864] CR2: 00007f4fb11430b0 CR3: 0000000002e0a000 CR4: 00000000003406e0 [2503476.606882] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [2503476.606916] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 [2503476.606935] Stack: [2503476.606943] ffffffff811ceba3 000000008118f139 ffff887a04a5b864 ffffffff811cd450 [2503476.606969] 0000000000000000 0000000000000000 0000000000000246 63a00f9eaf8e9dde [2503476.606991] ffff8827720d3490 ffff887a04a5bbd8 ffff8827720d3490 ffffea010f9c1c00 [2503476.607013] Call Trace: [2503476.607025] [<ffffffff811ceba3>] ? page_mkclean+0x73/0xa0 [2503476.607043] [<ffffffff811cd450>] ? page_referenced_one+0x1a0/0x1a0 [2503476.607062] [<ffffffff8129f1e2>] __writepage+0x12/0x30 [2503476.607080] [<ffffffff8119b28e>] write_cache_pages+0x1ee/0x510 [2503476.607097] [<ffffffff8129f1d0>] ? ext4_writepage+0x540/0x540 [2503476.607114] [<ffffffff8129fdc5>] ext4_writepages+0x195/0xd30 [2503476.607132] [<ffffffff8119b60b>] ? generic_writepages+0x5b/0x80 [2503476.607149] [<ffffffff8119da9e>] do_writepages+0x1e/0x30 [2503476.607165] [<ffffffff8123e585>] __writeback_single_inode+0x45/0x340 [2503476.607201] [<ffffffff8123ed92>] writeback_sb_inodes+0x262/0x600 [2503476.607218] [<ffffffff8123f1bc>] __writeback_inodes_wb+0x8c/0xc0 [2503476.607236] [<ffffffff8123f513>] wb_writeback+0x253/0x310 [2503476.607251] [<ffffffff8123fced>] wb_workfn+0x24d/0x400 [2503476.607268] [<ffffffff8109a625>] process_one_work+0x165/0x480 [2503476.607285] [<ffffffff8109a98b>] worker_thread+0x4b/0x4c0 [2503476.607300] [<ffffffff8109a940>] ? process_one_work+0x480/0x480 [2503476.607318] [<ffffffff810a0cc5>] kthread+0xe5/0x100 [2503476.607332] [<ffffffff810a0be0>] ? kthread_create_on_node+0x1e0/0x1e0 [2503476.607352] [<ffffffff8184238f>] ret_from_fork+0x3f/0x70 [2503476.607368] [<ffffffff810a0be0>] ? kthread_create_on_node+0x1e0/0x1e0 [2503476.607385] Code: 74 2b 49 8b 94 24 68 ff ff ff 80 e6 40 74 07 a9 00 00 00 08 74 17 25 00 08 00 00 3d 00 08 00 00 0f 84 aa fd ff ff e8 15 ed 08 00 <0f> 0b 49 8b 84 24 68 ff ff ff f6 c4 08 0f 85 92 fd ff ff e9 79 [2503476.607525] RIP [<ffffffff8129ef7c>] ext4_writepage+0x2ec/0x540 [2503476.607543] RSP <ffff887a04a5b858> [2503476.613569] ---[ end trace 03d35738081084c6 ]--- [2503476.690872] BUG: unable to handle kernel paging request at ffffffffffffffd8 [2503476.690921] IP: [<ffffffff810a1370>] kthread_data+0x10/0x20 [2503476.690943] PGD 2e0d067 PUD 2e0f067 PMD 0 [2503476.690962] Oops: 0000 [#2] SMP [2503476.690975] Modules linked in: veth ipt_MASQUERADE nf_nat_masquerade_ipv4 nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype xt_conntrack nf_nat nf_conntrack br_netfilter bridge stp llc overlay xt_multiport iptable_filter ip_tables x_tables nv_peer_mem(OE) cachefiles fscache msr rdma_ucm(OE) ib_ucm(OE) rdma_cm(OE) iw_cm(OE) configfs ib_ipoib(OE) ib_cm(OE) ib_uverbs(OE) ib_umad(OE) mlx5_ib(OE) mlx4_ib(OE) ib_core(OE) mlx4_en(OE) mlx4_core(OE) ipmi_ssif mxm_wmi intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd joydev input_leds sb_edac edac_core nvidia_uvm(POE) lpc_ich mei_me mei shpchp [2503476.691252] 8250_fintek ipmi_si acpi_power_meter wmi mac_hid ipmi_devintf ipmi_msghandler knem(OE) sunrpc autofs4 nvidia_drm(POE) nvidia_modeset(POE) ses enclosure ast ixgbe i2c_algo_bit dca ttm nvidia(POE) drm_kms_helper syscopyarea hid_generic vxlan sysfillrect sysimgblt ip6_udp_tunnel mlx5_core(OE) usbhid udp_tunnel mlx_compat(OE) megaraid_sas fb_sys_fops hid ahci ptp libahci drm pps_core mdio fjes [2503476.692726] CPU: 23 PID: 27629 Comm: kworker/u162:3 Tainted: P D OE 4.4.0-92-generic #115-Ubuntu [2503476.693996] Hardware name: NVIDIA DGX-1 with V100/DGX-1 with V100, BIOS S2W_3A04 08/29/2017 [2503476.695119] task: ffff884b48124600 ti: ffff887a04a58000 task.ti: ffff887a04a58000 [2503476.696215] RIP: 0010:[<ffffffff810a1370>] [<ffffffff810a1370>] kthread_data+0x10/0x20 [2503476.697313] RSP: 0018:ffff887a04a5b528 EFLAGS: 00010002 [2503476.698404] RAX: 0000000000000000 RBX: 0000000000000017 RCX: ffffffff82109e80 [2503476.699529] RDX: 0000000000000017 RSI: 0000000000000017 RDI: ffff884b48124600 [2503476.700569] RBP: ffff887a04a5b528 R08: 00000000ffffffff R09: 0000000000000000 [2503476.701511] R10: ffff884b48124660 R11: 0000000000006c00 R12: 0000000000000000 [2503476.702419] R13: 0000000000016dc0 R14: ffff884b48124600 R15: ffff887f7ecd6dc0 [2503476.703346] FS: 0000000000000000(0000) GS:ffff887f7ecc0000(0000) knlGS:0000000000000000 [2503476.704206] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 [2503476.705054] CR2: 0000000000000028 CR3: 0000000002e0a000 CR4: 00000000003406e0 [2503476.705898] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 [2503476.706732] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400 [2503476.707606] Stack: [2503476.708423] ffff887a04a5b540 ffffffff8109b9c1 ffff887f7ecd6dc0 ffff887a04a5b590 [2503476.709253] ffffffff8183dab0 ffff887f60547eb0 ffff887a00000017 ffff884b48124600 [2503476.710079] ffff887a04a5c000 ffff884b48124cd0 ffff887a04a5b140 0000000000000000 [2503476.710922] Call Trace: [2503476.711769] [<ffffffff8109b9c1>] wq_worker_sleeping+0x11/0x90 [2503476.712594] [<ffffffff8183dab0>] __schedule+0x650/0xa30 [2503476.713414] [<ffffffff8183dec5>] schedule+0x35/0x80 [2503476.714228] [<ffffffff81084465>] do_exit+0x775/0xb00 [2503476.715061] [<ffffffff81031c41>] oops_end+0xa1/0xd0 [2503476.715883] [<ffffffff810320fb>] die+0x4b/0x70 [2503476.716681] [<ffffffff8102f121>] do_trap+0xb1/0x140 [2503476.717481] [<ffffffff8102f4a9>] do_error_trap+0x89/0x110 [2503476.718280] [<ffffffff8129ef7c>] ? ext4_writepage+0x2ec/0x540 [2503476.719099] [<ffffffff812eda1d>] ? do_get_write_access+0x38d/0x490 [2503476.719909] [<ffffffff810c3d94>] ? __wake_up+0x44/0x50 [2503476.720690] [<ffffffff8102fa10>] do_invalid_op+0x20/0x30 [2503476.721468] [<ffffffff81843b0e>] invalid_op+0x1e/0x30 [2503476.722241] [<ffffffff8129ef7c>] ? ext4_writepage+0x2ec/0x540 [2503476.723044] [<ffffffff811ceba3>] ? page_mkclean+0x73/0xa0 [2503476.723847] [<ffffffff811cd450>] ? page_referenced_one+0x1a0/0x1a0 [2503476.724621] [<ffffffff8129f1e2>] __writepage+0x12/0x30 [2503476.725396] [<ffffffff8119b28e>] write_cache_pages+0x1ee/0x510 [2503476.726172] [<ffffffff8129f1d0>] ? ext4_writepage+0x540/0x540 [2503476.726967] [<ffffffff8129fdc5>] ext4_writepages+0x195/0xd30 [2503476.727770] [<ffffffff8119b60b>] ? generic_writepages+0x5b/0x80 [2503476.728545] [<ffffffff8119da9e>] do_writepages+0x1e/0x30 [2503476.729318] [<ffffffff8123e585>] __writeback_single_inode+0x45/0x340 [2503476.730093] [<ffffffff8123ed92>] writeback_sb_inodes+0x262/0x600 [2503476.730870] [<ffffffff8123f1bc>] __writeback_inodes_wb+0x8c/0xc0 [2503476.731696] [<ffffffff8123f513>] wb_writeback+0x253/0x310 [2503476.732472] [<ffffffff8123fced>] wb_workfn+0x24d/0x400 [2503476.733248] [<ffffffff8109a625>] process_one_work+0x165/0x480 [2503476.734026] [<ffffffff8109a98b>] worker_thread+0x4b/0x4c0 [2503476.734801] [<ffffffff8109a940>] ? process_one_work+0x480/0x480 [2503476.735691] [<ffffffff810a0cc5>] kthread+0xe5/0x100 [2503476.736464] [<ffffffff810a0be0>] ? kthread_create_on_node+0x1e0/0x1e0 [2503476.737240] [<ffffffff8184238f>] ret_from_fork+0x3f/0x70 [2503476.737991] [<ffffffff810a0be0>] ? kthread_create_on_node+0x1e0/0x1e0 [2503476.738724] Code: ff ff ff be 49 02 00 00 48 c7 c7 98 b9 cb 81 e8 e7 00 fe ff e9 a6 fe ff ff 66 90 0f 1f 44 00 00 48 8b 87 18 05 00 00 55 48 89 e5 <48> 8b 40 d8 5d c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 [2503476.739609] RIP [<ffffffff810a1370>] kthread_data+0x10/0x20 [2503476.740384] RSP <ffff887a04a5b528> [2503476.741149] CR2: ffffffffffffffd8 [2503476.741909] ---[ end trace 03d35738081084c7 ]--- ** Affects: linux (Ubuntu) Importance: Undecided Assignee: Dragan S. (dragan-s) Status: New ** Changed in: linux (Ubuntu) Assignee: (unassigned) => Dragan S. (dragan-s) -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/1756197 Title: Kernel panic BUG_ON in 4.4.0-92-generic fs/ext4/inode.c:1894! To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/1756197/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs