Hello, I had recently a general protection fault on a Debian 8 server with Xen (debian pacakge: 4.4.4lts4-0+deb8u1) on the vif50.1-q1-guest kernel proces. I have copied the kernel log below in this mail for reference. After this GPF the system was still responding but one domU lost network connectivity and all the others where still working properly. I decided to power-off and power-on the system as a soft GPF renders the system in an unstable state.
Now I am trying to find out what is most likely the cause of this general protection fault in order to avoid that again in the future and would like your opinion on that: - is this maybe a bug in the Debian kernel I am using? - a bug in the Xen package used by Debian 8? - a hardware issue? - if it is a hardware issue, what is most likely? RAM? CPU? - anything else I am missing? Note that the hardware is enterprise grade hardware and that the BIOS has been updated to the latest available version.The CPUs (dual CPU) are Intel Xeon E5-2640 v3 @ 2.60GHz. Thank you for your input. Best regards, John [Wed May 6 14:48:02 2020] general protection fault: 0000 [#1] SMP [Wed May 6 14:48:02 2020] Modules linked in: xt_physdev iptable_filter ip_tables x_tables xen_netback xen_blkback hmac binfmt_misc xen_gntdev xen_evtchn xenfs xen_privcmd nfsd auth_rpcgss oid_registry nfs_acl nfs lockd fscache sunrpc bridge bonding iTCO_wdt iTCO_vendor_support mxm_wmi zfs(PO) zunicode(PO) x86_pkg_temp_thermal intel_powerclamp zcommon(PO) intel_rapl znvpair(PO) spl(O) coretemp crc32_pclmul zavl(PO) aesni_intel pcspkr aes_x86_64 lrw gf128mul glue_helper ablk_helper cryptd ast ttm drm_kms_helper evdev joydev drm lpc_ich mfd_core i2c_algo_bit mei_me mei shpchp tpm_tis tpm ipmi_si ipmi_msghandler wmi acpi_power_meter processor thermal_sys button 8021q garp stp mrp llc drbd lru_cache libcrc32c crc32c_generic autofs4 ext4 crc16 mbcache jbd2 dm_mod raid1 md_mod mlx4_en vxlan xen_blkfront ptp pps_core [Wed May 6 14:48:02 2020] hid_generic usbhid hid sg sd_mod crc_t10dif crct10dif_generic ahci libahci crct10dif_pclmul crct10dif_common crc32c_intel ehci_pci ehci_hcd mlx4_core libata i2c_i801 i2c_core usbcore usb_common scsi_mod nvme [Wed May 6 14:48:02 2020] CPU: 0 PID: 8305 Comm: vif50.1-q1-gues Tainted: P O 3.16.0-10-amd64 #1 Debian 3.16.72-1 [Wed May 6 14:48:02 2020] Hardware name: Quanta Computer Inc QuantaPlex T41S-2U/S2S-MB, BIOS S2S_3B12 05/30/2019 [Wed May 6 14:48:02 2020] task: ffff88003c9f95d0 ti: ffff88004a3ac000 task.ti: ffff88004a3ac000 [Wed May 6 14:48:02 2020] RIP: e030:[<ffffffffa08fcaa2>] [<ffffffffa08fcaa2>] xenvif_gop_frag_copy+0x22/0x3b0 [xen_netback] [Wed May 6 14:48:02 2020] RSP: e02b:ffff88004a3afd98 EFLAGS: 00010282 [Wed May 6 14:48:02 2020] RAX: 0000000000001000 RBX: ffff8802e0841800 RCX: 7aec7d18f3f45689 [Wed May 6 14:48:02 2020] RDX: ffff88004a3afe80 RSI: ffff8802e0841800 RDI: 0000000111f703b7 [Wed May 6 14:48:02 2020] RBP: ffffc9002332c258 R08: 000000005ff8d9a9 R09: 00000000b1fe2a0e [Wed May 6 14:48:02 2020] R10: ffff880000000000 R11: 0000000000000002 R12: 7aec7d18f3f45689 [Wed May 6 14:48:02 2020] R13: ffffc9002332c258 R14: ffff88004a3afe54 R15: 0000000000000001 [Wed May 6 14:48:02 2020] FS: 0000000000000000(0000) GS:ffff880484000000(0000) knlGS:ffff880484000000 [Wed May 6 14:48:02 2020] CS: e033 DS: 0000 ES: 0000 CR0: 0000000080050033 [Wed May 6 14:48:02 2020] CR2: 00007f49c8679000 CR3: 0000000074855000 CR4: 0000000000042660 [Wed May 6 14:48:02 2020] Stack: [Wed May 6 14:48:02 2020] 0000000058f6d400 ffffc90023336c08 00000000000002c0 ffff8802e0841800 [Wed May 6 14:48:02 2020] ffff88004a3afe80 0000000000000080 ffff8802e0841800 ffffc9002332c258 [Wed May 6 14:48:02 2020] 79eb3472cad61644 0000000000000028 ffff88004a3afe54 0000000000000001 [Wed May 6 14:48:02 2020] Call Trace: [Wed May 6 14:48:02 2020] [<ffffffffa08ff2c9>] ? xenvif_kthread_guest_rx+0x549/0xce0 [xen_netback] [Wed May 6 14:48:02 2020] [<ffffffffa08fed80>] ? xenvif_map_frontend_rings+0xd0/0xd0 [xen_netback] [Wed May 6 14:48:02 2020] [<ffffffff810905d1>] ? kthread+0xd1/0xf0 [Wed May 6 14:48:02 2020] [<ffffffff8153be8f>] ? __schedule+0x22f/0x750 [Wed May 6 14:48:02 2020] [<ffffffff81090500>] ? kthread_create_on_node+0x1b0/0x1b0 [Wed May 6 14:48:02 2020] [<ffffffff8154030e>] ? ret_from_fork+0x6e/0xa0 [Wed May 6 14:48:02 2020] [<ffffffff81090500>] ? kthread_create_on_node+0x1b0/0x1b0 [Wed May 6 14:48:02 2020] Code: 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 41 57 41 56 b8 00 10 00 00 41 55 41 54 49 89 cc 55 53 49 89 fd 4b 8d 3c 08 48 83 ec 30 <48> 8b 09 4c 8b 74 24 68 4c 8b 7c 24 70 80 e5 40 74 08 49 8b 4c [Wed May 6 14:48:02 2020] RIP [<ffffffffa08fcaa2>] xenvif_gop_frag_copy+0x22/0x3b0 [xen_netback] [Wed May 6 14:48:02 2020] RSP <ffff88004a3afd98> [Wed May 6 14:48:33 2020] ---[ end trace 4fb039a0de2de66f ]---