Public bug reported:

I'm using a 20.04.2 LTS install on a ThinkPad T470. After installing linux-image-generic-hwe-20.04, the boot kernel switched to 5.8.0.55, and a lot of mce messages started appearing in the kernel log (kern.log), e.g.:

Jun 14 10:57:50 monster kernel: [ 0.627088] mce: [Hardware Error]: CPU 1: Machine Check: 0 Bank 3: 8c40004000100151
Jun 14 10:57:50 monster kernel: [ 0.627089] mce: [Hardware Error]: TSC c619c16f1 ADDR 4414b2940 MISC 306485
Jun 14 10:57:50 monster kernel: [ 0.627090] mce: [Hardware Error]: PROCESSOR 0:806e9 TIME 1623661045 SOCKET 0 APIC 2 microcode de

Using rasdaemon and the fixed ras-mc-ctl script from upstream, this got elaborated to:

$ ras-mc-ctl --errors
--snip--
188 2021-06-14 10:54:21 +0200 error: Instruction CACHE Level-1 Instruction-Fetch Error, mcg mcgstatus=0, mci Error_overflow Corrected_error Threshold based error status: yellow, mcgcap=0x00000c08, status=0xcc400e8000100151, addr=0x2146b9240, misc=0x00516485, walltime=0x60c7193d, cpu=0x00000001, cpuid=0x000806e9, apicid=0x00000002, bank=0x00000003
189 2021-06-14 10:54:22 +0200 error: Instruction CACHE Level-1 Instruction-Fetch Error, mcg mcgstatus=0, mci Error_overflow Corrected_error Threshold based error status: yellow, mcgcap=0x00000c08, status=0xcc40020000100151, addr=0x4344eee40, misc=0x02526485, walltime=0x60c7193e, cpu=0x00000001, cpuid=0x000806e9, apicid=0x00000002, bank=0x00000003
190 2021-06-14 10:54:26 +0200 error: Instruction CACHE Level-1 Instruction-Fetch Error, mcg mcgstatus=0, mci Error_overflow Corrected_error Threshold based error status: yellow, mcgcap=0x00000c08, status=0xcc40064000100151, addr=0x21447e7c0, misc=0x02526485, walltime=0x60c71942, cpu=0x00000001, cpuid=0x000806e9, apicid=0x00000002, bank=0x00000003
--snap--

Is this just better reporting by the 5.8 kernel, or is this a mismatch between kernel and hardware? I have no sudden application crashes or other indications of failing hardware, and a few hours of memtest86+ (not the broken version from the repo but a current one from a boot CD) reported no errors.
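To cross-check what rasdaemon prints, the raw status words can be decoded by hand. The following is a minimal sketch, not the kernel's or rasdaemon's actual decoder: the bit positions are my reading of the IA32_MCi_STATUS layout in the Intel SDM machine-check chapter, the function and table names (decode_mci_status, REQUEST, TRANSACTION, LEVEL, THRESHOLD) are made up for illustration, and it assumes the CPU supports the threshold-based error status field, as the rasdaemon output suggests.

```python
# Sketch: decode IA32_MCi_STATUS words like the ones in the log above.
# Bit positions are assumed from the Intel SDM machine-check chapter;
# all names here are hypothetical helpers, not part of any real tool.

REQUEST = {0: "ERR", 1: "RD", 2: "WR", 3: "DRD", 4: "DWR",
           5: "IRD", 6: "PREFETCH", 7: "EVICT", 8: "SNOOP"}
TRANSACTION = {0: "instruction", 1: "data", 2: "generic"}
LEVEL = {0: "L0", 1: "L1", 2: "L2", 3: "generic"}
THRESHOLD = {0: "no tracking", 1: "green", 2: "yellow", 3: "reserved"}


def decode_mci_status(status):
    """Return a dict with the main fields of a 64-bit MCi_STATUS value."""
    def bit(n):
        return bool((status >> n) & 1)

    mca_code = status & 0xFFFF
    info = {
        "valid": bit(63),
        "overflow": bit(62),
        "uncorrected": bit(61),
        "enabled": bit(60),
        "misc_valid": bit(59),
        "addr_valid": bit(58),
        "context_corrupt": bit(57),
        "mca_code": mca_code,
        "model_specific_code": (status >> 16) & 0xFFFF,
    }

    # For corrected errors, bits 54:53 carry the threshold-based status
    # and bits 52:38 the corrected error count.
    if not info["uncorrected"]:
        info["threshold_status"] = THRESHOLD[(status >> 53) & 0x3]
        info["corrected_count"] = (status >> 38) & 0x7FFF

    # Cache hierarchy errors use the pattern 000F 0001 RRRR TTLL;
    # bit 12 (F) is masked out before comparing.
    if (mca_code & 0xEF00) == 0x0100:
        info["cache_error"] = {
            "request": REQUEST.get((mca_code >> 4) & 0xF, "unknown"),
            "transaction": TRANSACTION.get((mca_code >> 2) & 0x3, "reserved"),
            "level": LEVEL[mca_code & 0x3],
        }
    return info


if __name__ == "__main__":
    for raw in (0x8C40004000100151, 0xCC400E8000100151):
        print(hex(raw), decode_mci_status(raw))
```

Under these assumptions, the value from kern.log (8c40004000100151) decodes as a valid, corrected L1 instruction-fetch cache error with threshold status yellow and a corrected count of 1, while the first rasdaemon entry (0xcc400e8000100151) additionally has the overflow bit set and a corrected count of 58. That agrees with the "Instruction CACHE Level-1 Instruction-Fetch Error ... Corrected_error ... yellow" text from ras-mc-ctl.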
** Affects: linux (Ubuntu)
     Importance: Undecided
         Status: New

** Attachment added: "lspci-vnvn.log"
   https://bugs.launchpad.net/bugs/1931845/+attachment/5504476/+files/lspci-vnvn.log

https://bugs.launchpad.net/bugs/1931845

Title:
  Lots of cme when using 5.8.0 but not with 5.4.0

Status in linux package in Ubuntu:
  New