Public bug reported: I have an Epyc Genoa system that won't boot with either the 6.14.0 or 6.14.1 kernels without mce=off. The system is only a few months old, and has run without issue on all kernels up to the most recent 6.13.10 release. Starting with kernel 6.14.0, the system is unable to boot, apparently due to a CPU cache issue (based on my reading of the journalctl -b logs). I don't believe this is actually a hardware issue as reported by journalctl -b, as I have run thorough stress tests on both the CPU and memory without any problem since first coming accross this. I have reached out to the board manufacturer, and there are no newer BIOS updates available. System details are below.
CPU: AMD EPYC 9554 (microcode 0x0a101148) Motherboard: ASRock Rack GENOAD8X-2T/BCM (BIOS Firmware Version 10.05, BMC Firmware Version 10.02.00) Memory: 8x64GB MICRON DDR5 RDIMM [ 2.016623] BERT: Error records from previous boot: [ 2.019601] [Hardware Error]: event severity: fatal [ 2.022688] [Hardware Error]: Error 0, type: fatal [ 2.026173] [Hardware Error]: fru_text: ProcessorError [ 2.028867] [Hardware Error]: section_type: IA32/X64 processor error [ 2.031706] [Hardware Error]: Local APIC_ID: 0x0 [ 2.033879] [Hardware Error]: CPUID Info: [ 2.036475] [Hardware Error]: 00000000: 00a10f11 00000000 00800800 00000000 [ 2.038856] [Hardware Error]: 00000010: 76fa320b 00000000 178bfbff 00000000 [ 2.040891] [Hardware Error]: 00000020: 00000000 00000000 00000000 00000000 [ 2.043879] [Hardware Error]: Error Information Structure 0: [ 2.045883] [Hardware Error]: Error Structure Type: cache error [ 2.047903] [Hardware Error]: Check Information: 0x000000000602001f [ 2.050184] [Hardware Error]: Transaction Type: 2, Generic [ 2.052881] [Hardware Error]: Operation: 0, generic error [ 2.054882] [Hardware Error]: Level: 0 [ 2.057883] [Hardware Error]: Processor Context Corrupt: true [ 2.059883] [Hardware Error]: Uncorrected: true [ 2.061899] [Hardware Error]: Context Information Structure 0: [ 2.063883] [Hardware Error]: Register Context Type: MSR Registers (Machine Check and other MSRs) [ 2.067872] usb 1-1: new high-speed USB device number 2 using xhci_hcd [ 2.067891] [Hardware Error]: Register Array Size: 0x0050 [ 2.073207] [Hardware Error]: MSR Address: 0xc0002051 [ 2.076587] [Hardware Error]: Context Information Structure 1: [ 2.078873] [Hardware Error]: Register Context Type: Unclassified Data [ 2.081350] [Hardware Error]: Register Array Size: 0x0010 [ 2.083291] [Hardware Error]: Register Array: [ 2.085194] [Hardware Error]: 00000000: 00000010 00000000 1c3010c0 fffffffe [ 2.087887] BERT: Total records found: 1 ProblemType: Bug DistroRelease: Ubuntu 25.04 Package: linux-image-6.14.0-13-generic 6.14.0-13.13 ProcVersionSignature: Ubuntu 6.14.0-13.13-generic 6.14.0 Uname: Linux 6.14.0-13-generic x86_64 AlsaVersion: Advanced Linux Sound Architecture Driver Version k6.14.0-13-generic. AplayDevices: Error: [Errno 2] No such file or directory: 'aplay' ApportVersion: 2.32.0-0ubuntu3 Architecture: amd64 ArecordDevices: Error: [Errno 2] No such file or directory: 'arecord' AudioDevicesInUse: Error: command ['fuser', '-v', '/dev/snd/by-path', '/dev/snd/controlC0', '/dev/snd/controlC2', '/dev/snd/controlC1', '/dev/snd/hwC0D0', '/dev/snd/pcmC0D9p', '/dev/snd/pcmC0D8p', '/dev/snd/hwC2D0', '/dev/snd/pcmC2D9p', '/dev/snd/hwC1D0', '/dev/snd/pcmC2D8p', '/dev/snd/pcmC1D9p', '/dev/snd/pcmC0D7p', '/dev/snd/pcmC2D7p', '/dev/snd/pcmC1D8p', '/dev/snd/pcmC0D3p', '/dev/snd/pcmC2D3p', '/dev/snd/pcmC1D7p', '/dev/snd/pcmC1D3p', '/dev/snd/seq', '/dev/snd/timer'] failed with exit code 1: CRDA: N/A Card0.Amixer.info: Error: [Errno 2] No such file or directory: 'amixer' Card0.Amixer.values: Error: [Errno 2] No such file or directory: 'amixer' Card1.Amixer.info: Error: [Errno 2] No such file or directory: 'amixer' Card1.Amixer.values: Error: [Errno 2] No such file or directory: 'amixer' Card2.Amixer.info: Error: [Errno 2] No such file or directory: 'amixer' Card2.Amixer.values: Error: [Errno 2] No such file or directory: 'amixer' CasperMD5CheckResult: pass CurrentDmesg: Error: command ['dmesg'] failed with exit code 1: dmesg: read kernel buffer failed: Operation not permitted Date: Tue Apr 8 23:49:59 2025 InstallationDate: Installed on 2025-04-08 (0 days ago) InstallationMedia: Ubuntu-Server 25.04 "Plucky Puffin" - Daily amd64 (20250324) MachineType: To Be Filled By O.E.M. GENOAD8X-2T/BCM ProcEnviron: LANG=en_US.UTF-8 PATH=(custom, no user) SHELL=/bin/bash TERM=xterm-256color XDG_RUNTIME_DIR=<set> ProcFB: 0 astdrmfb ProcKernelCmdLine: BOOT_IMAGE=/vmlinuz-6.14.0-13-generic root=/dev/mapper/ubuntu--vg-ubuntu--lv ro crashkernel=2G-4G:320M,4G-32G:512M,32G-64G:1024M,64G-128G:2048M,128G-:4096M mce=off RelatedPackageVersions: linux-restricted-modules-6.14.0-13-generic N/A linux-backports-modules-6.14.0-13-generic N/A linux-firmware 20250317.git1d4c88ee-0ubuntu1 RfKill: Error: [Errno 2] No such file or directory: 'rfkill' SourcePackage: linux UpgradeStatus: No upgrade log present (probably fresh install) acpidump: dmi.bios.date: 12/12/2024 dmi.bios.release: 5.27 dmi.bios.vendor: American Megatrends International, LLC. dmi.bios.version: 10.05 dmi.board.name: GENOAD8X-2T/BCM dmi.board.vendor: ASRockRack dmi.chassis.asset.tag: To Be Filled By O.E.M. dmi.chassis.type: 17 dmi.chassis.vendor: To Be Filled By O.E.M. dmi.chassis.version: To Be Filled By O.E.M. dmi.modalias: dmi:bvnAmericanMegatrendsInternational,LLC.:bvr10.05:bd12/12/2024:br5.27:svnToBeFilledByO.E.M.:pnGENOAD8X-2T/BCM:pvrToBeFilledByO.E.M.:rvnASRockRack:rnGENOAD8X-2T/BCM:rvr:cvnToBeFilledByO.E.M.:ct17:cvrToBeFilledByO.E.M.:skuToBeFilledByO.E.M.: dmi.product.family: To Be Filled By O.E.M. dmi.product.name: GENOAD8X-2T/BCM dmi.product.sku: To Be Filled By O.E.M. dmi.product.version: To Be Filled By O.E.M. dmi.sys.vendor: To Be Filled By O.E.M. ** Affects: linux (Ubuntu) Importance: Undecided Status: New ** Tags: amd64 apport-bug plucky -- You received this bug notification because you are a member of Ubuntu Bugs, which is subscribed to Ubuntu. https://bugs.launchpad.net/bugs/2106553 Title: Unable to boot starting with Kernel 6.14.0 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2106553/+subscriptions -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs