Currently, only one CPER buffer (entry) can be delivered and
acknowledged at a time. This conflicts with the scenario where the host
uses a 64KB page size and the guest uses a 4KB page size. In that
scenario, one problematic host page covers 16 guest pages, resulting in
up to 16 memory errors in the worst case. Unfortunately, qemu aborts
and dumps core at (a) because the previous error hasn't been
acknowledged and there is no way to deliver the current one, as shown
in the following call trace:

  kvm_vcpu_thread_fn
    kvm_cpu_exec
      kvm_arch_on_sigbus_vcpu
        kvm_cpu_synchronize_state
        acpi_ghes_memory_errors          (a)
        kvm_inject_arm_sea | abort

Fix the issue by sending 16x consecutive memory CPER entries for this
specific case in one shot.

With the series applied on top of Mauro's (v9) series [1], no qemu core
dump is observed in the test where guest memory access is triggered by
'victimd' and the recoverable memory error is injected from the host.

[1] [PATCH v9 00/20] Change ghes to use HEST-based offsets and add
    support for error inject
    (https://lists.nongnu.org/archive/html/qemu-arm/2025-05/msg01104.html)

Changelog
=========
v2:
  * v1: https://lists.nongnu.org/archive/html/qemu-arm/2025-02/msg00897.html
  * Send 16x memory errors for the specific case (Jonathan)

Gavin Shan (3):
  acpi/ghes: Extend acpi_ghes_memory_errors() to support multiple CPERs
  kvm/arm/kvm: Introduce helper push_ghes_memory_errors()
  target/arm/kvm: Support multiple memory CPERs injection

 hw/acpi/ghes-stub.c    |  2 +-
 hw/acpi/ghes.c         | 29 ++++++++--------
 include/hw/acpi/ghes.h |  2 +-
 target/arm/kvm.c       | 77 +++++++++++++++++++++++++++++++++++++-----
 4 files changed, 86 insertions(+), 24 deletions(-)

-- 
2.49.0