Re: [PATCH v1] powerpc/eeh: Fix missing PE bridge reconfiguration during VFIO EEH recovery

2025-05-14 Thread Ganesh G R
On 5/8/25 11:59 AM, Narayana Murty N wrote: VFIO EEH recovery for PCI passthrough devices fails on PowerNV and pseries platforms due to missing host-side PE bridge reconfiguration. In the current implementation, eeh_pe_configure() only performs RTAS or OPAL-based bridge reconfiguration for native

Re: [linux-next][next-20250207]Observing Kernel Softlock up's while running kselftest

2025-02-23 Thread Ganesh G R
On 2/10/25 10:04 AM, Venkat Rao Bagalkote wrote: Greetings!!! I am observing kernel soft lock up's while running kselftest on IBM Power Servers. Though, I colud not reporduce this consistently, but CI has detected this error twice now. Hence reporting. This error was reported firat time, w

Re: [PATCH] powerpc/eeh: avoid possible crash when edev->pdev changes

2024-06-13 Thread Ganesh G R
On 6/11/24 8:18 AM, Michael Ellerman wrote: Hi Ganesh, Ganesh Goudar writes: If a PCI device is removed during eeh_pe_report_edev(), edev->pdev will change and can cause a crash, hold the PCI rescan/remove lock while taking a copy of edev->pdev. Signed-off-by: Ganesh Goudar --- arch/power

Re: [PATCH] powerpc/eeh: Permanently disable the removed device

2024-04-14 Thread Ganesh G R
On 4/9/24 14:37, Michael Ellerman wrote: Hi Ganesh, Ganesh Goudar writes: When a device is hot removed on powernv, the hotplug driver clears the device's state. However, on pseries, if a device is removed by phyp after reaching the error threshold, the kernel remains unaware, leading to the d

Re: [RFC 0/3] Asynchronous EEH recovery

2023-07-17 Thread Ganesh G R
On 6/13/23 8:06 AM, Oliver O'Halloran wrote: On Tue, Jun 13, 2023 at 11:44 AM Ganesh Goudar wrote: Hi, EEH recovery is currently serialized and these patches shorten the time taken for EEH recovery by making the recovery to run in parallel. The original author of these patches is Sam Bobroff

Re: [PATCH v2] powerpc/mce: log the error for all unrecoverable errors

2023-02-01 Thread Ganesh G R
On 1/31/23 4:59 PM, Michael Ellerman wrote: Ganesh Goudar writes: For all unrecoverable errors we are missing to log the error, Since machine_check_log_err() is not getting called for unrecoverable errors. Raise irq work in save_mce_event() for unrecoverable errors, So that we log the error