On 5/8/25 11:59 AM, Narayana Murty N wrote:
VFIO EEH recovery for PCI passthrough devices fails on PowerNV and pseries
platforms due to missing host-side PE bridge reconfiguration. In the
current implementation, eeh_pe_configure() only performs RTAS or OPAL-based
bridge reconfiguration for native
On 2/10/25 10:04 AM, Venkat Rao Bagalkote wrote:
Greetings!!!
I am observing kernel soft lock up's while running kselftest on IBM
Power Servers.
Though, I colud not reporduce this consistently, but CI has detected
this error twice now. Hence reporting.
This error was reported firat time, w
On 6/11/24 8:18 AM, Michael Ellerman wrote:
Hi Ganesh,
Ganesh Goudar writes:
If a PCI device is removed during eeh_pe_report_edev(), edev->pdev
will change and can cause a crash, hold the PCI rescan/remove lock
while taking a copy of edev->pdev.
Signed-off-by: Ganesh Goudar
---
arch/power
On 4/9/24 14:37, Michael Ellerman wrote:
Hi Ganesh,
Ganesh Goudar writes:
When a device is hot removed on powernv, the hotplug
driver clears the device's state. However, on pseries,
if a device is removed by phyp after reaching the error
threshold, the kernel remains unaware, leading to the
d
On 6/13/23 8:06 AM, Oliver O'Halloran wrote:
On Tue, Jun 13, 2023 at 11:44 AM Ganesh Goudar wrote:
Hi,
EEH recovery is currently serialized and these patches shorten
the time taken for EEH recovery by making the recovery to run
in parallel. The original author of these patches is Sam Bobroff
On 1/31/23 4:59 PM, Michael Ellerman wrote:
Ganesh Goudar writes:
For all unrecoverable errors we are missing to log the
error, Since machine_check_log_err() is not getting called
for unrecoverable errors.
Raise irq work in save_mce_event() for unrecoverable errors,
So that we log the error