** Description changed: - Issue found on node entei with Focal kernel. + [Impact] + When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver: - When trying to run this test, it will try to break 4 devices on Focal, - and one of them is using the AHCI driver: - - $ sudo ./eeh-basic.sh + $ sudo ./eeh-basic.sh 0000:00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error - 0021:0e:00.0, waited 3/60 - ./eeh-basic.sh: 74: sleep: Input/output error .... ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found - And drop into read-only state, dmesg can be found in the attachment. + And drop into a read-only state + + [Fixes] + * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") + + This is only affecting Focal and it can be cherry-picked. + + [Test case] + Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. + + [Where problems could occur] + This fix is limited to PowerPC testing tool, it should not cause any issue.
** Description changed: - [Impact] - When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver: + [Impact] + When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver which doesn't support error recovery: $ sudo ./eeh-basic.sh 0000:00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error .... ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into a read-only state [Fixes] * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") This is only affecting Focal and it can be cherry-picked. [Test case] Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. [Where problems could occur] This fix is limited to PowerPC testing tool, it should not cause any issue. -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/1916468 Title: powerpc/eeh-basic.sh in kselftest make P8 node stopped working Status in ubuntu-kernel-tests: New Status in linux package in Ubuntu: Fix Released Status in linux source package in Focal: Incomplete Bug description: [Impact] When trying to run this test on P8 node entei with Focal kernel, it will try to break 4 devices on Focal, and one of them is using the AHCI driver which doesn't support error recovery: $ sudo ./eeh-basic.sh 0000:00:00.0, Skipped: bridge 0001:00:00.0, Skipped: bridge 0020:00:00.0, Skipped: bridge 0021:00:00.0, Skipped: bridge 0021:01:00.0, Skipped: bridge 0021:02:01.0, Skipped: bridge 0021:02:08.0, Skipped: bridge 0021:02:09.0, Skipped: bridge 0021:02:0a.0, Skipped: bridge 0021:02:0b.0, Skipped: bridge 0021:02:0c.0, Skipped: bridge 0021:0d:00.0, Added 0021:0e:00.0, Added 0021:0f:00.0, Skipped: bridge 0021:10:00.0, Added 0022:00:00.0, Skipped: bridge 0022:01:00.0, Added Found 4 breakable devices... Breaking 0021:0d:00.0... 0021:0d:00.0, waited 0/60 0021:0d:00.0, waited 1/60 0021:0d:00.0, waited 2/60 0021:0d:00.0, waited 3/60 0021:0d:00.0, waited 4/60 0021:0d:00.0, waited 5/60 0021:0d:00.0, waited 6/60 0021:0d:00.0, waited 7/60 0021:0d:00.0, waited 8/60 0021:0d:00.0, Recovered after 9 seconds Breaking 0021:0e:00.0... 0021:0e:00.0, waited 0/60 0021:0e:00.0, waited 1/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 2/60 ./eeh-basic.sh: 74: sleep: Input/output error .... ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 59/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, waited 60/60 ./eeh-basic.sh: 74: sleep: Input/output error 0021:0e:00.0, Failed to recover! Breaking 0021:10:00.0... Skipping 0021:10:00.0, Initial PE state is not ok Breaking 0022:01:00.0... Skipping 0022:01:00.0, Initial PE state is not ok 3 devices failed to recover (4 tested) ./eeh-basic.sh: 81: lspci: Input/output error ./eeh-basic.sh: 81: diff: Input/output error ./eeh-basic.sh: 82: rm: Input/output error ./eeh-basic.sh: 84: test: 3: unexpected operator With the driver failed to recovery, the system will start acting up. $ ls ls: command not found And drop into a read-only state [Fixes] * bbe9064f30f06e ("selftests/eeh: Skip ahci adapters") This is only affecting Focal and it can be cherry-picked. [Test case] Run the eeh-basic.sh script in tools/testing/selftests/powerpc/eeh/ on the affected P8 node, the test should pass without any issue. [Where problems could occur] This fix is limited to PowerPC testing tool, it should not cause any issue. To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/1916468/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp