Public bug reported: This was observed first checking the ADT results for the 2023.10.02 Lunar kernel (6.2.0-36.37) on amd64. But it seems also present for at least the Jammy (5.15) kernel. Also seen doing internal regression testing on OpenStack. However NOT reproducible on a local KVM VM. Also bare-metal appears ok.
There were already issues before on Arm and public clouds (AWS, Azure) getting this test disabled there. Running as ADT (which is OpenStack like the internal RT tests) we saw multiple failed attempts before but the attempt which now results in the timeout did succeed before. So memory 38 could be offlined before and now somehow gets stuck. But not bad enough that the script cannot be terminated and the rest of the tests completing more or less normally. 3143s 13:33:40 DEBUG| [stdout] TAP version 13 3143s 13:33:40 DEBUG| [stdout] 1..1 3143s 13:33:40 DEBUG| [stdout] # selftests: memory-hotplug: mem-on-off-test.sh 3143s 13:33:40 DEBUG| [stdout] # Test scope: 2% hotplug memory 3143s 13:33:40 DEBUG| [stdout] # online all hot-pluggable memory in offline state: 3143s 13:33:40 DEBUG| [stdout] # SKIPPED - no hot-pluggable memory in offline state 3144s 13:33:41 DEBUG| [stdout] # offline 2% hot-pluggable memory in online state 3144s 13:33:41 DEBUG| [stdout] # trying to offline 2 out of 64 memory block(s): 3144s 13:33:41 DEBUG| [stdout] # online->offline memory0 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory1 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory10 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory11 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory12 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory13 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory14 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory15 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory16 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory17 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory18 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory19 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory2 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory20 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory21 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory22 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory23 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory3 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory32 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory33 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory34 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory35 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory36 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory37 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory38 3743s 13:43:40 DEBUG| [stdout] # 3743s 13:43:40 DEBUG| [stdout] not ok 1 selftests: memory-hotplug: mem-on-off-test.sh # TIMEOUT 600 seconds ** Affects: linux (Ubuntu) Importance: Medium Status: Triaged ** Tags: kernel-adt-failure -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux in Ubuntu. https://bugs.launchpad.net/bugs/2039809 Title: selftests:memory-hotplug:mem-on-off-test.sh: Timeout when offlining memory as a VM guest Status in linux package in Ubuntu: Triaged Bug description: This was observed first checking the ADT results for the 2023.10.02 Lunar kernel (6.2.0-36.37) on amd64. But it seems also present for at least the Jammy (5.15) kernel. Also seen doing internal regression testing on OpenStack. However NOT reproducible on a local KVM VM. Also bare-metal appears ok. There were already issues before on Arm and public clouds (AWS, Azure) getting this test disabled there. Running as ADT (which is OpenStack like the internal RT tests) we saw multiple failed attempts before but the attempt which now results in the timeout did succeed before. So memory 38 could be offlined before and now somehow gets stuck. But not bad enough that the script cannot be terminated and the rest of the tests completing more or less normally. 3143s 13:33:40 DEBUG| [stdout] TAP version 13 3143s 13:33:40 DEBUG| [stdout] 1..1 3143s 13:33:40 DEBUG| [stdout] # selftests: memory-hotplug: mem-on-off-test.sh 3143s 13:33:40 DEBUG| [stdout] # Test scope: 2% hotplug memory 3143s 13:33:40 DEBUG| [stdout] # online all hot-pluggable memory in offline state: 3143s 13:33:40 DEBUG| [stdout] # SKIPPED - no hot-pluggable memory in offline state 3144s 13:33:41 DEBUG| [stdout] # offline 2% hot-pluggable memory in online state 3144s 13:33:41 DEBUG| [stdout] # trying to offline 2 out of 64 memory block(s): 3144s 13:33:41 DEBUG| [stdout] # online->offline memory0 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory1 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory10 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory11 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory12 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory13 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory14 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory15 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory16 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory17 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory18 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory19 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory2 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory20 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory21 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory22 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory23 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory3 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory32 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory33 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory34 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory35 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory36 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory37 3144s 13:33:41 DEBUG| [stdout] # -> Failure 3144s 13:33:41 DEBUG| [stdout] # online->offline memory38 3743s 13:43:40 DEBUG| [stdout] # 3743s 13:43:40 DEBUG| [stdout] not ok 1 selftests: memory-hotplug: mem-on-off-test.sh # TIMEOUT 600 seconds To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2039809/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp