Public bug reported:

This was observed first checking the ADT results for the 2023.10.02
Lunar kernel (6.2.0-36.37) on amd64. But it seems also present for at
least the Jammy (5.15) kernel. Also seen doing internal regression
testing on OpenStack. However NOT reproducible on a local KVM VM. Also
bare-metal appears ok.

There were already issues before on Arm and public clouds (AWS, Azure)
getting this test disabled there. Running as ADT (which is OpenStack
like the internal RT tests) we saw multiple failed attempts before but
the attempt which now results in the timeout did succeed before. So
memory 38 could be offlined before and now somehow gets stuck. But not
bad enough that the script cannot be terminated and the rest of the
tests completing more or less normally.

3143s 13:33:40 DEBUG| [stdout] TAP version 13
3143s 13:33:40 DEBUG| [stdout] 1..1
3143s 13:33:40 DEBUG| [stdout] # selftests: memory-hotplug: mem-on-off-test.sh
3143s 13:33:40 DEBUG| [stdout] # Test scope: 2% hotplug memory
3143s 13:33:40 DEBUG| [stdout] #         online all hot-pluggable memory in 
offline state:
3143s 13:33:40 DEBUG| [stdout] #                 SKIPPED - no hot-pluggable 
memory in offline state
3144s 13:33:41 DEBUG| [stdout] #         offline 2% hot-pluggable memory in 
online state
3144s 13:33:41 DEBUG| [stdout] #         trying to offline 2 out of 64 memory 
block(s):
3144s 13:33:41 DEBUG| [stdout] # online->offline memory0
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory1
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory10
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory11
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory12
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory13
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory14
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory15
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory16
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory17
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory18
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory19
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory2
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory20
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory21
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory22
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory23
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory3
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory32
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory33
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory34
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory35
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory36
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory37
3144s 13:33:41 DEBUG| [stdout] # -> Failure
3144s 13:33:41 DEBUG| [stdout] # online->offline memory38
3743s 13:43:40 DEBUG| [stdout] #
3743s 13:43:40 DEBUG| [stdout] not ok 1 selftests: memory-hotplug: 
mem-on-off-test.sh # TIMEOUT 600 seconds

** Affects: linux (Ubuntu)
     Importance: Medium
         Status: Triaged


** Tags: kernel-adt-failure

-- 
You received this bug notification because you are a member of Kernel
Packages, which is subscribed to linux in Ubuntu.
https://bugs.launchpad.net/bugs/2039809

Title:
  selftests:memory-hotplug:mem-on-off-test.sh: Timeout when offlining
  memory as a VM guest

Status in linux package in Ubuntu:
  Triaged

Bug description:
  This was observed first checking the ADT results for the 2023.10.02
  Lunar kernel (6.2.0-36.37) on amd64. But it seems also present for at
  least the Jammy (5.15) kernel. Also seen doing internal regression
  testing on OpenStack. However NOT reproducible on a local KVM VM. Also
  bare-metal appears ok.

  There were already issues before on Arm and public clouds (AWS, Azure)
  getting this test disabled there. Running as ADT (which is OpenStack
  like the internal RT tests) we saw multiple failed attempts before but
  the attempt which now results in the timeout did succeed before. So
  memory 38 could be offlined before and now somehow gets stuck. But not
  bad enough that the script cannot be terminated and the rest of the
  tests completing more or less normally.

  3143s 13:33:40 DEBUG| [stdout] TAP version 13
  3143s 13:33:40 DEBUG| [stdout] 1..1
  3143s 13:33:40 DEBUG| [stdout] # selftests: memory-hotplug: mem-on-off-test.sh
  3143s 13:33:40 DEBUG| [stdout] # Test scope: 2% hotplug memory
  3143s 13:33:40 DEBUG| [stdout] #       online all hot-pluggable memory in 
offline state:
  3143s 13:33:40 DEBUG| [stdout] #               SKIPPED - no hot-pluggable 
memory in offline state
  3144s 13:33:41 DEBUG| [stdout] #       offline 2% hot-pluggable memory in 
online state
  3144s 13:33:41 DEBUG| [stdout] #       trying to offline 2 out of 64 memory 
block(s):
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory0
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory1
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory10
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory11
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory12
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory13
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory14
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory15
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory16
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory17
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory18
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory19
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory2
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory20
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory21
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory22
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory23
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory3
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory32
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory33
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory34
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory35
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory36
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory37
  3144s 13:33:41 DEBUG| [stdout] # -> Failure
  3144s 13:33:41 DEBUG| [stdout] # online->offline memory38
  3743s 13:43:40 DEBUG| [stdout] #
  3743s 13:43:40 DEBUG| [stdout] not ok 1 selftests: memory-hotplug: 
mem-on-off-test.sh # TIMEOUT 600 seconds

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu/+source/linux/+bug/2039809/+subscriptions


-- 
Mailing list: https://launchpad.net/~kernel-packages
Post to     : kernel-packages@lists.launchpad.net
Unsubscribe : https://launchpad.net/~kernel-packages
More help   : https://help.launchpad.net/ListHelp

Reply via email to