Public bug reported: The following very simple bash script results in an unkillable process topping the CPU. ``` #!/bin/bash
while true; do echo -n "." if echo "hw-tc-offload: off [fixed]" 2>&1 | grep 'x-checksumming' ; then echo "should never get here...." fi done ``` I can reproduce the issue by starting a few (8) concurrent executions and wait a few minutes. The system used is: ``` > cat /etc/os-release PRETTY_NAME="Ubuntu 21.10" NAME="Ubuntu" VERSION_ID="21.10" VERSION="21.10 (Impish Indri)" VERSION_CODENAME=impish ID=ubuntu ID_LIKE=debian HOME_URL="https://www.ubuntu.com/" SUPPORT_URL="https://help.ubuntu.com/" BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/" PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy" UBUNTU_CODENAME=impish ``` With kernel: ``` > uname -a Linux ubuntu 5.13.0-1011-raspi #13-Ubuntu SMP PREEMPT Fri Nov 19 18:40:23 UTC 2021 aarch64 aarch64 aarch64 GNU/Linux ``` The issue reported over at [1] and it seems to not be present on debian [2]. At the time the issue is triggered top shows the script using one of the CPUs: ``` top - 16:39:24 up 21 min, 10 users, load average: 1.86, 3.00, 3.52 Tasks: 194 total, 3 running, 191 sleeping, 0 stopped, 0 zombie %Cpu(s): 11.1 us, 32.1 sy, 0.0 ni, 56.8 id, 0.0 wa, 0.0 hi, 0.0 si, 0.0 st MiB Mem : 7808.5 total, 5194.8 free, 873.9 used, 1739.7 buff/cache MiB Swap: 0.0 total, 0.0 free, 0.0 used. 6682.4 avail Mem PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 78617 ubuntu 20 0 8836 408 0 R 99.7 0.0 9:09.02 repro.sh ``` The stack of the CPU as reported by sysrq with the "l" option: ``` [ 876.197594] sysrq: Show backtrace of all active CPUs [ 876.202717] sysrq: CPU1: [ 876.202724] Call trace: [ 876.202727] dump_backtrace+0x0/0x1ec [ 876.202738] show_stack+0x24/0x30 [ 876.202743] showacpu+0x84/0x94 [ 876.202751] flush_smp_call_function_queue+0xd8/0x260 [ 876.202757] generic_smp_call_function_single_interrupt+0x20/0x2c [ 876.202762] do_handle_IPI+0x1d8/0x310 [ 876.202769] ipi_handler+0x28/0x40 [ 876.202774] handle_percpu_devid_irq+0x94/0x270 [ 876.202780] __handle_domain_irq+0x88/0xec [ 876.202787] gic_handle_irq+0x5c/0xf0 [ 876.202791] el1_irq+0xc0/0x15c [ 876.202795] schedule_tail+0x12c/0x1fc [ 876.202802] ret_from_fork+0x4/0x3c ``` The stack of the stuck: ``` > cat /proc/78617/stack [<0>] __switch_to+0x138/0x190 [<0>] exit_to_kernel_mode+0x28/0x80 ``` This is causing MicroK8s to fail after a few days. Thank you for your help. [1] https://github.com/ubuntu/microk8s/issues/2280#issuecomment-866912974 [2] https://github.com/ubuntu/microk8s/issues/2280#issuecomment-931306599 ** Affects: linux-raspi (Ubuntu) Importance: Undecided Status: New -- You received this bug notification because you are a member of Kernel Packages, which is subscribed to linux-raspi in Ubuntu. https://bugs.launchpad.net/bugs/1955174 Title: Unkillable process pegging CPU Status in linux-raspi package in Ubuntu: New Bug description: The following very simple bash script results in an unkillable process topping the CPU. ``` #!/bin/bash while true; do echo -n "." if echo "hw-tc-offload: off [fixed]" 2>&1 | grep 'x-checksumming' ; then echo "should never get here...." fi done ``` I can reproduce the issue by starting a few (8) concurrent executions and wait a few minutes. The system used is: ``` > cat /etc/os-release PRETTY_NAME="Ubuntu 21.10" NAME="Ubuntu" VERSION_ID="21.10" VERSION="21.10 (Impish Indri)" VERSION_CODENAME=impish ID=ubuntu ID_LIKE=debian HOME_URL="https://www.ubuntu.com/" SUPPORT_URL="https://help.ubuntu.com/" BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/" PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy" UBUNTU_CODENAME=impish ``` With kernel: ``` > uname -a Linux ubuntu 5.13.0-1011-raspi #13-Ubuntu SMP PREEMPT Fri Nov 19 18:40:23 UTC 2021 aarch64 aarch64 aarch64 GNU/Linux ``` The issue reported over at [1] and it seems to not be present on debian [2]. At the time the issue is triggered top shows the script using one of the CPUs: ``` top - 16:39:24 up 21 min, 10 users, load average: 1.86, 3.00, 3.52 Tasks: 194 total, 3 running, 191 sleeping, 0 stopped, 0 zombie %Cpu(s): 11.1 us, 32.1 sy, 0.0 ni, 56.8 id, 0.0 wa, 0.0 hi, 0.0 si, 0.0 st MiB Mem : 7808.5 total, 5194.8 free, 873.9 used, 1739.7 buff/cache MiB Swap: 0.0 total, 0.0 free, 0.0 used. 6682.4 avail Mem PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND 78617 ubuntu 20 0 8836 408 0 R 99.7 0.0 9:09.02 repro.sh ``` The stack of the CPU as reported by sysrq with the "l" option: ``` [ 876.197594] sysrq: Show backtrace of all active CPUs [ 876.202717] sysrq: CPU1: [ 876.202724] Call trace: [ 876.202727] dump_backtrace+0x0/0x1ec [ 876.202738] show_stack+0x24/0x30 [ 876.202743] showacpu+0x84/0x94 [ 876.202751] flush_smp_call_function_queue+0xd8/0x260 [ 876.202757] generic_smp_call_function_single_interrupt+0x20/0x2c [ 876.202762] do_handle_IPI+0x1d8/0x310 [ 876.202769] ipi_handler+0x28/0x40 [ 876.202774] handle_percpu_devid_irq+0x94/0x270 [ 876.202780] __handle_domain_irq+0x88/0xec [ 876.202787] gic_handle_irq+0x5c/0xf0 [ 876.202791] el1_irq+0xc0/0x15c [ 876.202795] schedule_tail+0x12c/0x1fc [ 876.202802] ret_from_fork+0x4/0x3c ``` The stack of the stuck: ``` > cat /proc/78617/stack [<0>] __switch_to+0x138/0x190 [<0>] exit_to_kernel_mode+0x28/0x80 ``` This is causing MicroK8s to fail after a few days. Thank you for your help. [1] https://github.com/ubuntu/microk8s/issues/2280#issuecomment-866912974 [2] https://github.com/ubuntu/microk8s/issues/2280#issuecomment-931306599 To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu/+source/linux-raspi/+bug/1955174/+subscriptions -- Mailing list: https://launchpad.net/~kernel-packages Post to : kernel-packages@lists.launchpad.net Unsubscribe : https://launchpad.net/~kernel-packages More help : https://help.launchpad.net/ListHelp