Public bug reported: Observed on akis, blanka, cortez, and hidon. This occurs while NVIDIA fabric-manager is installed and active, as it binds to TCP port 16000.
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] invoked with './stress-ng -v -t 5 --sigurg 4 --sigurg-ops 3000 --ignite-cpu --syslog --verbose --verify --oomable' by user 0 'root' 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] stress-ng 0.18.06 g9ea345f5dfda 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] system: Linux akis 6.8.0-1022-nvidia #25-Ubuntu SMP PREEMPT_DYNAMIC Tue Jan 28 05:14:01 UTC 2025 x86_64, gcc 13.3.0, glibc 2.39, little endian 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] RAM total: 1.5T, RAM free: 1.5T, swap free: 9.0G 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] temporary file path: '/home/ubuntu/autotest/client/tmp/ubuntu_stress_smoke_test/src/stress-ng', filesystem type: ext2 (214458870 blocks available) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPUs have 5 idle states: C0, C1, C1E, C6, POLL 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 96 processors online, 96 processors configured 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] setting to a 5 secs run per stressor 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPU data cache: L1: 32K, L2: 1024K, L3: 33792K 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] cache allocate: shared cache buffer size: 67584K (LLC size x 2 NUMA nodes) 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] dispatching hogs: 4 sigurg 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] starting stressors 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 4 stressors started 18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] started (instance 0 on CPU 57) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] started (instance 1 on CPU 11) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] started (instance 2 on CPU 84) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] started (instance 3 on CPU 60) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: process [222796] using socket port 16000 18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: process [222797] using socket port 16001 18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: process [222798] using socket port 16002 18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: process [222799] using socket port 16003 18:35:14 DEBUG| [stdout] stress-ng: fail: [222796] sigurg: bind failed on port 16000, errno=98 (Address already in use) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] exited (instance 0 on CPU 57) 18:35:14 DEBUG| [stdout] stress-ng: error: [222795] sigurg: [222796] terminated with an error, exit status=2 (stressor failed) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222796] terminated (stressor failed) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] exited (instance 1 on CPU 11) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222797] terminated (success) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] exited (instance 2 on CPU 84) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222798] terminated (success) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] exited (instance 3 on CPU 60) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222799] terminated (success) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] metrics-check: all stressor metrics validated and sane 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] skipped: 0 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] passed: 3: sigurg (3) 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] failed: 1: sigurg (1) 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] metrics untrustworthy: 0 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] unsuccessful run completed in 0 secs 18:35:14 DEBUG| [stdout] 18:35:14 DEBUG| [stdout] 18:35:14 DEBUG| [stdout] 18:35:14 DEBUG| [stdout] Summary: 18:35:14 DEBUG| [stdout] Stressors run: 1 18:35:14 DEBUG| [stdout] Skipped: 0, 18:35:14 DEBUG| [stdout] Failed: 1, sigurg 18:35:14 DEBUG| [stdout] Oopsed: 0, 18:35:14 DEBUG| [stdout] Oomed: 0, 18:35:14 DEBUG| [stdout] Passed: 0, 18:35:14 DEBUG| [stdout] Badret: 0, 18:35:14 DEBUG| [stdout] 18:35:14 DEBUG| [stdout] Tests took 0 seconds to run ** Affects: ubuntu-kernel-tests Importance: Undecided Assignee: Jacob Martin (jacobmartin) Status: New ** Tags: amd64 sru-20250113 ubuntu-stress-smoke-test ** Summary changed: - ubuntu_stress_smoke_test sigurg failed to bind port 16000 with NVIDIA fabric manager installed + ubuntu_stress_smoke_test sigurg failed to bind port 16000 with NVIDIA fabric manager active -- You received this bug notification because you are a member of Canonical Platform QA Team, which is subscribed to ubuntu-kernel-tests. https://bugs.launchpad.net/bugs/2097652 Title: ubuntu_stress_smoke_test sigurg failed to bind port 16000 with NVIDIA fabric manager active Status in ubuntu-kernel-tests: New Bug description: Observed on akis, blanka, cortez, and hidon. This occurs while NVIDIA fabric-manager is installed and active, as it binds to TCP port 16000. 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] invoked with './stress-ng -v -t 5 --sigurg 4 --sigurg-ops 3000 --ignite-cpu --syslog --verbose --verify --oomable' by user 0 'root' 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] stress-ng 0.18.06 g9ea345f5dfda 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] system: Linux akis 6.8.0-1022-nvidia #25-Ubuntu SMP PREEMPT_DYNAMIC Tue Jan 28 05:14:01 UTC 2025 x86_64, gcc 13.3.0, glibc 2.39, little endian 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] RAM total: 1.5T, RAM free: 1.5T, swap free: 9.0G 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] temporary file path: '/home/ubuntu/autotest/client/tmp/ubuntu_stress_smoke_test/src/stress-ng', filesystem type: ext2 (214458870 blocks available) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPUs have 5 idle states: C0, C1, C1E, C6, POLL 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 96 processors online, 96 processors configured 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] setting to a 5 secs run per stressor 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPU data cache: L1: 32K, L2: 1024K, L3: 33792K 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] cache allocate: shared cache buffer size: 67584K (LLC size x 2 NUMA nodes) 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] dispatching hogs: 4 sigurg 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] starting stressors 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 4 stressors started 18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] started (instance 0 on CPU 57) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] started (instance 1 on CPU 11) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] started (instance 2 on CPU 84) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] started (instance 3 on CPU 60) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: process [222796] using socket port 16000 18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: process [222797] using socket port 16001 18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: process [222798] using socket port 16002 18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: process [222799] using socket port 16003 18:35:14 DEBUG| [stdout] stress-ng: fail: [222796] sigurg: bind failed on port 16000, errno=98 (Address already in use) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] exited (instance 0 on CPU 57) 18:35:14 DEBUG| [stdout] stress-ng: error: [222795] sigurg: [222796] terminated with an error, exit status=2 (stressor failed) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222796] terminated (stressor failed) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] exited (instance 1 on CPU 11) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222797] terminated (success) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] exited (instance 2 on CPU 84) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222798] terminated (success) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] exited (instance 3 on CPU 60) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222799] terminated (success) 18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] metrics-check: all stressor metrics validated and sane 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] skipped: 0 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] passed: 3: sigurg (3) 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] failed: 1: sigurg (1) 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] metrics untrustworthy: 0 18:35:14 DEBUG| [stdout] stress-ng: info: [222795] unsuccessful run completed in 0 secs 18:35:14 DEBUG| [stdout] 18:35:14 DEBUG| [stdout] 18:35:14 DEBUG| [stdout] 18:35:14 DEBUG| [stdout] Summary: 18:35:14 DEBUG| [stdout] Stressors run: 1 18:35:14 DEBUG| [stdout] Skipped: 0, 18:35:14 DEBUG| [stdout] Failed: 1, sigurg 18:35:14 DEBUG| [stdout] Oopsed: 0, 18:35:14 DEBUG| [stdout] Oomed: 0, 18:35:14 DEBUG| [stdout] Passed: 0, 18:35:14 DEBUG| [stdout] Badret: 0, 18:35:14 DEBUG| [stdout] 18:35:14 DEBUG| [stdout] Tests took 0 seconds to run To manage notifications about this bug go to: https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/2097652/+subscriptions -- Mailing list: https://launchpad.net/~canonical-ubuntu-qa Post to : canonical-ubuntu-qa@lists.launchpad.net Unsubscribe : https://launchpad.net/~canonical-ubuntu-qa More help : https://help.launchpad.net/ListHelp