Public bug reported:

Observed on akis, blanka, cortez, and hidon. This occurs while NVIDIA
fabric-manager is installed and active, as it binds to TCP port 16000.

18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] invoked with './stress-ng 
-v -t 5 --sigurg 4 --sigurg-ops 3000 --ignite-cpu --syslog --verbose --verify 
--oomable' by user 0 'root'
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] stress-ng 0.18.06 
g9ea345f5dfda
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] system: Linux akis 
6.8.0-1022-nvidia #25-Ubuntu SMP PREEMPT_DYNAMIC Tue Jan 28 05:14:01 UTC 2025 
x86_64, gcc 13.3.0, glibc 2.39, little endian
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] RAM total: 1.5T, RAM free: 
1.5T, swap free: 9.0G
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] temporary file path: 
'/home/ubuntu/autotest/client/tmp/ubuntu_stress_smoke_test/src/stress-ng', 
filesystem type: ext2 (214458870 blocks available)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPUs have 5 idle states: 
C0, C1, C1E, C6, POLL
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 96 processors online, 96 
processors configured
18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] setting to a 5 secs run per 
stressor
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPU data cache: L1: 32K, 
L2: 1024K, L3: 33792K
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] cache allocate: shared 
cache buffer size: 67584K (LLC size x 2 NUMA nodes)
18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] dispatching hogs: 4 sigurg
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] starting stressors
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 4 stressors started
18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] started 
(instance 0 on CPU 57)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] started 
(instance 1 on CPU 11)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] started 
(instance 2 on CPU 84)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] started 
(instance 3 on CPU 60)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: process [222796] 
using socket port 16000
18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: process [222797] 
using socket port 16001
18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: process [222798] 
using socket port 16002
18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: process [222799] 
using socket port 16003
18:35:14 DEBUG| [stdout] stress-ng: fail:  [222796] sigurg: bind failed on port 
16000, errno=98 (Address already in use)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] exited 
(instance 0 on CPU 57)
18:35:14 DEBUG| [stdout] stress-ng: error: [222795] sigurg: [222796] terminated 
with an error, exit status=2 (stressor failed)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222796] terminated 
(stressor failed)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] exited 
(instance 1 on CPU 11)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222797] terminated 
(success)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] exited 
(instance 2 on CPU 84)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222798] terminated 
(success)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] exited 
(instance 3 on CPU 60)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222799] terminated 
(success)
18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] metrics-check: all stressor 
metrics validated and sane
18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] skipped: 0
18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] passed: 3: sigurg (3)
18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] failed: 1: sigurg (1)
18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] metrics untrustworthy: 0
18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] unsuccessful run completed 
in 0 secs
18:35:14 DEBUG| [stdout]  
18:35:14 DEBUG| [stdout]  
18:35:14 DEBUG| [stdout]  
18:35:14 DEBUG| [stdout] Summary:
18:35:14 DEBUG| [stdout]   Stressors run: 1
18:35:14 DEBUG| [stdout]   Skipped: 0, 
18:35:14 DEBUG| [stdout]   Failed:  1,  sigurg
18:35:14 DEBUG| [stdout]   Oopsed:  0, 
18:35:14 DEBUG| [stdout]   Oomed:   0, 
18:35:14 DEBUG| [stdout]   Passed:  0, 
18:35:14 DEBUG| [stdout]   Badret:  0, 
18:35:14 DEBUG| [stdout]  
18:35:14 DEBUG| [stdout] Tests took 0 seconds to run

** Affects: ubuntu-kernel-tests
     Importance: Undecided
     Assignee: Jacob Martin (jacobmartin)
         Status: New


** Tags: amd64 sru-20250113 ubuntu-stress-smoke-test

** Summary changed:

- ubuntu_stress_smoke_test sigurg failed to bind port 16000 with NVIDIA fabric 
manager installed
+ ubuntu_stress_smoke_test sigurg failed to bind port 16000 with NVIDIA fabric 
manager active

-- 
You received this bug notification because you are a member of Canonical
Platform QA Team, which is subscribed to ubuntu-kernel-tests.
https://bugs.launchpad.net/bugs/2097652

Title:
  ubuntu_stress_smoke_test sigurg failed to bind port 16000 with NVIDIA
  fabric manager active

Status in ubuntu-kernel-tests:
  New

Bug description:
  Observed on akis, blanka, cortez, and hidon. This occurs while NVIDIA
  fabric-manager is installed and active, as it binds to TCP port 16000.

  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] invoked with './stress-ng 
-v -t 5 --sigurg 4 --sigurg-ops 3000 --ignite-cpu --syslog --verbose --verify 
--oomable' by user 0 'root'
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] stress-ng 0.18.06 
g9ea345f5dfda
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] system: Linux akis 
6.8.0-1022-nvidia #25-Ubuntu SMP PREEMPT_DYNAMIC Tue Jan 28 05:14:01 UTC 2025 
x86_64, gcc 13.3.0, glibc 2.39, little endian
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] RAM total: 1.5T, RAM 
free: 1.5T, swap free: 9.0G
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] temporary file path: 
'/home/ubuntu/autotest/client/tmp/ubuntu_stress_smoke_test/src/stress-ng', 
filesystem type: ext2 (214458870 blocks available)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPUs have 5 idle states: 
C0, C1, C1E, C6, POLL
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 96 processors online, 96 
processors configured
  18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] setting to a 5 secs run 
per stressor
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] CPU data cache: L1: 32K, 
L2: 1024K, L3: 33792K
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] cache allocate: shared 
cache buffer size: 67584K (LLC size x 2 NUMA nodes)
  18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] dispatching hogs: 4 sigurg
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] starting stressors
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] 4 stressors started
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] started 
(instance 0 on CPU 57)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] started 
(instance 1 on CPU 11)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] started 
(instance 2 on CPU 84)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] started 
(instance 3 on CPU 60)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: process [222796] 
using socket port 16000
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: process [222797] 
using socket port 16001
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: process [222798] 
using socket port 16002
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: process [222799] 
using socket port 16003
  18:35:14 DEBUG| [stdout] stress-ng: fail:  [222796] sigurg: bind failed on 
port 16000, errno=98 (Address already in use)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222796] sigurg: [222796] exited 
(instance 0 on CPU 57)
  18:35:14 DEBUG| [stdout] stress-ng: error: [222795] sigurg: [222796] 
terminated with an error, exit status=2 (stressor failed)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222796] 
terminated (stressor failed)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222797] sigurg: [222797] exited 
(instance 1 on CPU 11)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222797] 
terminated (success)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222798] sigurg: [222798] exited 
(instance 2 on CPU 84)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222798] 
terminated (success)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222799] sigurg: [222799] exited 
(instance 3 on CPU 60)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] sigurg: [222799] 
terminated (success)
  18:35:14 DEBUG| [stdout] stress-ng: debug: [222795] metrics-check: all 
stressor metrics validated and sane
  18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] skipped: 0
  18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] passed: 3: sigurg (3)
  18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] failed: 1: sigurg (1)
  18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] metrics untrustworthy: 0
  18:35:14 DEBUG| [stdout] stress-ng: info:  [222795] unsuccessful run 
completed in 0 secs
  18:35:14 DEBUG| [stdout]  
  18:35:14 DEBUG| [stdout]  
  18:35:14 DEBUG| [stdout]  
  18:35:14 DEBUG| [stdout] Summary:
  18:35:14 DEBUG| [stdout]   Stressors run: 1
  18:35:14 DEBUG| [stdout]   Skipped: 0, 
  18:35:14 DEBUG| [stdout]   Failed:  1,  sigurg
  18:35:14 DEBUG| [stdout]   Oopsed:  0, 
  18:35:14 DEBUG| [stdout]   Oomed:   0, 
  18:35:14 DEBUG| [stdout]   Passed:  0, 
  18:35:14 DEBUG| [stdout]   Badret:  0, 
  18:35:14 DEBUG| [stdout]  
  18:35:14 DEBUG| [stdout] Tests took 0 seconds to run

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-kernel-tests/+bug/2097652/+subscriptions


-- 
Mailing list: https://launchpad.net/~canonical-ubuntu-qa
Post to     : canonical-ubuntu-qa@lists.launchpad.net
Unsubscribe : https://launchpad.net/~canonical-ubuntu-qa
More help   : https://help.launchpad.net/ListHelp

Reply via email to