** Description changed:

+ SRU:
+ ====
+ 
+ [Impact]
+ 
+  * the perftest tools (ib_*) incl. in the perftest package cannot be used at 
all, they all always core dump on all platforms
+  * a backport is required for getting again a working perftest package / tool 
set
+  * the fix was officially provided by Mellanox and fixes the version 
comparison that was broken before (partly using int, partly string compares)
+ 
+ [Test Case]
+ 
+  * the bug can easily be reproduced on two systems both with RoCE cards 
installed
+    and by starting a perftest run like this:
+    on one machine as 'server': sudo ./ib_read_bw -d mlx4_0 -a
+    and on a second machine as 'client', pointing to the servers IP address: 
sudo ./ib_read_bw <server IP> -d mlx4_0
+    
+ detailed instructions how to reproduce the bug
+ 
+  * install the perftest package including all dependencies
+  * configure the RoCE devices as network devices using a private network 
range in /etc/network/interfaces like this
+       # The 1st RoCE interface configuration
+       auto enP1p0s0
+       iface enP1p0s0 inet static
+               address 192.168.1.141
+               netmask 255.255.255.0
+               network 192.168.1.0
+               broadcast 192.168.1.255
+  * test if the network is okay, with ping (or rping, udaddy 
rdma_client/rdma_server)
+  * and run the Test Case above
+ 
+ [Regression Potential]
+ 
+  * the regression is low due to the fact that the current tools that are part 
of the perftest package just segfault and are unusable
+  * the target for this patched perftest package is Zesty (and higher), 
because only Zesty has the depending packages in an up-to-date version
+ 
+  * people may ask to SRU that to Xenial as well, but that would require
+ the update of several other packages ...
+ 
+ --------%<----------------%<----------------%<----------------%<--------
+ 
  == Comment: #0 - Helmut Grauer - 2016-03-04 06:46:50 ==
- Hi 
+ Hi
  Configure IB for perftest
  Ethernet Interface
  np0s0    Link encap:Ethernet  HWaddr 82:01:14:32:f0:90
-           inet addr:10.100.80.2  Bcast:10.100.255.255  Mask:255.255.0.0
-           inet6 addr: fe80::8001:14ff:fe32:f090/64 Scope:Link
-           inet6 addr: fd00:10:100::ff:80:2/80 Scope:Global
-           inet6 addr: fd00:10:100:0:8001:14ff:fe32:f090/64 Scope:Global
-           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
-           RX packets:25938 errors:0 dropped:0 overruns:0 frame:0
-           TX packets:253 errors:0 dropped:0 overruns:0 carrier:0
-           collisions:0 txqueuelen:1000
-           RX bytes:8228251 (8.2 MB)  TX bytes:21494 (21.4 KB)
+           inet addr:10.100.80.2  Bcast:10.100.255.255  Mask:255.255.0.0
+           inet6 addr: fe80::8001:14ff:fe32:f090/64 Scope:Link
+           inet6 addr: fd00:10:100::ff:80:2/80 Scope:Global
+           inet6 addr: fd00:10:100:0:8001:14ff:fe32:f090/64 Scope:Global
+           UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
+           RX packets:25938 errors:0 dropped:0 overruns:0 frame:0
+           TX packets:253 errors:0 dropped:0 overruns:0 carrier:0
+           collisions:0 txqueuelen:1000
+           RX bytes:8228251 (8.2 MB)  TX bytes:21494 (21.4 KB)
  
  Installing related packages for dapltest
  
  librdmacm-dev                                   install
  librdmacm1                                      install
  librdmacm1-dbg                                  install
  dapl2-utils                                     install
  libibumad3                                      install
  libibverbs-dev                                  install
  libibverbs1                                     install
  libmlx4-1                                       install
  libmlx4-1-dbg                                   install
  libmlx4-dev                                     install
  libmlx5-1                                       install
  libmlx5-1-dbg                                   install
  libmlx5-dev                                     install
  perftest                                        install
  
  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++
  +++         PCI-Overview:          +++
  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++
  
  ======================================
  DEVICE_List:
  -------------
  0000:00:00.0
  0001:00:00.0
  
  ======================================
  FunctionID_List:
  -----------------
  0x0000003e
  0x0000003f
  
  ======================================
  PCHID_List:
  ------------
  0x0108
  0x013c
  
  ======================================
  Interface_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/net/:
  enp0s0
  enp0s0d1
  
  /sys/bus/pci/devices/0001:00:00.0/net/:
  enP1p0s0
  enP1p0s0d1
  
  ======================================
  Infiniband_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/infiniband/:
  mlx4_0
  
  /sys/bus/pci/devices/0001:00:00.0/infiniband/:
  mlx4_1
  
  --------------------------------------------------------------------------
  
  server
  
  root@s83lp02:~# dpkg -S /etc/dat.conf
  libdapl2: /etc/dat.conf
  root@s83lp02:~# ib_read_bw -d mlx4_0 -a
  
  ************************************
  * Waiting for client to connect... *
  ************************************
  *** stack smashing detected ***: ib_read_bw terminated
  Aborted (core dumped)
  
  -----------------------------------------------------------------------------
  root@s83lp18:~# ./xpci.sh
  
  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++
  +++         PCI-Overview:          +++
  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++
  
  ======================================
  DEVICE_List:
  -------------
  0000:00:00.0
  0001:00:00.0
  
  ======================================
  FunctionID_List:
  -----------------
  0x00000040
  0x00000041
  
  ======================================
  PCHID_List:
  ------------
  0x0108
  0x013c
  
  ======================================
  Interface_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/net/:
  ens40
  ens40d1
  
  /sys/bus/pci/devices/0001:00:00.0/net/:
  enP1s41
  enP1s41d1
  
  ======================================
  Infiniband_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/infiniband/:
  mlx4_0
  
  /sys/bus/pci/devices/0001:00:00.0/infiniband/:
  mlx4_1
  
- 
  Client
  
  root@s83lp18:~# ib_read_bw 10.100.80.2 -d mlx4_1
  Couldn't connect to 10.100.80.2:18515
  Unable to open file descriptor for socket connection Unable to init the 
socket connection
  root@s83lp18:~# ib_read_bw 10.100.80.2 -d mlx4_1
  *** stack smashing detected ***: ib_read_bw terminated
  Aborted (core dumped)
  (reverse-i-search)`':
  
  I will add SOSReport and dgbinfo.sh

-- 
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1553185

Title:
  PCI RoCe IB perftest Aborted (core dumped)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-z-systems/+bug/1553185/+subscriptions

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to