** Description changed:

+ SRU:
+ ====
+ 
+ [Impact]
+ 
+ * the perftest tools (ib_*) included in the perftest package cannot be used at all; they always core dump on all platforms
+ * a backport is required to get a working perftest package / tool set again
+ * the fix was officially provided by Mellanox and fixes the version comparison that was broken before (partly using int, partly string compares)
+ 
+ [Test Case]
+ 
+ * the bug can easily be reproduced on two systems, both with RoCE cards installed,
+ by starting a perftest run like this:
+ on one machine as 'server': sudo ./ib_read_bw -d mlx4_0 -a
+ and on a second machine as 'client', pointing to the server's IP address: sudo ./ib_read_bw <server IP> -d mlx4_0
+ 
+ detailed instructions on how to reproduce the bug
+ 
+ * install the perftest package including all dependencies
+ * configure the RoCE devices as network devices using a private network range in /etc/network/interfaces like this
+ # The 1st RoCE interface configuration
+ auto enP1p0s0
+ iface enP1p0s0 inet static
+ address 192.168.1.141
+ netmask 255.255.255.0
+ network 192.168.1.0
+ broadcast 192.168.1.255
+ * test if the network is okay, with ping (or rping, udaddy rdma_client/rdma_server)
+ * and run the Test Case above
+ 
+ [Regression Potential]
+ 
+ * the regression potential is low because the current tools in the perftest package just segfault and are unusable
+ * the target for this patched perftest package is Zesty (and higher), because only Zesty has the packages it depends on in an up-to-date version
+ 
+ * people may ask to SRU that to Xenial as well, but that would require
+ the update of several other packages ...
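The [Impact] section attributes the crash to a version comparison that mixed integer and string compares. As a rough illustration only (this is not the Mellanox patch, and the function names and version values below are hypothetical), the following Python sketch shows how a plain string compare and a numeric compare can disagree on the same dotted version strings:

```python
def parse_version(text):
    """Split a dotted version string such as '5.10' into a tuple of ints."""
    return tuple(int(part) for part in text.split("."))


def string_at_least(version, minimum):
    # Lexicographic compare on the raw strings: '1' sorts before '9',
    # so '5.10' is wrongly considered older than '5.9'.
    return version >= minimum


def numeric_at_least(version, minimum):
    # Component-wise integer compare: (5, 10) >= (5, 9).
    return parse_version(version) >= parse_version(minimum)
```

With the hypothetical inputs "5.10" and "5.9", string_at_least reports the newer version as too old while numeric_at_least accepts it. Code that takes one path on some checks and the other path elsewhere can therefore reach inconsistent conclusions about the same version.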
+ 
+ --------%<----------------%<----------------%<----------------%<--------
+ 
  == Comment: #0 - Helmut Grauer - 2016-03-04 06:46:50 ==
- Hi
+ Hi
  
  Configure IB for perftest
  
  Ethernet Interface
  np0s0 Link encap:Ethernet HWaddr 82:01:14:32:f0:90
- inet addr:10.100.80.2 Bcast:10.100.255.255 Mask:255.255.0.0
- inet6 addr: fe80::8001:14ff:fe32:f090/64 Scope:Link
- inet6 addr: fd00:10:100::ff:80:2/80 Scope:Global
- inet6 addr: fd00:10:100:0:8001:14ff:fe32:f090/64 Scope:Global
- UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
- RX packets:25938 errors:0 dropped:0 overruns:0 frame:0
- TX packets:253 errors:0 dropped:0 overruns:0 carrier:0
- collisions:0 txqueuelen:1000
- RX bytes:8228251 (8.2 MB) TX bytes:21494 (21.4 KB)
+ inet addr:10.100.80.2 Bcast:10.100.255.255 Mask:255.255.0.0
+ inet6 addr: fe80::8001:14ff:fe32:f090/64 Scope:Link
+ inet6 addr: fd00:10:100::ff:80:2/80 Scope:Global
+ inet6 addr: fd00:10:100:0:8001:14ff:fe32:f090/64 Scope:Global
+ UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
+ RX packets:25938 errors:0 dropped:0 overruns:0 frame:0
+ TX packets:253 errors:0 dropped:0 overruns:0 carrier:0
+ collisions:0 txqueuelen:1000
+ RX bytes:8228251 (8.2 MB) TX bytes:21494 (21.4 KB)
  
  Installing related packages for dapltest
  librdmacm-dev install
  librdmacm1 install
  librdmacm1-dbg install
  dapl2-utils install
  libibumad3 install
  libibverbs-dev install
  libibverbs1 install
  libmlx4-1 install
  libmlx4-1-dbg install
  libmlx4-dev install
  libmlx5-1 install
  libmlx5-1-dbg install
  libmlx5-dev install
  perftest install
  
  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++
  +++ PCI-Overview: +++
  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++
  ======================================
  DEVICE_List:
  -------------
  0000:00:00.0
  0001:00:00.0
  ======================================
  FunctionID_List:
  -----------------
  0x0000003e
  0x0000003f
  ======================================
  PCHID_List:
  ------------
  0x0108
  0x013c
  ======================================
  Interface_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/net/:
  enp0s0 enp0s0d1
  /sys/bus/pci/devices/0001:00:00.0/net/:
  enP1p0s0 enP1p0s0d1
  ======================================
  Infiniband_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/infiniband/:
  mlx4_0
  /sys/bus/pci/devices/0001:00:00.0/infiniband/:
  mlx4_1
  --------------------------------------------------------------------------
  server
  root@s83lp02:~# dpkg -S /etc/dat.conf
  libdapl2: /etc/dat.conf
  root@s83lp02:~# ib_read_bw -d mlx4_0 -a
  
  ************************************
  * Waiting for client to connect... *
  ************************************
  *** stack smashing detected ***: ib_read_bw terminated
  Aborted (core dumped)
  -----------------------------------------------------------------------------
  root@s83lp18:~# ./xpci.sh
  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++
  +++ PCI-Overview: +++
  ++++++++++++++++++++++++++++++++++++++
  ++++++++++++++++++++++++++++++++++++++
  ======================================
  DEVICE_List:
  -------------
  0000:00:00.0
  0001:00:00.0
  ======================================
  FunctionID_List:
  -----------------
  0x00000040
  0x00000041
  ======================================
  PCHID_List:
  ------------
  0x0108
  0x013c
  ======================================
  Interface_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/net/:
  ens40 ens40d1
  /sys/bus/pci/devices/0001:00:00.0/net/:
  enP1s41 enP1s41d1
  ======================================
  Infiniband_List:
  ----------------
  /sys/bus/pci/devices/0000:00:00.0/infiniband/:
  mlx4_0
  /sys/bus/pci/devices/0001:00:00.0/infiniband/:
  mlx4_1
  
  - Client
  root@s83lp18:~# ib_read_bw 10.100.80.2 -d mlx4_1
  Couldn't connect to 10.100.80.2:18515
  Unable to open file descriptor for socket connection
  Unable to init the socket connection
  root@s83lp18:~# ib_read_bw 10.100.80.2 -d mlx4_1
  *** stack smashing detected ***: ib_read_bw terminated
  Aborted (core dumped)
  
  I will add SOSReport and dbginfo.sh
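The source of the xpci.sh script is not part of this report. As a hedged sketch only, the Interface_List and Infiniband_List sections of its output could be gathered from the sysfs paths shown above roughly as follows; the function name list_pci_children and its root parameter are hypothetical and exist only for this illustration:

```python
import os


def list_pci_children(subdir, root="/sys/bus/pci/devices"):
    """Map each PCI function under `root` to the sorted entries of its `subdir`.

    `subdir` is for example "net" or "infiniband", matching the paths
    printed in the PCI overview. Functions without that subdirectory
    are skipped.
    """
    result = {}
    if not os.path.isdir(root):
        return result
    for dev in sorted(os.listdir(root)):
        path = os.path.join(root, dev, subdir)
        if os.path.isdir(path):
            result[dev] = sorted(os.listdir(path))
    return result
```

Calling list_pci_children("infiniband") on each machine gives the device name to pass to -d, which matters here because the server run uses mlx4_0 while the client run uses mlx4_1.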
--
You received this bug notification because you are a member of Ubuntu
Bugs, which is subscribed to Ubuntu.
https://bugs.launchpad.net/bugs/1553185

Title:
  PCI RoCe IB perftest Aborted (core dumped)

To manage notifications about this bug go to:
https://bugs.launchpad.net/ubuntu-z-systems/+bug/1553185/+subscriptions

--
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs