Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-27 Thread Leon Romanovsky
On Wed, Nov 27, 2024 at 06:48:03PM +0100, Francesco Poli wrote: > On Mon, 25 Nov 2024 21:38:37 +0200 Leon Romanovsky wrote: > > > On Mon, Nov 25, 2024 at 07:54:43PM +0100, Francesco Poli wrote: > [...] > > > I will try to continue to bisect by testing the resulting kernels on a > > > compute node:

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-27 Thread Francesco Poli
On Mon, 25 Nov 2024 21:38:37 +0200 Leon Romanovsky wrote: > On Mon, Nov 25, 2024 at 07:54:43PM +0100, Francesco Poli wrote: [...] > > I will try to continue to bisect by testing the resulting kernels on a > > compute node: there's no OpenSM there and it cannot run anyway, if > > there's another Op

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-26 Thread Leon Romanovsky
On Tue, Nov 26, 2024 at 10:38:59AM +0200, Leon Romanovsky wrote: > On Tue, Nov 26, 2024 at 08:18:24AM +0100, Francesco Poli wrote: > > On Tue, 26 Nov 2024 09:21:37 +0800 Mark Zhang wrote: > > > > [...] > > > Yes looks like FW reports vport.num_plane > 0. What is your hw type and > > > FW version

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-26 Thread Leon Romanovsky
On Tue, Nov 26, 2024 at 08:18:24AM +0100, Francesco Poli wrote: > On Tue, 26 Nov 2024 09:21:37 +0800 Mark Zhang wrote: > > [...] > > Yes looks like FW reports vport.num_plane > 0. What is your hw type and > > FW version ("ethtool -i ")? I don't think it > > supports multiplane. > > $ /sbin/et

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-25 Thread Francesco Poli
On Tue, 26 Nov 2024 09:21:37 +0800 Mark Zhang wrote: [...] > Yes looks like FW reports vport.num_plane > 0. What is your hw type and > FW version ("ethtool -i ")? I don't think it > supports multiplane. $ /sbin/ethtool -i ibp129s0f0 driver: mlx5_core[ib_ipoib] version: 6.10.11-amd64 fir

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-25 Thread Mark Zhang
On 11/26/2024 3:38 AM, Leon Romanovsky wrote: On Mon, Nov 25, 2024 at 07:54:43PM +0100, Francesco Poli wrote: On Thu, 21 Nov 2024 11:04:13 +0100 Uwe Kleine-König wrote: [...] It looks like the commit that is biting you is https://git.kernel.org/linus/50660c5197f52b8137e223dc3ba8d43661179a1d

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-25 Thread Leon Romanovsky
On Mon, Nov 25, 2024 at 07:54:43PM +0100, Francesco Poli wrote: > On Thu, 21 Nov 2024 11:04:13 +0100 Uwe Kleine-König wrote: > > [...] > > It looks like the commit that is biting you is > > > > https://git.kernel.org/linus/50660c5197f52b8137e223dc3ba8d43661179a1d > > > > So if you bisect, try 50

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-25 Thread Francesco Poli
On Thu, 21 Nov 2024 11:04:13 +0100 Uwe Kleine-König wrote: [...] > It looks like the commit that is biting you is > > https://git.kernel.org/linus/50660c5197f52b8137e223dc3ba8d43661179a1d > > So if you bisect, try 50660c5197f52b8137e223dc3ba8d43661179a1d and its > parent 24943dcdc156cf294d97a36b

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-21 Thread Uwe Kleine-König
Hello Francesco, [for the new-comers: This is about a regression in 6.11. Details available at https://bugs.debian.org/1086520. The TL;DR; is that on 6.10.11 opensm works as expected, while it fails to start on 6.11.7.] On Mon, Nov 18, 2024 at 08:06:16PM +0100, Francesco Poli wrote: > On Mon, 18

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-18 Thread Francesco Poli
On Mon, 18 Nov 2024 09:58:03 +0100 Uwe Kleine-König wrote: [...] > On Wed, Nov 13, 2024 at 11:15:03PM +0100, Francesco Poli wrote: > > On Mon, 11 Nov 2024 11:22:26 +0100 Uwe Kleine-König wrote: [...] > > > I guess the kernel provides a directory "/sys/class/infiniband_mad". Do > > > its contents l

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-18 Thread Uwe Kleine-König
Hello Francesco, On Wed, Nov 13, 2024 at 11:15:03PM +0100, Francesco Poli wrote: > On Mon, 11 Nov 2024 11:22:26 +0100 Uwe Kleine-König wrote: > > [...] > > Hello, > > Hi Uwe, thanks for your followup. > > > > > On Thu, Oct 31, 2024 at 07:53:52PM +0100, Francesco Poli (wintermute) wrote: > [...

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-13 Thread Francesco Poli
On Mon, 11 Nov 2024 11:22:26 +0100 Uwe Kleine-König wrote: [...] > Hello, Hi Uwe, thanks for your followup. > > On Thu, Oct 31, 2024 at 07:53:52PM +0100, Francesco Poli (wintermute) wrote: [...] > > I filed this bug report against the Debian Linux kernel, in order > > to warn other users about

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-11 Thread Uwe Kleine-König
Control: tag -1 + moreinfo Control: forwarded -1 https://github.com/linux-rdma/opensm/issues/37 Hello, On Thu, Oct 31, 2024 at 07:53:52PM +0100, Francesco Poli (wintermute) wrote: > Package: src:linux > Version: 6.11.2-1 > Severity: important > X-Debbugs-Cc: invernom...@paranoici.org > > Hello,

Processed: Re: Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-11-11 Thread Debian Bug Tracking System
Processing control commands: > tag -1 + moreinfo Bug #1086520 [src:linux] linux-image-6.11.2-amd64: makes opensm fail to start Added tag(s) moreinfo. > forwarded -1 https://github.com/linux-rdma/opensm/issues/37 Bug #1086520 [src:linux] linux-image-6.11.2-amd64: makes opensm fail to start Set Bug

Bug#1086520: linux-image-6.11.2-amd64: makes opensm fail to start

2024-10-31 Thread Francesco Poli (wintermute)
Package: src:linux Version: 6.11.2-1 Severity: important X-Debbugs-Cc: invernom...@paranoici.org Hello, I encountered a major issue on an HPC cluster head node, as soon as I upgraded the Linux kernel from version 6.10.11-1 to version 6.11.2-1 . The issue is that the head node runs OpenSM (InfiniB