Hello,

(Please note that I don't speak English, sorry if the traction is not faithful 
to your language)

Following Bjorn Helgaas's advice 
(https://bugzilla.kernel.org/show_bug.cgi?id=218050#c14), I'm coming to you in 
the hope of finding a solution to a problem encountered by several users of the 
ixgbe driver. The subject has been discussed in the messages and comments on 
the following pages:
https://marc.info/?l=linux-netdev&m=170118007007901&w=2
https://forum.proxmox.com/threads/intel-x553-sfp-ixgbe-no-go-on-pve8.135129/
https://www.servethehome.com/the-everything-fanless-home-server-firewall-router-and-nas-appliance-qotom-qnap-teamgroup/
https://www.servethehome.com/intel-x553-networking-and-proxmox-ve-8-1-3/?unapproved=518173&moderation-hash=e57a05288058d3ff253ceb42e9ada905
https://forum.proxmox.com/threads/proxmox-8-kernel-6-2-16-4-pve-ixgbe-driver-fails-to-load-due-to-pci-device-probing-failure.131203/
https://bugzilla.kernel.org/show_bug.cgi?id=218491
https://bugzilla.kernel.org/show_bug.cgi?id=218050

Having myself decided to purchase a Qotom Q20332G9-S10 machine with X553 
chipset for testing purposes, I can see the effectiveness of the connection 
problem between the PC's X553 SFP+ and a Cisco switch SFP+. For my part, this 
happens under GNU/Linux Debian 12 - kernel 6.1.76 and Sid - kernel 6.6.13. So 
it's not specific to Proxmox.
I should point out that under GNU/Linux Debian 11 - kernel 5.10, the network 
card (X553 via ixgbe) works without problems. So this is a relatively "recent" 
bug.

Here's my test environment:
- 1 Qotom Q20332G9-S10 (I used a 16GB Intel Optane M10 M.2 SSD with a fresh 
GNU/Linux Debian 12)
- 1 Cisco DAC cable (tested with a 1M and a 3M)
- 1 PC with Mellanox Connectx-3 2x SFP+ network card (running GNU/Linux Debian 
SID installed several years ago)
- 1 Cisco 3560CX-12PD-S switch (2 SFP+ ports) with IOS 15.2(7)E2

Connecting the Qotom Q20332G9-S10 (X553) to the Mellanox Connectx-3 works 
without a hitch and without any special handling (the 
linux-image-6.1.0-17-amd64 ixgbe driver works in this configuration). Full 
10gbps speeds between the two with an "iperf".

At this stage, I've ruled out a hardware incompatibility (OSI level 1) since 
the DAC works with the X553. So there's no need to use compatibility tricks as 
suggested in the link comments with the "allow_unsupported_sfp=1" parameter. 
This will be useless in the following tests (I've checked).

Where it gets tricky is when you connect it (the Qotom) to the Cisco switch.
Before an "ip link eno1 up", the Cisco raises the link on its side, but the 
Debian doesn't (link DOWN). After the "ip link eno1 up", the link drops and 
never comes back. There does seem to be a driver problem in recent kernels 
(GNU/Linux Debian Stable and Sid).

After compiling the driver manually 
(https://downloadmirror.intel.com/812532/ixgbe-5.19.9.tar.gz) following the 
documentation already shared by others 
(https://www.xmodulo.com/download-install-ixgbe-driver-ubuntu-debian.html), it 
works with the Cisco (after a "shut/no shut" of the latter's 10gbe port).

So we end up with a working machine (I even configured and used the SR-IOV 
successfully right afterwards).

PS: I also tested with Debian Sid

I've finally tried the commands you were giving Skyler without any result 
(rmmod ixgbe; modprobe ixgbe; ethtool -S eno1 | grep fault).

For the moment, the Qotom machine is dedicated to testing, so I'm available to 
carry out any manipulations you may wish to make to advance the subject.
Can we work on diagnosing this problem so that the next stable release of 
Debian is fully functional with this Intel network card?

Best regards.

⢀⣴⠾⠻⢶⣦⠀
⣾⠁⢠⠒⠀⣿⡁ Yohan Charbi
⢿⡄⠘⠷⠚⠋⠀ Cordialement
⠈⠳⣄⠀

Reply via email to