Hi Ben, Returning to this issue, last discussed in June.
> > Thanks, now I tried it (the Patchset 2 variant) but it seems to > > behave like before, the delay is sitll happening. > > Hmm thanks ☹ > Can you try to export "MLX5_SHUT_UP_BF=1" in your environment before > starting VPP (ie, VPP environment must contain this)? This should > disable the "BlueFlame" mechanism in Mellanox NIC. Otherwise I'll > need to take a deeper look. Unfortunately that did not help, it seemed to behave the same also with "MLX5_SHUT_UP_BF=1" set. We are still having this problem now, with the current master branch. Like before, the behavior seems to be that when 2 packets are to be sent, only the first one gets sent directly, the second packet gets delayed. I have a test case now where the delay is more than 3 seconds. It seems the delay lasts until something else is to be sent, then the old packet gets sent also. So nothing gets lost, just delayed. But things can anyway fail, for example some ping tests fail because they time out. I have looked a bit more at it and tried to understand what happens, but I did not get much wiser, still just seems to me like VPP rings the "doorbell" and expects the packets to be sent, but somehow only one packet is sent and the other is delayed. Am I right to assume that the "doorbell" action is the last thing VPP is doing that we can check in the VPP source code itself, then we would need to go poke around inside the underlying rdma-core driver to see what is happening? Can you help more with this? Best regards, Elias
-=-=-=-=-=-=-=-=-=-=-=- Links: You receive all messages sent to this group. View/Reply Online (#18130): https://lists.fd.io/g/vpp-dev/message/18130 Mute This Topic: https://lists.fd.io/mt/75120690/21656 Mute #mellanox:https://lists.fd.io/g/vpp-dev/mutehashtag/mellanox Mute #rdma:https://lists.fd.io/g/vpp-dev/mutehashtag/rdma Group Owner: vpp-dev+ow...@lists.fd.io Unsubscribe: https://lists.fd.io/g/vpp-dev/unsub [arch...@mail-archive.com] -=-=-=-=-=-=-=-=-=-=-=-