On 4/26/24 22:05, Gavin McKee wrote:
> Thanks again for coming back on this Ilya,
> 
> Another option I am looking at here is to switch the kernal path (Open
> vSwitch kernel module) with OVS-DOCA as we are using the CX6/7 card
> https://docs.nvidia.com/doca/archive/doca-v2.0.2/ovs-doca/index.html
> 
> I'm trying to wrangle the documented Known Limitations
> 
> - Only one insertion thread is supported (n-offload-threads=1).
> - Only a single PF is currently supported.
> - Geneve options are not supported.
> - Only 250K connections are offloaded by CT offload.
> - Only 8 CT zones are supported by CT offload.
> - OVS restart on non-tunnel configuration is not supported. Remove all
> ports before restarting.
> 
> The one that concerns me is the 8 CT zones supported by CT offload, as
> potentially with OVN I may have many CT zones if we have many customer
> colocated on the same compute node.
> 
> Seems any way I turn I'm getting kicked :D

You'll also need a special binary build from NVIDIA.  The docs you linked
are closed-source fork and not an upstream OVS.  OVS doesn't support
offloading of connection tracking in userspace datapath.

> 
> Gav
> 
> On Fri, 26 Apr 2024 at 12:42, Ilya Maximets <i.maxim...@ovn.org> wrote:
>>
>> On 4/26/24 20:12, Gavin McKee wrote:
>>> Thanks for coming back to me on this.
>>>
>>> Moving kernal versions around is not a straightforward option here -
>>> especially when you are using hardware offload .  The OFED driver
>>> version is coupled to the kernal so if we move from that we are out of
>>> support coverage .
>>>
>>> Doing an  ovn-appctl -t ovn-controller recompute does not resolve the
>>> problem , again just taking a big hammer like restarting openvswitch
>>> does.
>>>
>>> How would we proceed here ?  Are there any Open vSwitch kernal module
>>> patches we could try to get a resolution ?
>>
>> You can try the commit I linked below.  That will mean that you'll
>> need to re-build your kernel.  There is no other way.
>>
>> In the past we had out-of-tree module, but it is deprecated for a
>> long time, contains multiple issues and is unlikely to work with
>> new kernels, especially heavily modified ones, like RHEL kernels.
>>
>> Note that the issue is not localized to OVS, but affects TC as well
>> as they now share the NAT implementation.  So, even if just swapping
>> the openvswitch kernel module was possible, it wouldn't help much.
>>
>>>
>>> One option we are looking at is regressing the entire stack back to Rocky 
>>> 9.1 .
>>
>> This may be an option.  The bug I mentioned in a previous email exists
>> in RHEL 9.3, so it exists in Rocky 9.3 as well, at least it should
>> since they claim "bug-for-bug compatibility".  So, 3 options to fix this
>> particular bug (I don't know if it is causing your issue, but it is
>> a severe bug that can potentially be a cause):
>>
>> 1. Re-build the kernel to include the fix.
>> 2. Downgrade from 9.3 to an earlier release.
>> 3. Wait for 9.4.
>>
>> Best regards, Ilya Maximets.
>>
>>>
>>> Gav
>>>
>>> On Wed, 24 Apr 2024 at 04:44, Ilya Maximets <i.maxim...@ovn.org> wrote:
>>>>
>>>> On 4/23/24 17:39, Gavin McKee wrote:
>>>>> If you look at both traces (non working and working) the thing that
>>>>> stands out to me is this
>>>>>
>>>>> At line 10 in the working file the following entry exists
>>>>>     ct_state NEW tcp (SYN_SENT) orig [172.27.16.11.38793 >
>>>>> 172.27.31.189.9100] reply [172.27.31.189.9100 > 172.27.16.11.38793]
>>>>> zone 195
>>>>>
>>>>> his doesn't happen in the non working file - I just see the following
>>>>>
>>>>> 3904992932904126 [swapper/125] 0 [kr] queue_userspace_packet
>>>>> #ddf92049d47aeff1d0b6625620000 (skb 18382861792850905088) n 4
>>>>>     if 38 (enp148s0f0_1) rxif 38 172.27.16.11.42303 >
>>>>> 172.27.31.189.9100 ttl 64 tos 0x0 id 36932 off 0 [DF] len 52 proto TCP
>>>>> (6) flags [S] seq 2266263186 win 42340
>>>>>     upcall_enqueue (miss) (125/3904992932890052) q 3682874017 ret 0
>>>>>   + 3904992932907247 [swapper/125] 0 [kr] ovs_dp_upcall
>>>>> #ddf92049d47aeff1d0b6625620000 (skb 18382861792850905088) n 5
>>>>>     if 38 (enp148s0f0_1) rxif 38 172.27.16.11.42303 >
>>>>> 172.27.31.189.9100 ttl 64 tos 0x0 id 36932 off 0 [DF] len 52 proto TCP
>>>>> (6) flags [S] seq 2266263186 win 42340
>>>>>     upcall_ret (125/3904992932890052) ret 0
>>>>>
>>>>> I am wondering if a failure to track the ct_state SYN is causing the
>>>>> returning ACK to drop ?
>>>>>
>>>>>   + 3904992936344421 [swapper/125] 0 [tp] skb:kfree_skb
>>>>> #ddf9204d21c48ff1d0b676c330c00 (skb 18382861792850913792) n 3 drop
>>>>> (TC_INGRESS)
>>>>>     if 33 (genev_sys_6081) rxif 33 172.27.31.189.9100 >
>>>>> 172.27.16.11.42303 ttl 64 tos 0x0 id 0 off 0 [DF] len 52 proto TCP (6)
>>>>> flags [S.] seq 605271182 ack 2266263187 win 42340
>>>>>
>>>>> On Mon, 22 Apr 2024 at 18:54, Gavin McKee <gavmcke...@googlemail.com> 
>>>>> wrote:
>>>>>>
>>>>>> Ok @Adrian Moreno @Flavio Leitner
>>>>>>
>>>>>> Two more detailed Retis traces attached.  One is not working - the
>>>>>> same session that I can't establish a TCP session to on port 9010
>>>>>> 172.27.16.11.42303 > 172.27.31.189.9100
>>>>>>
>>>>>> Then I restart Open vSwtich and try again
>>>>>> 172.27.16.11.38793 > 172.27.31.189.9100 (this works post restart)
>>>>>>
>>>>>> It looks to me in the non working example that we -
>>>>>> SEND SYN -> exits the tunnel interface genev_sys_6081 via
>>>>>> enp148s0f0np0 - exactly as expected
>>>>>> RECV ACK -> tcp_gro_receive then -> net:netif_receive_skb where we hit
>>>>>> drop (TC_INGRESS)
>>>>>>
>>>>>> After a restart things seem to be very different
>>>>>>
>>>>>> Any ideas where to look next ?
>>>>
>>>> You mentioned that you're using 5.14.0-362.8.1.el9_3.x86 kernel.
>>>> RHEL 9.3 contains a large refactor for OVS connection tracking,
>>>> but it doesn't contain at least one fix for this refactor:
>>>>   
>>>> https://github.com/torvalds/linux/commit/e6345d2824a3f58aab82428d11645e0da861ac13
>>>>
>>>> This may cause all sorts of incorrect packet processing in the
>>>> kernel.  I'd suggest trying the latest upstream v6.8.7 kernel
>>>> that has all the known fixes or try the 9.4 beta kernel that
>>>> should have the fix mentioned above.  Downgrading to 9.2 may also
>>>> be an option since 9.2 doesn't contain the refactoring, IIRC.
>>>>
>>>> I'd not recommend running with this bug present anyway.
>>>>
>>>> Updating your OVN 23.09.1 to 23.09.3 may also be worth trying.
>>>> The fact that OVS restart fixes the issue may also indicate a
>>>> problem with incremental processing in ovn-controller.
>>>> Next time the issue happens try to force flow recompute with
>>>>   ovn-appctl -t ovn-controller recompute
>>>> And see if that fixes the issue.  If it does, it would be great
>>>> to have OpenFlow dumps before and after the recompute for
>>>> comparison.
>>>>
>>>> Best regards, Ilya Maximets.
>>

_______________________________________________
discuss mailing list
disc...@openvswitch.org
https://mail.openvswitch.org/mailman/listinfo/ovs-discuss

Reply via email to