On 9/26/24 18:55, Sri kor wrote: > Hi @Dumitru Ceara <dce...@redhat.com> and @Ilya Maximets > <i.maxim...@ovn.org> ,
Hi Srini, > I was hoping that this issue was fixed on *23.09.4*. The other members > in the community faced the similar issue. Looks like it was seen on 24.03 > also. The only kind of similar report I'm aware of is this one: https://github.com/ovn-org/ovn/issues/259 But that's strictly related to meters as far as I understand while your problem is related to groups. Are you referring to some other community reported issue? > Any recommendations? What exact OVN 24.03 version are you running? What exact OVS version are you running in that setup? Would it be possible to share the OVN NB and the OVS local databases? Thanks, Dumitru > > Thanks > Srini > > > > On Thu, Jun 6, 2024 at 5:54 PM Sri kor <esrik...@gmail.com> wrote: > >> Ok, thanks for the recommendation. >> >> On Thu, Jun 6, 2024 at 3:32 PM Dumitru Ceara <dce...@redhat.com> wrote: >> >>> On 6/6/24 22:41, Ilya Maximets wrote: >>>> On 6/6/24 20:59, Sri kor via discuss wrote: >>>>> Hi Team, >>>>> >>>>> Currently we are facing /ERR|group-table: out of table ids .W/e >>> are >>>>> running OVN 23.09 version and OVS 3.2.2. From the retis trace, the >>> packet >>>>> appears to be dropped shortly after the upcall is generated. The exact >>>>> reason for the drop isn't specified, but it indicates that the packet >>> is not >>>>> forwarded further within the OVS kernel datapath at this point. >>>>> >>>>> As per https://issues.redhat.com/browse/FDP-70 , this issue was fixed >>> in OVN 23.09. >>>>> >>>>> >>>>> Jun 05 21:21:05 vaeq-cu1a-r207-prod-hv-03.vaeq-cu.infra.cx >>> systemd[1]: Started OVN controller daemon. >>>>> Jun 05 21:21:05 vaeq-cu1a-r207-prod-hv-03.vaeq-cu.infra.cx >>> ovn-controller[1253156]: ovs|00023|extend_table|ERR|*table group-table: out >>> of table ids.* >>>>> >>>>> >>>>> >>>>> [root@cloud-user]# >>>>> >>>>> Jun 05 21:21:05 vaeq-cu1a-r207-prod-hv-03.vaeq-cu.infra.cx >>> systemd[1]: Started OVN controller daemon. >>>>> >>>>> Jun 05 21:21:05 vaeq-cu1a-r207-prod-hv-03.vaeq-cu.infra.cx >>> ovn-controller[1253156]: ovs|00023|extend_table|ERR|table group-table: out >>> of table ids. >>>>> >>>>> >>>>> #retis sort /tmp/playground-test1.json >>>>> >>>>> >>>>> 1306445529784444 (102) [swapper/102] 0 [tp] openvswitch:ovs_dp_upcall >>> #4a4348db8a07cff2a94af7da68c00 (skb ff2a93b2b82afd00) n 0 >>>>> if 4 (enp148s0f0np0) rxif 4 91.107.186.166.55805 > 204.52.24.59.22 >>> ttl 235 tos 0x0 id 54321 off 0 len 40 proto TCP (6) flags [S] seq >>> 3425966139 win 65535 >>>>> upcall (miss) port 2774067634 cpu 102 >>>>> * + 1306445529798391 (102) [swapper/102] 0 [tp] skb:kfree_skb >>> #4a4348db8a07cff2a94af7da68c00 (skb ff2a93b2b82afd00) n 1 drop (reason >>> NOT_SPECIFIED)* >>>>> * * if 4 (enp148s0f0np0) rxif 4 >>>>> + 1306445529802935 (102) [swapper/102] 0 [kr] queue_userspace_packet >>> #4a4348db8a07cff2a94af7da68c00 (skb ff2a93b2b82afd00) n 2 >>>>> if 4 (enp148s0f0np0) rxif 4 91.107.186.166.55805 > 204.52.24.59.22 >>> ttl 235 tos 0x0 id 54321 off 0 len 40 proto TCP (6) flags [S] seq >>> 3425966139 win 65535 >>>>> upcall_enqueue (miss) (102/1306445529784444) q 1636019689 ret 0 >>>>> + 1306445529807829 (102) [swapper/102] 0 [kr] ovs_dp_upcall >>> #4a4348db8a07cff2a94af7da68c00 (skb ff2a93b2b82afd00) n 3 >>>>> if 4 (enp148s0f0np0) rxif 4 91.107.186.166.55805 > 204.52.24.59.22 >>> ttl 235 tos 0x0 id 54321 off 0 len 40 proto TCP (6) flags [S] seq >>> 3425966139 win 65535 >>>>> upcall_ret (102/1306445529784444) ret 0 >>>>> >>>>> >>>>> [root@cloud-user]# ovs-vsctl --version >>>>> ovs-vsctl (Open vSwitch) *3.2.2* >>>>> DB Schema 8.4.0 >>>>> >>>>> [root@vcloud-user]# ovn-controller --version >>>>> ovn-controller *23.09.1* >>>> >>>> Are you building this package yourself? If so, on which exact commit >>> it is based? >>>> If not, what distribution are you using and what is the exact rpm/deb >>> package version? >>>> >>>> My suspicion is that it is not exactly v23.09.1, but a code a few >>> commits earlier >>>> than that. In this case, it may not include the fix. >>>> >>> >>> I agree, the versions listed above look a bit off. If I run OVN >>> v23.09.1 in a sandbox I get: >>> >>> $ ovn-controller --version >>> ovn-controller 23.09.1 >>> Open vSwitch Library 3.3.90 <<< this differs from 3.2.2 listed above >>> OpenFlow versions 0x6:0x6 >>> SB DB Schema 20.29. >>> >>> Checking when we bumped the OVS submodule from 3.2.2 to the tip (at that >>> moment) of 3.3, it was: >>> 1fa7628db415 ("ovs: Bump submodule to include E721 fixes.") >>> >>> The log between that version and the actual v23.09.1 release is: >>> $ git log --oneline 1fa7628db415..v23.09.1 >>> 0afd4e59e9 (HEAD, tag: v23.09.1) Set release date for 23.09.1. >>> <snip> >>> e9e716ad53 controller: Don't artificially limit group and meter IDs to >>> 16bit. >>> <snip> >>> 627955eb79 ci: Pin Python, Fedora and Ubuntu runner versions. >>> >>> What we need is actually: >>> >>> commit e9e716ad531e34766d2f02783ac08955096bf636 >>> Author: Dumitru Ceara <dce...@redhat.com> >>> Date: Tue Oct 31 18:00:44 2023 +0100 >>> >>> controller: Don't artificially limit group and meter IDs to 16bit. >>> >>> There were a few follow up fixes for it though: >>> acc63727d14f ("controller: fix group_table and meter_table allocation") >>> c0c9e5074704 ("features.c: Always wait on the rconn.") >>> 40b670e6ee94 ("ovn-controller: Fix busy loop when ofctrl is >>> disconnected.") >>> >>> So I guess the recommendation would be to use the most recent v23.09 >>> release, >>> that is: v23.09.4 >>> >>> Regards, >>> Dumitru >>> >>> > _______________________________________________ discuss mailing list disc...@openvswitch.org https://mail.openvswitch.org/mailman/listinfo/ovs-discuss