If you look at both traces (non working and working) the thing that stands out to me is this
At line 10 in the working file the following entry exists ct_state NEW tcp (SYN_SENT) orig [172.27.16.11.38793 > 172.27.31.189.9100] reply [172.27.31.189.9100 > 172.27.16.11.38793] zone 195 his doesn't happen in the non working file - I just see the following 3904992932904126 [swapper/125] 0 [kr] queue_userspace_packet #ddf92049d47aeff1d0b6625620000 (skb 18382861792850905088) n 4 if 38 (enp148s0f0_1) rxif 38 172.27.16.11.42303 > 172.27.31.189.9100 ttl 64 tos 0x0 id 36932 off 0 [DF] len 52 proto TCP (6) flags [S] seq 2266263186 win 42340 upcall_enqueue (miss) (125/3904992932890052) q 3682874017 ret 0 + 3904992932907247 [swapper/125] 0 [kr] ovs_dp_upcall #ddf92049d47aeff1d0b6625620000 (skb 18382861792850905088) n 5 if 38 (enp148s0f0_1) rxif 38 172.27.16.11.42303 > 172.27.31.189.9100 ttl 64 tos 0x0 id 36932 off 0 [DF] len 52 proto TCP (6) flags [S] seq 2266263186 win 42340 upcall_ret (125/3904992932890052) ret 0 I am wondering if a failure to track the ct_state SYN is causing the returning ACK to drop ? + 3904992936344421 [swapper/125] 0 [tp] skb:kfree_skb #ddf9204d21c48ff1d0b676c330c00 (skb 18382861792850913792) n 3 drop (TC_INGRESS) if 33 (genev_sys_6081) rxif 33 172.27.31.189.9100 > 172.27.16.11.42303 ttl 64 tos 0x0 id 0 off 0 [DF] len 52 proto TCP (6) flags [S.] seq 605271182 ack 2266263187 win 42340 On Mon, 22 Apr 2024 at 18:54, Gavin McKee <gavmcke...@googlemail.com> wrote: > > Ok @Adrian Moreno @Flavio Leitner > > Two more detailed Retis traces attached. One is not working - the > same session that I can't establish a TCP session to on port 9010 > 172.27.16.11.42303 > 172.27.31.189.9100 > > Then I restart Open vSwtich and try again > 172.27.16.11.38793 > 172.27.31.189.9100 (this works post restart) > > It looks to me in the non working example that we - > SEND SYN -> exits the tunnel interface genev_sys_6081 via > enp148s0f0np0 - exactly as expected > RECV ACK -> tcp_gro_receive then -> net:netif_receive_skb where we hit > drop (TC_INGRESS) > > After a restart things seem to be very different > > Any ideas where to look next ? > > On Mon, 22 Apr 2024 at 12:49, Gavin McKee <gavmcke...@googlemail.com> wrote: > > > > Hi Guys, > > > > > > We have had another occurrence of the issue today. > > > > In short - we try to open a connection from 172.27.22.90 -> > > 172.27.31.189 on port 9100. We see the SYN being received and the ACK > > being sent from the server back to the client 172.27.22.90. The > > server retransmits the ACK as it's never received by the server. > > > > When we run retis on the server side the ACK is being dropped - see > > (drop (TC_INGRESS) below) > > > > retis -p generic collect -c ovs,ct --ovs-track -f 'port 9100' > > > > (filtered on the ephemeral TCP port 40735) > > > > 3879878550398423 [swapper/125] 0 [tp] net:napi_gro_receive_entry > > #dc8ba9ec4a5d7ff4167e4dd466000 (skb 18393096588613658112) n 0 > > if 37 (enp148s0f0_1) > > + 3879878550401407 [swapper/125] 0 [k] tcp_gro_receive > > #dc8ba9ec4a5d7ff4167e4dd466000 (skb 18393096588613658112) n 1 > > if 37 (enp148s0f0_1) 172.27.22.90.40735 > 172.27.31.189.9100 ttl > > 64 tos 0x0 id 31996 off 0 [DF] len 52 proto TCP (6) flags [S] seq > > 1864480616 win 42340 > > + 3879878550402477 [swapper/125] 0 [tp] net:netif_receive_skb > > #dc8ba9ec4a5d7ff4167e4dd466000 (skb 18393096588613658112) n 2 > > if 37 (enp148s0f0_1) 172.27.22.90.40735 > 172.27.31.189.9100 ttl > > 64 tos 0x0 id 31996 off 0 [DF] len 52 proto TCP (6) flags [S] seq > > 1864480616 win 42340 > > + 3879878550409874 [swapper/125] 0 [tp] net:net_dev_queue > > #dc8ba9ec4a5d7ff4167e4dd466000 (skb 18393096588613658112) n 3 > > if 43 (genev_sys_6081) rxif 37 172.27.22.90.40735 > > > 172.27.31.189.9100 ttl 64 tos 0x0 id 31996 off 0 [DF] len 52 proto TCP > > (6) flags [S] seq 1864480616 win 42340 > > + 3879878550411535 [swapper/125] 0 [tp] net:net_dev_start_xmit > > #dc8ba9ec4a5d7ff4167e4dd466000 (skb 18393096588613658112) n 4 > > if 43 (genev_sys_6081) rxif 37 172.27.22.90.40735 > > > 172.27.31.189.9100 ttl 64 tos 0x0 id 31996 off 0 [DF] len 52 proto TCP > > (6) flags [S] seq 1864480616 win 42340 > > + 3879878550419012 [swapper/125] 0 [k] skb_scrub_packet > > #dc8ba9ec4a5d7ff4167e4dd466000 (skb 18393096588613658112) n 5 > > if 43 (genev_sys_6081) rxif 37 172.27.22.90.40735 > > > 172.27.31.189.9100 ttl 64 tos 0x0 id 31996 off 0 [DF] len 52 proto TCP > > (6) flags [S] seq 1864480616 win 42340 > > + 3879878550421222 [swapper/125] 0 [k] skb_scrub_packet > > #dc8ba9ec4a5d7ff4167e4dd466000 (skb 18393096588613658112) n 6 > > if 43 (genev_sys_6081) 172.27.22.90.39026 > 172.27.31.189.6081 ttl > > 64 tos 0x0 id 31996 off 0 [DF] len 52 proto TCP (6) flags [] seq > > 5916145:5916177 win 62976 > > + 3879878550424101 [swapper/125] 0 [k] ip_output > > #dc8ba9ec4a5d7ff4167e4dd466000 (skb 18393096588613658112) n 7 > > if 43 (genev_sys_6081) 10.25.24.43.39026 > 10.25.25.41.6081 ttl 64 > > tos 0x0 id 53033 off 0 len 110 proto UDP (17) len 82 > > ct_state NEW udp orig [10.25.24.43.39026 > 10.25.25.41.6081] reply > > [10.25.25.41.6081 > 10.25.24.43.39026] zone 0 > > > > > > > > 3879878550702925 [swapper/125] 0 [tp] net:napi_gro_receive_entry > > #dc8ba9ec94b4dff4167e881720000 (skb 18393096588613649408) n 0 > > if 43 (genev_sys_6081) > > + 3879878550705930 [swapper/125] 0 [k] tcp_gro_receive > > #dc8ba9ec94b4dff4167e881720000 (skb 18393096588613649408) n 1 > > if 43 (genev_sys_6081) 172.27.31.189.9100 > 172.27.22.90.40735 ttl > > 64 tos 0x0 id 0 off 0 [DF] len 52 proto TCP (6) flags [S.] seq > > 3930851750 ack 1864480617 win 42340 > > + 3879878550706899 [swapper/125] 0 [tp] net:netif_receive_skb > > #dc8ba9ec94b4dff4167e881720000 (skb 18393096588613649408) n 2 > > if 43 (genev_sys_6081) 172.27.31.189.9100 > 172.27.22.90.40735 ttl > > 64 tos 0x0 id 0 off 0 [DF] len 52 proto TCP (6) flags [S.] seq > > 3930851750 ack 1864480617 win 42340 > > + 3879878550708324 [swapper/125] 0 [tp] skb:kfree_skb > > #dc8ba9ec94b4dff4167e881720000 (skb 18393096588613649408) n 3 drop > > (TC_INGRESS) > > if 43 (genev_sys_6081) rxif 43 172.27.31.189.9100 > > > 172.27.22.90.40735 ttl 64 tos 0x0 id 0 off 0 [DF] len 52 proto TCP (6) > > flags [S.] seq 3930851750 ack 1864480617 win 42340 > > > > 3879879581610887 [swapper/125] 0 [tp] net:napi_gro_receive_entry > > #dc8badc3bb387ff4167e881720400 (skb 18393097140186661632) n 0 > > if 43 (genev_sys_6081) > > + 3879879581614196 [swapper/125] 0 [k] tcp_gro_receive > > #dc8badc3bb387ff4167e881720400 (skb 18393097140186661632) n 1 > > if 43 (genev_sys_6081) 172.27.31.189.9100 > 172.27.22.90.40735 ttl > > 64 tos 0x0 id 0 off 0 [DF] len 52 proto TCP (6) flags [S.] seq > > 3930851750 ack 1864480617 win 42340 > > + 3879879581615551 [swapper/125] 0 [tp] net:netif_receive_skb > > #dc8badc3bb387ff4167e881720400 (skb 18393097140186661632) n 2 > > if 43 (genev_sys_6081) 172.27.31.189.9100 > 172.27.22.90.40735 ttl > > 64 tos 0x0 id 0 off 0 [DF] len 52 proto TCP (6) flags [S.] seq > > 3930851750 ack 1864480617 win 42340 > > + 3879879581617380 [swapper/125] 0 [tp] skb:kfree_skb > > #dc8badc3bb387ff4167e881720400 (skb 18393097140186661632) n 3 drop > > (TC_INGRESS) > > if 43 (genev_sys_6081) rxif 43 172.27.31.189.9100 > > > 172.27.22.90.40735 ttl 64 tos 0x0 id 0 off 0 [DF] len 52 proto TCP (6) > > flags [S.] seq 3930851750 ack 1864480617 win 42340 > > > > > > When doing a tc monitor I am not even seeing the tc monitor entry for > > this session (tcp src port 40735) > > > > Again a restart of Open vSwitch fixes the issue - but this is a heavy > > hammer! > > > > Gav > > > > On Thu, 18 Apr 2024 at 17:55, Gavin McKee <gavmcke...@googlemail.com> wrote: > > > > > > I thought ct was in that retis trace . I’ll need to capture these events > > > when they occur again . > > > > > > I’m also having some issues with huge amounts of loss in the kernal > > > pipeline that I can’t explain . > > > > > > When sending traffic VM to VM within a logical switch but going across > > > GENEVE tunnel I get 30 - 40 Gbps on iperf . When I use the DAT (by > > > sending traffic to the VM DNAT ) I get huge amounts of loss in the sender > > > kernal , retis is saying sk_bff drops . > > > > > > 3.2.2 is absolutely killing me right now. Should I open another thread on > > > this ? > > > > > > On Thu, Apr 18, 2024 at 00:15 Adrian Moreno <amore...@redhat.com> wrote: > > >> > > >> Hi Gavin > > >> > > >> On 4/18/24 02:38, Gavin McKee via discuss wrote: > > >> > This is an example. > > >> > > > >> > Again the TCP 3 handshake completes , but the next packet fails to NAT > > >> > and goes out onto the physical network using the private address . An > > >> > example of this is in the packet trace I provided. > > >> > > > >> > > >> Given you were using retis in your initial troubleshooting, you can use > > >> it with > > >> the additional "ct" collector to see if the kernel datapath is > > >> retrieving the > > >> right conntrack entry and what's its state. If that shows some unexpected > > >> conntrack entry change, it can be confirmed by monitoring "conntrack -E". > > >> > > >> Additionally, a dp flow dump (ovs-appctl dpctl/dump-flows -m) when the > > >> problem > > >> is happening might be also useful. > > >> > > >> -- > > >> Adrián > > >> > > >> > > >> > ovs-appctl ofproto/trace br-int > > >> > in_port=7753,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,tcp,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_ttl=32,tcp_src=52776,tcp_dst=443,tcp_flags=2 > > >> > Flow: > > >> > tcp,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=32,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > >> > > > >> > bridge("br-int") > > >> > ---------------- > > >> > 0. in_port=7753, priority 100, cookie 0xc33f39c4 > > >> > set_field:0x16d->reg13 > > >> > set_field:0x155->reg11 > > >> > set_field:0x1cf->reg12 > > >> > set_field:0x12->metadata > > >> > set_field:0xca->reg14 > > >> > resubmit(,8) > > >> > 8. metadata=0x12, priority 50, cookie 0x1645d3f2 > > >> > set_field:0/0x1000->reg10 > > >> > resubmit(,73) > > >> > 73. > > >> > ip,reg14=0xca,metadata=0x12,dl_src=4e:42:14:a1:2a:fb,nw_src=172.27.18.244, > > >> > priority 90, cookie 0xc33f39c4 > > >> > set_field:0/0x1000->reg10 > > >> > move:NXM_NX_REG10[12]->NXM_NX_XXREG0[111] > > >> > -> NXM_NX_XXREG0[111] is now 0 > > >> > resubmit(,9) > > >> > 9. metadata=0x12, priority 0, cookie 0xcc4fd106 > > >> > resubmit(,10) > > >> > 10. metadata=0x12, priority 0, cookie 0x9e10ad0e > > >> > resubmit(,11) > > >> > 11. metadata=0x12, priority 0, cookie 0x557f3249 > > >> > resubmit(,12) > > >> > 12. ip,metadata=0x12, priority 100, cookie 0x14131a67 > > >> > > > >> > set_field:0x1000000000000000000000000/0x1000000000000000000000000->xxreg0 > > >> > resubmit(,13) > > >> > 13. metadata=0x12, priority 0, cookie 0x85f9ed4f > > >> > resubmit(,14) > > >> > 14. ip,reg0=0x1/0x1,metadata=0x12, priority 100, cookie 0x279651c > > >> > ct(table=15,zone=NXM_NX_REG13[0..15]) > > >> > drop > > >> > -> A clone of the packet is forked to recirculate. The forked > > >> > pipeline will be resumed at table 15. > > >> > -> Sets the packet to an untracked state, and clears all the > > >> > conntrack fields. > > >> > > > >> > Final flow: > > >> > tcp,reg0=0x1,reg11=0x155,reg12=0x1cf,reg13=0x16d,reg14=0xca,metadata=0x12,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=32,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > >> > Megaflow: > > >> > recirc_id=0,eth,tcp,in_port=7753,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_frag=no > > >> > Datapath actions: ct(zone=365),recirc(0x13b5a) > > >> > > > >> > =============================================================================== > > >> > recirc(0x13b5a) - resume conntrack with default ct_state=trk|new (use > > >> > --ct-next to customize) > > >> > =============================================================================== > > >> > > > >> > Flow: > > >> > recirc_id=0x13b5a,ct_state=new|trk,ct_zone=365,eth,tcp,reg0=0x1,reg11=0x155,reg12=0x1cf,reg13=0x16d,reg14=0xca,metadata=0x12,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=32,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > >> > > > >> > bridge("br-int") > > >> > ---------------- > > >> > thaw > > >> > Resuming from table 15 > > >> > 15. ct_state=+new-est+trk,metadata=0x12, priority 7, cookie 0xa9e0ee6f > > >> > > > >> > set_field:0x80000000000000000000000000/0x80000000000000000000000000->xxreg0 > > >> > > > >> > set_field:0x200000000000000000000000000/0x200000000000000000000000000->xxreg0 > > >> > resubmit(,16) > > >> > 16. conj_id=2865573479,tcp,reg0=0x80/0x80,reg14=0xca,metadata=0x12, > > >> > priority 3000, cookie 0xabdec111 > > >> > set_field:0x1000000000000/0x1000000000000->xreg4 > > >> > > > >> > set_field:0x2000000000000000000000000/0x2000000000000000000000000->xxreg0 > > >> > resubmit(,17) > > >> > 17. reg8=0x10000/0x10000,metadata=0x12, priority 1000, cookie > > >> > 0x8171c04a > > >> > set_field:0/0x1000000000000->xreg4 > > >> > set_field:0/0x2000000000000->xreg4 > > >> > set_field:0/0x4000000000000->xreg4 > > >> > resubmit(,18) > > >> > 18. metadata=0x12, priority 0, cookie 0x62454929 > > >> > resubmit(,19) > > >> > 19. metadata=0x12, priority 0, cookie 0x3bb47080 > > >> > resubmit(,20) > > >> > 20. metadata=0x12, priority 0, cookie 0x73face9d > > >> > resubmit(,21) > > >> > 21. metadata=0x12, priority 0, cookie 0x8e46634d > > >> > resubmit(,22) > > >> > 22. metadata=0x12, priority 0, cookie 0xbddac461 > > >> > resubmit(,23) > > >> > 23. metadata=0x12, priority 0, cookie 0x9a32c0de > > >> > resubmit(,24) > > >> > 24. metadata=0x12, priority 0, cookie 0x79b1c074 > > >> > resubmit(,25) > > >> > 25. metadata=0x12, priority 0, cookie 0xc02374d3 > > >> > resubmit(,26) > > >> > 26. metadata=0x12, priority 0, cookie 0x218dc750 > > >> > resubmit(,27) > > >> > 27. metadata=0x12, priority 0, cookie 0xf6943631 > > >> > set_field:0/0x1000000000000->xreg4 > > >> > set_field:0/0x2000000000000->xreg4 > > >> > set_field:0/0x4000000000000->xreg4 > > >> > resubmit(,28) > > >> > 28. ip,reg0=0x2/0x2002,metadata=0x12, priority 100, cookie 0x7fdca4fb > > >> > > > >> > ct(commit,zone=NXM_NX_REG13[0..15],nat(src),exec(set_field:0/0x1->ct_mark)) > > >> > nat(src) > > >> > set_field:0/0x1->ct_mark > > >> > -> Sets the packet to an untracked state, and clears all the > > >> > conntrack fields. > > >> > resubmit(,29) > > >> > 29. metadata=0x12, priority 0, cookie 0x34e5fac7 > > >> > resubmit(,30) > > >> > 30. metadata=0x12, priority 0, cookie 0xac3f53bd > > >> > resubmit(,31) > > >> > 31. metadata=0x12, priority 0, cookie 0x54000bd5 > > >> > resubmit(,32) > > >> > 32. metadata=0x12, priority 0, cookie 0x29ab49f3 > > >> > resubmit(,33) > > >> > 33. metadata=0x12, priority 0, cookie 0xae6aefcb > > >> > resubmit(,34) > > >> > 34. metadata=0x12, priority 0, cookie 0x632dc93b > > >> > resubmit(,35) > > >> > 35. metadata=0x12,dl_dst=1a:16:b1:58:e1:cd, priority 50, cookie > > >> > 0x3fa33935 > > >> > set_field:0x1->reg15 > > >> > resubmit(,37) > > >> > 37. priority 0 > > >> > resubmit(,39) > > >> > 39. priority 0 > > >> > resubmit(,40) > > >> > 40. reg15=0x1,metadata=0x12, priority 100, cookie 0x95c785db > > >> > set_field:0x155->reg11 > > >> > set_field:0x1cf->reg12 > > >> > resubmit(,41) > > >> > 41. priority 0 > > >> > set_field:0->reg0 > > >> > set_field:0->reg1 > > >> > set_field:0->reg2 > > >> > set_field:0->reg3 > > >> > set_field:0->reg4 > > >> > set_field:0->reg5 > > >> > set_field:0->reg6 > > >> > set_field:0->reg7 > > >> > set_field:0->reg8 > > >> > set_field:0->reg9 > > >> > resubmit(,42) > > >> > 42. ip,reg15=0x1,metadata=0x12, priority 110, cookie 0xc9a79824 > > >> > resubmit(,43) > > >> > 43. ip,reg15=0x1,metadata=0x12, priority 110, cookie 0xac7d5d78 > > >> > resubmit(,44) > > >> > 44. metadata=0x12, priority 0, cookie 0xa1ddb4f6 > > >> > resubmit(,45) > > >> > 45. ct_state=-trk,metadata=0x12, priority 5, cookie 0xb2622a65 > > >> > > > >> > set_field:0x100000000000000000000000000/0x100000000000000000000000000->xxreg0 > > >> > > > >> > set_field:0x200000000000000000000000000/0x200000000000000000000000000->xxreg0 > > >> > resubmit(,46) > > >> > 46. metadata=0x12, priority 0, cookie 0x267ae0e3 > > >> > resubmit(,47) > > >> > 47. metadata=0x12, priority 0, cookie 0x914392c3 > > >> > set_field:0/0x1000000000000->xreg4 > > >> > set_field:0/0x2000000000000->xreg4 > > >> > set_field:0/0x4000000000000->xreg4 > > >> > resubmit(,48) > > >> > 48. metadata=0x12, priority 0, cookie 0xc4f6a97f > > >> > resubmit(,49) > > >> > 49. metadata=0x12, priority 0, cookie 0x8ebb0c71 > > >> > resubmit(,50) > > >> > 50. metadata=0x12, priority 0, cookie 0x61e385f4 > > >> > resubmit(,51) > > >> > 51. metadata=0x12, priority 0, cookie 0x4c722ce1 > > >> > set_field:0/0x1000->reg10 > > >> > resubmit(,75) > > >> > 75. No match. > > >> > drop > > >> > move:NXM_NX_REG10[12]->NXM_NX_XXREG0[111] > > >> > -> NXM_NX_XXREG0[111] is now 0 > > >> > resubmit(,52) > > >> > 52. metadata=0x12, priority 0, cookie 0x3c01b89d > > >> > resubmit(,64) > > >> > 64. priority 0 > > >> > resubmit(,65) > > >> > 65. reg15=0x1,metadata=0x12, priority 100, cookie 0x95c785db > > >> > > > >> > clone(ct_clear,set_field:0->reg11,set_field:0->reg12,set_field:0->reg13,set_field:0x4cc->reg11,set_field:0x383->reg12,set_field:0x13->metadata,set_field:0x1->reg14,set_field:0->reg10,set_field:0->reg15,set_field:0->reg0,set_field:0->reg1,set_field:0->reg2,set_field:0->reg3,set_field:0->reg4,set_field:0->reg5,set_field:0->reg6,set_field:0->reg7,set_field:0->reg8,set_field:0->reg9,resubmit(,8)) > > >> > ct_clear > > >> > set_field:0->reg11 > > >> > set_field:0->reg12 > > >> > set_field:0->reg13 > > >> > set_field:0x4cc->reg11 > > >> > set_field:0x383->reg12 > > >> > set_field:0x13->metadata > > >> > set_field:0x1->reg14 > > >> > set_field:0->reg10 > > >> > set_field:0->reg15 > > >> > set_field:0->reg0 > > >> > set_field:0->reg1 > > >> > set_field:0->reg2 > > >> > set_field:0->reg3 > > >> > set_field:0->reg4 > > >> > set_field:0->reg5 > > >> > set_field:0->reg6 > > >> > set_field:0->reg7 > > >> > set_field:0->reg8 > > >> > set_field:0->reg9 > > >> > resubmit(,8) > > >> > 8. reg14=0x1,metadata=0x13,dl_dst=1a:16:b1:58:e1:cd, priority 50, > > >> > cookie 0x8522832c > > >> > > > >> > set_field:0x1a16b158e1cd0000000000000000/0xffffffffffff0000000000000000->xxreg0 > > >> > resubmit(,9) > > >> > 9. metadata=0x13, priority 0, cookie 0xef4dd8a9 > > >> > set_field:0x4/0x4->xreg4 > > >> > resubmit(,10) > > >> > 10. reg9=0x4/0x4,metadata=0x13, priority 100, cookie 0x37718dd7 > > >> > resubmit(,79) > > >> > 79. > > >> > ip,reg14=0x1,metadata=0x13,dl_src=4e:42:14:a1:2a:fb,nw_src=172.27.18.244, > > >> > priority 100, cookie 0x2e544767 > > >> > drop > > >> > resubmit(,11) > > >> > 11. metadata=0x13, priority 0, cookie 0x3c2eb1b > > >> > resubmit(,12) > > >> > 12. metadata=0x13, priority 0, cookie 0x44f62446 > > >> > resubmit(,13) > > >> > 13. metadata=0x13, priority 0, cookie 0x3842bec6 > > >> > resubmit(,14) > > >> > 14. metadata=0x13, priority 0, cookie 0xfd7b2ab9 > > >> > resubmit(,15) > > >> > 15. metadata=0x13, priority 0, cookie 0xefbd7e27 > > >> > resubmit(,16) > > >> > 16. metadata=0x13, priority 0, cookie 0xf439d853 > > >> > resubmit(,17) > > >> > 17. metadata=0x13, priority 0, cookie 0x123f01f0 > > >> > resubmit(,18) > > >> > 18. metadata=0x13, priority 0, cookie 0x142cd59b > > >> > resubmit(,19) > > >> > 19. metadata=0x13, priority 0, cookie 0x297e0190 > > >> > resubmit(,20) > > >> > 20. metadata=0x13, priority 0, cookie 0x61e9e3c7 > > >> > set_field:0/0xffffffff->xxreg1 > > >> > resubmit(,21) > > >> > 21. ip,reg7=0,metadata=0x13, priority 1, cookie 0x9f114f3d > > >> > dec_ttl() > > >> > set_field:0/0xffff00000000->xreg4 > > >> > > > >> > set_field:0x64646401000000000000000000000000/0xffffffff000000000000000000000000->xxreg0 > > >> > > > >> > set_field:0x646464030000000000000000/0xffffffff0000000000000000->xxreg0 > > >> > set_field:2a:d5:d1:e8:89:cc->eth_src > > >> > set_field:0x2->reg15 > > >> > set_field:0x1/0x1->reg10 > > >> > resubmit(,22) > > >> > 22. reg8=0/0xffff,metadata=0x13, priority 150, cookie 0x6ece899a > > >> > resubmit(,23) > > >> > 23. metadata=0x13, priority 0, cookie 0x72437536 > > >> > set_field:0/0xffff00000000->xreg4 > > >> > resubmit(,24) > > >> > 24. reg8=0/0xffff,metadata=0x13, priority 150, cookie 0xce06f1 > > >> > resubmit(,25) > > >> > 25. ip,metadata=0x13, priority 1, cookie 0xf690342c > > >> > push:NXM_NX_REG0[] > > >> > push:NXM_NX_XXREG0[96..127] > > >> > pop:NXM_NX_REG0[] > > >> > -> NXM_NX_REG0[] is now 0x64646401 > > >> > set_field:00:00:00:00:00:00->eth_dst > > >> > resubmit(,66) > > >> > 66. reg0=0x64646401,reg15=0x2,metadata=0x13, priority 100, cookie > > >> > 0x97b4353d > > >> > set_field:00:00:5e:00:01:ff->eth_dst > > >> > set_field:0x40/0x40->reg10 > > >> > pop:NXM_NX_REG0[] > > >> > -> NXM_NX_REG0[] is now 0x64646401 > > >> > resubmit(,26) > > >> > 26. metadata=0x13, priority 0, cookie 0x6c2ad4cc > > >> > resubmit(,27) > > >> > 27. metadata=0x13, priority 0, cookie 0xadc9fb4f > > >> > resubmit(,28) > > >> > 28. ip,reg15=0x2,metadata=0x13,nw_src=172.27.18.244, priority 100, > > >> > cookie 0xfa923492 > > >> > set_field:4e:42:14:a1:2a:fb->eth_src > > >> > > > >> > set_field:0xcc3418740000000000000000/0xffffffff0000000000000000->xxreg0 > > >> > resubmit(,29) > > >> > 29. metadata=0x13, priority 0, cookie 0x571bbf77 > > >> > resubmit(,37) > > >> > 37. priority 0 > > >> > resubmit(,39) > > >> > 39. priority 0 > > >> > resubmit(,40) > > >> > 40. reg15=0x2,metadata=0x13, priority 100, cookie 0x407086d0 > > >> > set_field:0x4cc->reg11 > > >> > set_field:0x383->reg12 > > >> > resubmit(,41) > > >> > 41. priority 0 > > >> > set_field:0->reg0 > > >> > set_field:0->reg1 > > >> > set_field:0->reg2 > > >> > set_field:0->reg3 > > >> > set_field:0->reg4 > > >> > set_field:0->reg5 > > >> > set_field:0->reg6 > > >> > set_field:0->reg7 > > >> > set_field:0->reg8 > > >> > set_field:0->reg9 > > >> > resubmit(,42) > > >> > 42. metadata=0x13, priority 0, cookie 0x4c3b7fcc > > >> > set_field:0/0x10->xreg4 > > >> > resubmit(,43) > > >> > 43. ip,reg15=0x2,metadata=0x13,nw_src=172.27.18.244, priority 100, > > >> > cookie 0x540f90b2 > > >> > set_field:4e:42:14:a1:2a:fb->eth_src > > >> > ct(table=44,zone=NXM_NX_REG11[0..15],nat) > > >> > nat > > >> > -> A clone of the packet is forked to recirculate. The forked > > >> > pipeline will be resumed at table 44. > > >> > -> Sets the packet to an untracked state, and clears all the > > >> > conntrack fields. > > >> > > > >> > Final flow: > > >> > recirc_id=0x13b5a,eth,tcp,reg0=0x300,reg11=0x155,reg12=0x1cf,reg13=0x16d,reg14=0xca,reg15=0x1,metadata=0x12,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=32,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > >> > Megaflow: > > >> > recirc_id=0x13b5a,ct_state=+new-est-rel-rpl-inv+trk,ct_mark=0/0x1,eth,tcp,in_port=7753,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_dst=104.0.0.0/5,nw_ttl=32,nw_frag=no,tp_src=0x8000/0x8000,tp_dst=0x180/0xffc0 > > >> > Datapath actions: > > >> > ct(commit,zone=365,mark=0/0x1,nat(src)),set(eth(dst=00:00:5e:00:01:ff)),set(ipv4(ttl=31)),ct(zone=1228,nat),recirc(0x13b5c) > > >> > > > >> > =============================================================================== > > >> > recirc(0x13b5c) - resume conntrack with default ct_state=trk|new (use > > >> > --ct-next to customize) > > >> > Replacing src/dst IP/ports to simulate NAT: > > >> > Initial flow: > > >> > Modified flow: > > >> > =============================================================================== > > >> > > > >> > Flow: > > >> > recirc_id=0x13b5c,ct_state=new|trk,ct_zone=1228,eth,tcp,reg10=0x41,reg11=0x4cc,reg12=0x383,reg14=0x1,reg15=0x2,metadata=0x13,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=00:00:5e:00:01:ff,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=31,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > >> > > > >> > bridge("br-int") > > >> > ---------------- > > >> > thaw > > >> > Resuming from table 44 > > >> > 44. metadata=0x13, priority 0, cookie 0x48ef0c42 > > >> > resubmit(,45) > > >> > 45. ct_state=-rpl+trk,ip,reg15=0x2,metadata=0x13,nw_src=172.27.18.244, > > >> > priority 161, cookie 0x5414262a > > >> > set_field:4e:42:14:a1:2a:fb->eth_src > > >> > > > >> > ct(commit,table=46,zone=NXM_NX_REG12[0..15],nat(src=204.52.24.116)) > > >> > nat(src=204.52.24.116) > > >> > -> A clone of the packet is forked to recirculate. The forked > > >> > pipeline will be resumed at table 46. > > >> > -> Sets the packet to an untracked state, and clears all the > > >> > conntrack fields. > > >> > > > >> > Final flow: > > >> > recirc_id=0x13b5c,eth,tcp,reg10=0x41,reg11=0x4cc,reg12=0x383,reg14=0x1,reg15=0x2,metadata=0x13,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=00:00:5e:00:01:ff,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=31,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > >> > Megaflow: > > >> > recirc_id=0x13b5c,ct_state=-rpl+trk,eth,ip,in_port=7753,dl_src=4e:42:14:a1:2a:fb,nw_src=172.27.18.244,nw_frag=no > > >> > Datapath actions: > > >> > ct(commit,zone=899,nat(src=204.52.24.116)),recirc(0x25f9b) > > >> > > > >> > =============================================================================== > > >> > recirc(0x25f9b) - resume conntrack with default ct_state=trk|new (use > > >> > --ct-next to customize) > > >> > Replacing src/dst IP/ports to simulate NAT: > > >> > Initial flow: > > >> > nw_src=172.27.18.244,tp_src=52776,nw_dst=104.18.3.35,tp_dst=443 > > >> > Modified flow: > > >> > nw_src=204.52.24.116,tp_src=52776,nw_dst=104.18.3.35,tp_dst=443 > > >> > =============================================================================== > > >> > > > >> > Flow: > > >> > recirc_id=0x25f9b,ct_state=new|trk,ct_zone=899,eth,tcp,reg10=0x41,reg11=0x4cc,reg12=0x383,reg14=0x1,reg15=0x2,metadata=0x13,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=00:00:5e:00:01:ff,nw_src=204.52.24.116,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=31,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > >> > > > >> > bridge("br-int") > > >> > ---------------- > > >> > thaw > > >> > Resuming from table 46 > > >> > 46. metadata=0x13, priority 0, cookie 0x3800cb88 > > >> > resubmit(,47) > > >> > 47. metadata=0x13, priority 0, cookie 0xc2575be2 > > >> > resubmit(,48) > > >> > 48. reg15=0x2,metadata=0x13, priority 100, cookie 0xaf19a22a > > >> > resubmit(,64) > > >> > 64. reg10=0x1/0x1,reg15=0x2,metadata=0x13, priority 100, cookie > > >> > 0x407086d0 > > >> > push:NXM_OF_IN_PORT[] > > >> > set_field:ANY->in_port > > >> > resubmit(,65) > > >> > 65. reg15=0x2,metadata=0x13, priority 100, cookie 0x407086d0 > > >> > > > >> > clone(ct_clear,set_field:0->reg11,set_field:0->reg12,set_field:0->reg13,set_field:0x2c2->reg11,set_field:0x327->reg12,set_field:0x1->metadata,set_field:0xa->reg14,set_field:0->reg10,set_field:0->reg15,set_field:0->reg0,set_field:0->reg1,set_field:0->reg2,set_field:0->reg3,set_field:0->reg4,set_field:0->reg5,set_field:0->reg6,set_field:0->reg7,set_field:0->reg8,set_field:0->reg9,resubmit(,8)) > > >> > ct_clear > > >> > set_field:0->reg11 > > >> > set_field:0->reg12 > > >> > set_field:0->reg13 > > >> > set_field:0x2c2->reg11 > > >> > set_field:0x327->reg12 > > >> > set_field:0x1->metadata > > >> > set_field:0xa->reg14 > > >> > set_field:0->reg10 > > >> > set_field:0->reg15 > > >> > set_field:0->reg0 > > >> > set_field:0->reg1 > > >> > set_field:0->reg2 > > >> > set_field:0->reg3 > > >> > set_field:0->reg4 > > >> > set_field:0->reg5 > > >> > set_field:0->reg6 > > >> > set_field:0->reg7 > > >> > set_field:0->reg8 > > >> > set_field:0->reg9 > > >> > resubmit(,8) > > >> > 8. metadata=0x1, priority 50, cookie 0x1645d3f2 > > >> > set_field:0/0x1000->reg10 > > >> > resubmit(,73) > > >> > 73. No match. > > >> > drop > > >> > move:NXM_NX_REG10[12]->NXM_NX_XXREG0[111] > > >> > -> NXM_NX_XXREG0[111] is now 0 > > >> > resubmit(,9) > > >> > 9. metadata=0x1, priority 0, cookie 0xcc4fd106 > > >> > resubmit(,10) > > >> > 10. metadata=0x1, priority 0, cookie 0x9e10ad0e > > >> > resubmit(,11) > > >> > 11. metadata=0x1, priority 0, cookie 0x557f3249 > > >> > resubmit(,12) > > >> > 12. metadata=0x1, priority 0, cookie 0x915c56b1 > > >> > resubmit(,13) > > >> > 13. ip,reg14=0xa,metadata=0x1, priority 110, cookie 0xccdcd3e1 > > >> > resubmit(,14) > > >> > 14. metadata=0x1, priority 0, cookie 0x134ce32f > > >> > resubmit(,15) > > >> > 15. metadata=0x1, priority 65535, cookie 0x49627e5f > > >> > resubmit(,16) > > >> > 16. metadata=0x1, priority 65535, cookie 0xc947843d > > >> > resubmit(,17) > > >> > 17. metadata=0x1, priority 0, cookie 0xd2fb4d2 > > >> > resubmit(,18) > > >> > 18. metadata=0x1, priority 0, cookie 0x62454929 > > >> > resubmit(,19) > > >> > 19. metadata=0x1, priority 0, cookie 0x3bb47080 > > >> > resubmit(,20) > > >> > 20. metadata=0x1, priority 0, cookie 0x73face9d > > >> > resubmit(,21) > > >> > 21. metadata=0x1, priority 0, cookie 0x8e46634d > > >> > resubmit(,22) > > >> > 22. metadata=0x1, priority 0, cookie 0xbddac461 > > >> > resubmit(,23) > > >> > 23. metadata=0x1, priority 0, cookie 0x9a32c0de > > >> > resubmit(,24) > > >> > 24. metadata=0x1, priority 0, cookie 0x79b1c074 > > >> > resubmit(,25) > > >> > 25. metadata=0x1, priority 0, cookie 0xc02374d3 > > >> > resubmit(,26) > > >> > 26. metadata=0x1, priority 0, cookie 0x218dc750 > > >> > resubmit(,27) > > >> > 27. metadata=0x1, priority 0, cookie 0x8db4eebc > > >> > resubmit(,28) > > >> > 28. metadata=0x1, priority 0, cookie 0x6deecbbe > > >> > resubmit(,29) > > >> > 29. metadata=0x1, priority 0, cookie 0x34e5fac7 > > >> > resubmit(,30) > > >> > 30. metadata=0x1, priority 0, cookie 0xac3f53bd > > >> > resubmit(,31) > > >> > 31. metadata=0x1, priority 0, cookie 0x54000bd5 > > >> > resubmit(,32) > > >> > 32. metadata=0x1, priority 0, cookie 0x29ab49f3 > > >> > resubmit(,33) > > >> > 33. metadata=0x1, priority 0, cookie 0xae6aefcb > > >> > resubmit(,34) > > >> > 34. metadata=0x1, priority 0, cookie 0x632dc93b > > >> > resubmit(,35) > > >> > 35. metadata=0x1, priority 0, cookie 0x9961115c > > >> > set_field:0->reg15 > > >> > resubmit(,71) > > >> > 71. No match. > > >> > drop > > >> > resubmit(,36) > > >> > 36. reg15=0,metadata=0x1, priority 50, cookie 0x62255993 > > >> > set_field:0x8001->reg15 > > >> > resubmit(,37) > > >> > 37. priority 0 > > >> > resubmit(,39) > > >> > 39. priority 0 > > >> > resubmit(,40) > > >> > 40. reg15=0x8001,metadata=0x1, priority 100, cookie 0x11699429 > > >> > set_field:0x14->reg13 > > >> > set_field:0x1->reg15 > > >> > resubmit(,41) > > >> > 41. priority 0 > > >> > set_field:0->reg0 > > >> > set_field:0->reg1 > > >> > set_field:0->reg2 > > >> > set_field:0->reg3 > > >> > set_field:0->reg4 > > >> > set_field:0->reg5 > > >> > set_field:0->reg6 > > >> > set_field:0->reg7 > > >> > set_field:0->reg8 > > >> > set_field:0->reg9 > > >> > resubmit(,42) > > >> > 42. metadata=0x1, priority 0, cookie 0x9ad1480d > > >> > resubmit(,43) > > >> > 43. ip,reg15=0x1,metadata=0x1, priority 110, cookie > > >> > 0xca4977d3 > > >> > ct_clear > > >> > resubmit(,44) > > >> > 44. metadata=0x1, priority 0, cookie 0xa1ddb4f6 > > >> > resubmit(,45) > > >> > 45. metadata=0x1, priority 65535, cookie 0x91b4350e > > >> > resubmit(,46) > > >> > 46. metadata=0x1, priority 65535, cookie 0xfc9c351d > > >> > resubmit(,47) > > >> > 47. metadata=0x1, priority 0, cookie 0xb984696e > > >> > resubmit(,48) > > >> > 48. metadata=0x1, priority 0, cookie 0xc4f6a97f > > >> > resubmit(,49) > > >> > 49. metadata=0x1, priority 0, cookie 0x8ebb0c71 > > >> > resubmit(,50) > > >> > 50. metadata=0x1, priority 0, cookie 0x61e385f4 > > >> > resubmit(,51) > > >> > 51. metadata=0x1, priority 0, cookie 0x4c722ce1 > > >> > set_field:0/0x1000->reg10 > > >> > resubmit(,75) > > >> > 75. No match. > > >> > drop > > >> > move:NXM_NX_REG10[12]->NXM_NX_XXREG0[111] > > >> > -> NXM_NX_XXREG0[111] is now 0 > > >> > resubmit(,52) > > >> > 52. metadata=0x1, priority 0, cookie 0x3c01b89d > > >> > resubmit(,64) > > >> > 64. priority 0 > > >> > resubmit(,65) > > >> > 65. reg15=0x1,metadata=0x1, priority 100, cookie > > >> > 0x4581da82 > > >> > push_vlan:0x8100 > > >> > set_field:4216->vlan_vid > > >> > output:7754 > > >> > > > >> > bridge("br-provider") > > >> > --------------------- > > >> > 0. priority 0 > > >> > NORMAL > > >> > -> forwarding to learned port > > >> > pop_vlan > > >> > set_field:0x8001->reg15 > > >> > pop:NXM_OF_IN_PORT[] > > >> > -> NXM_OF_IN_PORT[] is now 7753 > > >> > > > >> > Final flow: unchanged > > >> > Megaflow: > > >> > recirc_id=0x25f9b,ct_state=+new-est-rel-rpl-inv+trk,ct_mark=0/0x1,eth,ip,in_port=7753,dl_src=4e:42:14:a1:2a:fb,dl_dst=00:00:5e:00:01:ff,nw_dst=0.0.0.0/1,nw_frag=no > > >> > Datapath actions: ct_clear,push_vlan(vid=120,pcp=0),5 > > >> > > > >> > On Wed, 17 Apr 2024 at 17:32, Gavin McKee <gavmcke...@googlemail.com> > > >> > wrote: > > >> >> > > >> >> That information is all in the email > > >> >> > > >> >> The openflow trace is showing that the pipeline is fine . This is why > > >> >> I’m worried about a deeper issue with the kernal / openvswitch kernal > > >> >> module / connection tracking > > >> >> > > >> >> > > >> >> > > >> >> On Wed, Apr 17, 2024 at 16:33 Flavio Leitner <f...@sysclose.org> > > >> >> wrote: > > >> >>> > > >> >>> On Wed, 17 Apr 2024 12:26:27 -0700 > > >> >>> Gavin McKee <gavmcke...@googlemail.com> wrote: > > >> >>> > > >> >>>> Hi Flavio, > > >> >>>> > > >> >>>> I had to restart the Open vSwitch across 16 machines to resolve the > > >> >>>> issue for a customer . I think it will occur again and when it does > > >> >>>> I'll use that command to gather the tc information. > > >> >>>> > > >> >>>> Until then I think I have found why the issue is occurring . > > >> >>>> > > >> >>>> Take a look at the output below (this is a packet capture from the > > >> >>>> physical interface on the compute node , so traffic that has gone > > >> >>>> through the OVS Openflow pipeline) - we make a 3 way handshake with > > >> >>>> R2 > > >> >>>> , and establish the connection. A packet goes missing - TLS > > >> >>>> handshake, it then appears that it hasn't gone through NAT as it's > > >> >>>> using the Private IP of the VM . > > >> >>> > > >> >>> > > >> >>> If that is the case, you might be able to see if the data path > > >> >>> is matching correctly and the actions are using NAT with > > >> >>> ovs-appctl ofproto/trace. > > >> >>> > > >> >>> fbl > > >> >>> > > >> >>> > > >> >>>> > > >> >>>> Take a look at frame 14 > > >> >>>> > > >> >>>> No. Time Source Destination > > >> >>>> Protocol Length Info > > >> >>>> Delta > > >> >>>> 14 09:24:08.064432 172.27.18.244 104.18.2.35 > > >> >>>> TLSv1 502 Client Hello, Alert (Level: Fatal, Description: > > >> >>>> Decode > > >> >>>> Error) 2.362983 > > >> >>>> > > >> >>>> Frame 14: 502 bytes on wire (4016 bits), 502 bytes captured (4016 > > >> >>>> bits) Ethernet II, Src: 4e:42:14:a1:2a:fb (4e:42:14:a1:2a:fb), Dst: > > >> >>>> IETF-VRRP-VRID_ff (00:00:5e:00:01:ff) > > >> >>>> 802.1Q Virtual LAN, PRI: 0, DEI: 0, ID: 120 > > >> >>>> Internet Protocol Version 4, Src: 172.27.18.244, Dst: 104.18.2.35 > > >> >>>> Transmission Control Protocol, Src Port: 57394, Dst Port: 443, Seq: > > >> >>>> 1, > > >> >>>> Ack: 1, Len: 444 > > >> >>>> Transport Layer Security > > >> >>>> TLSv1 Record Layer: Handshake Protocol: Client Hello > > >> >>>> Content Type: Handshake (22) > > >> >>>> Version: TLS 1.0 (0x0301) > > >> >>>> Length: 432 > > >> >>>> Handshake Protocol: Client Hello > > >> >>>> TLSv1 Record Layer: Alert (Level: Fatal, Description: Decode > > >> >>>> Error) Content Type: Alert (21) > > >> >>>> Version: TLS 1.0 (0x0301) > > >> >>>> Length: 2 > > >> >>>> Alert Message > > >> >>>> Level: Fatal (2) > > >> >>>> Description: Decode Error (50) > > >> >>>> > > >> >>>> > > >> >>>> > > >> >>>> ------------------------------------------------------------------------------------------------------------ > > >> >>>> > > >> >>>> On Wed, 17 Apr 2024 at 11:28, Flavio Leitner <f...@sysclose.org> > > >> >>>> wrote: > > >> >>>>> > > >> >>>>> > > >> >>>>> Hi Gavin, > > >> >>>>> > > >> >>>>> It would be helpful if you can provide some TC dumps from the > > >> >>>>> "good" state to the "bad" state to see how it was and what changes. > > >> >>>>> Something like: > > >> >>>>> > > >> >>>>> # tc -s filter show dev enp148s0f0_1 ingress > > >> >>>>> > > >> >>>>> I haven't checked the attached files, but one suggestion is to > > >> >>>>> check if this is not a csum issue. > > >> >>>>> > > >> >>>>> Thanks, > > >> >>>>> fbl > > >> >>>>> > > >> >>>>> > > >> >>>>> On Tue, 16 Apr 2024 13:17:10 -0700 > > >> >>>>> Gavin McKee via discuss <ovs-discuss@openvswitch.org> wrote: > > >> >>>>> > > >> >>>>>> Adding information relating to the Open VSwitch kernal module > > >> >>>>>> @Ilya Maximets @Numan Siddique Can either of you help out here? > > >> >>>>>> > > >> >>>>>> > > >> >>>>>> modinfo openvswitch > > >> >>>>>> filename: > > >> >>>>>> /lib/modules/5.14.0-362.8.1.el9_3.x86_64/kernel/net/openvswitch/openvswitch.ko.xz > > >> >>>>>> alias: net-pf-16-proto-16-family-ovs_ct_limit > > >> >>>>>> alias: net-pf-16-proto-16-family-ovs_meter > > >> >>>>>> alias: net-pf-16-proto-16-family-ovs_packet > > >> >>>>>> alias: net-pf-16-proto-16-family-ovs_flow > > >> >>>>>> alias: net-pf-16-proto-16-family-ovs_vport > > >> >>>>>> alias: net-pf-16-proto-16-family-ovs_datapath > > >> >>>>>> license: GPL > > >> >>>>>> description: Open vSwitch switching datapath > > >> >>>>>> rhelversion: 9.3 > > >> >>>>>> srcversion: 8A2159D727C8BADC82261B8 > > >> >>>>>> depends: nf_conntrack,nf_conncount,libcrc32c,nf_nat > > >> >>>>>> retpoline: Y > > >> >>>>>> intree: Y > > >> >>>>>> name: openvswitch > > >> >>>>>> vermagic: 5.14.0-362.8.1.el9_3.x86_64 SMP preempt mod_unload > > >> >>>>>> modversions sig_id: PKCS#7 > > >> >>>>>> signer: Rocky kernel signing key > > >> >>>>>> sig_key: > > >> >>>>>> 17:CA:DE:1F:EC:D1:59:2D:9F:52:34:C6:7C:09:06:81:3D:74:7C:F7 > > >> >>>>>> sig_hashalgo: sha256 signature: > > >> >>>>>> 67:31:56:70:86:DB:57:69:8D:4A:9B:A7:ED:17:F3:67:65:98:97:08: > > >> >>>>>> 1F:FB:4D:F8:A8:2D:7C:A7:7D:3A:57:85:CA:67:9D:82:72:EB:54:14: > > >> >>>>>> F2:BB:40:78:AD:85:56:2D:EF:D5:00:95:38:A4:86:9F:5F:29:1A:81: > > >> >>>>>> 32:94:B4:87:41:94:A0:3E:71:A5:97:44:2E:42:DD:F7:42:6B:69:94: > > >> >>>>>> E3:AB:6E:E5:4F:C9:60:57:70:07:5F:CA:C7:83:7A:2F:C7:81:62:FF: > > >> >>>>>> 53:AF:AC:2B:06:D8:08:D3:1D:A7:F0:43:10:98:DE:B1:62:AE:89:A5: > > >> >>>>>> FE:EF:74:09:0F:2D:0F:D9:73:A5:59:75:D0:87:1E:EA:3A:40:86:1E: > > >> >>>>>> 76:E5:E7:3B:59:2E:3A:7E:65:F3:92:A1:B4:84:48:3F:43:A0:D7:1C: > > >> >>>>>> 21:29:E0:B6:D1:10:36:15:88:43:6A:11:8F:55:EE:1B:F9:53:3B:86: > > >> >>>>>> EF:81:71:17:81:08:EC:53:30:D6:69:8E:13:11:D5:DF:15:75:88:50: > > >> >>>>>> 69:19:51:3B:41:6B:6F:E0:7A:30:33:32:E6:60:18:02:A6:0C:63:9B: > > >> >>>>>> C5:D7:2F:6A:D0:BA:45:03:19:0E:21:E8:18:FB:E8:D1:C1:33:05:36: > > >> >>>>>> 1F:9B:0F:29:3F:05:51:7A:30:86:88:B7:C7:44:2E:2B:50:F9:EF:4F: > > >> >>>>>> D4:70:EA:1B:33:E2:F0:E3:E2:88:00:E5:BF:06:E2:D4:B7:81:EE:6E: > > >> >>>>>> 89:02:18:65:8B:1C:84:42:2F:89:14:63:1D:51:70:37:42:C5:68:DD: > > >> >>>>>> 4D:12:7B:07:33:2B:C6:BC:8F:7F:23:D7:58:DF:47:AC:DE:08:67:FE: > > >> >>>>>> CB:E8:E6:4D:95:2F:6B:F5:07:4D:32:92:80:0A:7C:D1:B6:81:EE:AB: > > >> >>>>>> 26:C3:C6:22:77:00:5E:64:DE:96:0E:9F:A4:A0:F0:45:9F:19:73:EB: > > >> >>>>>> CC:60:AE:E9:63:E2:6D:2E:BA:65:9B:BD:04:CC:13:C2:55:88:05:03: > > >> >>>>>> 1B:30:18:8B > > >> >>>>>> > > >> >>>>>> On Tue, 16 Apr 2024 at 11:12, Gavin McKee > > >> >>>>>> <gavmcke...@googlemail.com> wrote: > > >> >>>>>>> > > >> >>>>>>> Hi, > > >> >>>>>>> > > >> >>>>>>> I need some help with strange OVS behaviours. > > >> >>>>>>> > > >> >>>>>>> ovs-vsctl (Open vSwitch) 3.2.2 > > >> >>>>>>> ovn-controller 23.09.1 > > >> >>>>>>> Open vSwitch Library 3.2.2 > > >> >>>>>>> > > >> >>>>>>> TLDR: We need to restart Open VSwitch in order for TLS traffic > > >> >>>>>>> to work between a VM and Cloudflare R2. After restarting Open > > >> >>>>>>> VSwitch the TLS connection works fine. > > >> >>>>>>> (see attached pcap tls-error.txt) > > >> >>>>>>> > > >> >>>>>>> See the attached openflow traces - they show a flow trace from > > >> >>>>>>> Open Vswitch. > > >> >>>>>>> > > >> >>>>>>> Also there is a retis trace (retis tool discussed at Open > > >> >>>>>>> VSwitch conference 2023). > > >> >>>>>>> > > >> >>>>>>> Note the drop (TC_INGRESS) in this file > > >> >>>>>>> + 1702601116185568 [swapper/140] 0 [tp] skb:kfree_skb > > >> >>>>>>> #60c81b6b91e2cff284fb3a3d65800 (skb 18386033671255367680) n 3 > > >> >>>>>>> drop (TC_INGRESS) > > >> >>>>>>> if 21 (enp148s0f0_1) rxif 21 172.27.18.244.57394 > > > >> >>>>>>> 104.18.2.35.443 ttl 63 tos 0x0 id 26162 off 0 [DF] len 477 proto > > >> >>>>>>> TCP (6) flags [P.] seq 792060930:792061367 ack 951229219 win 11 > > >> >>>>>>> > > >> >>>>>>> Again , once I restart Open vSwitch the problem goes away for a > > >> >>>>>>> time and comes back sometime later (not sure what that time > > >> >>>>>>> frame is but its a recurring issue.) > > >> >>>>>> _______________________________________________ > > >> >>>>>> discuss mailing list > > >> >>>>>> disc...@openvswitch.org > > >> >>>>>> https://mail.openvswitch.org/mailman/listinfo/ovs-discuss > > >> >>>>> > > >> >>>>> > > >> >>>>> > > >> >>>>> -- > > >> >>>>> fbl > > >> >>> > > >> >>> > > >> >>> > > >> >>> -- > > >> >>> fbl > > >> > _______________________________________________ > > >> > discuss mailing list > > >> > disc...@openvswitch.org > > >> > https://mail.openvswitch.org/mailman/listinfo/ovs-discuss > > >> _______________________________________________ discuss mailing list disc...@openvswitch.org https://mail.openvswitch.org/mailman/listinfo/ovs-discuss