I thought ct was in that retis trace . I’ll need to capture these events when they occur again .
I’m also having some issues with huge amounts of loss in the kernal pipeline that I can’t explain . When sending traffic VM to VM within a logical switch but going across GENEVE tunnel I get 30 - 40 Gbps on iperf . When I use the DAT (by sending traffic to the VM DNAT ) I get huge amounts of loss in the sender kernal , retis is saying sk_bff drops . 3.2.2 is absolutely killing me right now. Should I open another thread on this ? On Thu, Apr 18, 2024 at 00:15 Adrian Moreno <amore...@redhat.com> wrote: > Hi Gavin > > On 4/18/24 02:38, Gavin McKee via discuss wrote: > > This is an example. > > > > Again the TCP 3 handshake completes , but the next packet fails to NAT > > and goes out onto the physical network using the private address . An > > example of this is in the packet trace I provided. > > > > Given you were using retis in your initial troubleshooting, you can use it > with > the additional "ct" collector to see if the kernel datapath is retrieving > the > right conntrack entry and what's its state. If that shows some unexpected > conntrack entry change, it can be confirmed by monitoring "conntrack -E". > > Additionally, a dp flow dump (ovs-appctl dpctl/dump-flows -m) when the > problem > is happening might be also useful. > > -- > Adrián > > > > ovs-appctl ofproto/trace br-int > > > in_port=7753,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,tcp,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_ttl=32,tcp_src=52776,tcp_dst=443,tcp_flags=2 > > Flow: > tcp,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=32,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > > > bridge("br-int") > > ---------------- > > 0. in_port=7753, priority 100, cookie 0xc33f39c4 > > set_field:0x16d->reg13 > > set_field:0x155->reg11 > > set_field:0x1cf->reg12 > > set_field:0x12->metadata > > set_field:0xca->reg14 > > resubmit(,8) > > 8. metadata=0x12, priority 50, cookie 0x1645d3f2 > > set_field:0/0x1000->reg10 > > resubmit(,73) > > 73. > ip,reg14=0xca,metadata=0x12,dl_src=4e:42:14:a1:2a:fb,nw_src=172.27.18.244, > > priority 90, cookie 0xc33f39c4 > > set_field:0/0x1000->reg10 > > move:NXM_NX_REG10[12]->NXM_NX_XXREG0[111] > > -> NXM_NX_XXREG0[111] is now 0 > > resubmit(,9) > > 9. metadata=0x12, priority 0, cookie 0xcc4fd106 > > resubmit(,10) > > 10. metadata=0x12, priority 0, cookie 0x9e10ad0e > > resubmit(,11) > > 11. metadata=0x12, priority 0, cookie 0x557f3249 > > resubmit(,12) > > 12. ip,metadata=0x12, priority 100, cookie 0x14131a67 > > > set_field:0x1000000000000000000000000/0x1000000000000000000000000->xxreg0 > > resubmit(,13) > > 13. metadata=0x12, priority 0, cookie 0x85f9ed4f > > resubmit(,14) > > 14. ip,reg0=0x1/0x1,metadata=0x12, priority 100, cookie 0x279651c > > ct(table=15,zone=NXM_NX_REG13[0..15]) > > drop > > -> A clone of the packet is forked to recirculate. The forked > > pipeline will be resumed at table 15. > > -> Sets the packet to an untracked state, and clears all the > > conntrack fields. > > > > Final flow: > tcp,reg0=0x1,reg11=0x155,reg12=0x1cf,reg13=0x16d,reg14=0xca,metadata=0x12,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=32,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > Megaflow: > recirc_id=0,eth,tcp,in_port=7753,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_frag=no > > Datapath actions: ct(zone=365),recirc(0x13b5a) > > > > > =============================================================================== > > recirc(0x13b5a) - resume conntrack with default ct_state=trk|new (use > > --ct-next to customize) > > > =============================================================================== > > > > Flow: > recirc_id=0x13b5a,ct_state=new|trk,ct_zone=365,eth,tcp,reg0=0x1,reg11=0x155,reg12=0x1cf,reg13=0x16d,reg14=0xca,metadata=0x12,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=32,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > > > bridge("br-int") > > ---------------- > > thaw > > Resuming from table 15 > > 15. ct_state=+new-est+trk,metadata=0x12, priority 7, cookie 0xa9e0ee6f > > > set_field:0x80000000000000000000000000/0x80000000000000000000000000->xxreg0 > > > set_field:0x200000000000000000000000000/0x200000000000000000000000000->xxreg0 > > resubmit(,16) > > 16. conj_id=2865573479,tcp,reg0=0x80/0x80,reg14=0xca,metadata=0x12, > > priority 3000, cookie 0xabdec111 > > set_field:0x1000000000000/0x1000000000000->xreg4 > > > set_field:0x2000000000000000000000000/0x2000000000000000000000000->xxreg0 > > resubmit(,17) > > 17. reg8=0x10000/0x10000,metadata=0x12, priority 1000, cookie 0x8171c04a > > set_field:0/0x1000000000000->xreg4 > > set_field:0/0x2000000000000->xreg4 > > set_field:0/0x4000000000000->xreg4 > > resubmit(,18) > > 18. metadata=0x12, priority 0, cookie 0x62454929 > > resubmit(,19) > > 19. metadata=0x12, priority 0, cookie 0x3bb47080 > > resubmit(,20) > > 20. metadata=0x12, priority 0, cookie 0x73face9d > > resubmit(,21) > > 21. metadata=0x12, priority 0, cookie 0x8e46634d > > resubmit(,22) > > 22. metadata=0x12, priority 0, cookie 0xbddac461 > > resubmit(,23) > > 23. metadata=0x12, priority 0, cookie 0x9a32c0de > > resubmit(,24) > > 24. metadata=0x12, priority 0, cookie 0x79b1c074 > > resubmit(,25) > > 25. metadata=0x12, priority 0, cookie 0xc02374d3 > > resubmit(,26) > > 26. metadata=0x12, priority 0, cookie 0x218dc750 > > resubmit(,27) > > 27. metadata=0x12, priority 0, cookie 0xf6943631 > > set_field:0/0x1000000000000->xreg4 > > set_field:0/0x2000000000000->xreg4 > > set_field:0/0x4000000000000->xreg4 > > resubmit(,28) > > 28. ip,reg0=0x2/0x2002,metadata=0x12, priority 100, cookie 0x7fdca4fb > > > ct(commit,zone=NXM_NX_REG13[0..15],nat(src),exec(set_field:0/0x1->ct_mark)) > > nat(src) > > set_field:0/0x1->ct_mark > > -> Sets the packet to an untracked state, and clears all the > > conntrack fields. > > resubmit(,29) > > 29. metadata=0x12, priority 0, cookie 0x34e5fac7 > > resubmit(,30) > > 30. metadata=0x12, priority 0, cookie 0xac3f53bd > > resubmit(,31) > > 31. metadata=0x12, priority 0, cookie 0x54000bd5 > > resubmit(,32) > > 32. metadata=0x12, priority 0, cookie 0x29ab49f3 > > resubmit(,33) > > 33. metadata=0x12, priority 0, cookie 0xae6aefcb > > resubmit(,34) > > 34. metadata=0x12, priority 0, cookie 0x632dc93b > > resubmit(,35) > > 35. metadata=0x12,dl_dst=1a:16:b1:58:e1:cd, priority 50, cookie > 0x3fa33935 > > set_field:0x1->reg15 > > resubmit(,37) > > 37. priority 0 > > resubmit(,39) > > 39. priority 0 > > resubmit(,40) > > 40. reg15=0x1,metadata=0x12, priority 100, cookie 0x95c785db > > set_field:0x155->reg11 > > set_field:0x1cf->reg12 > > resubmit(,41) > > 41. priority 0 > > set_field:0->reg0 > > set_field:0->reg1 > > set_field:0->reg2 > > set_field:0->reg3 > > set_field:0->reg4 > > set_field:0->reg5 > > set_field:0->reg6 > > set_field:0->reg7 > > set_field:0->reg8 > > set_field:0->reg9 > > resubmit(,42) > > 42. ip,reg15=0x1,metadata=0x12, priority 110, cookie 0xc9a79824 > > resubmit(,43) > > 43. ip,reg15=0x1,metadata=0x12, priority 110, cookie 0xac7d5d78 > > resubmit(,44) > > 44. metadata=0x12, priority 0, cookie 0xa1ddb4f6 > > resubmit(,45) > > 45. ct_state=-trk,metadata=0x12, priority 5, cookie 0xb2622a65 > > > set_field:0x100000000000000000000000000/0x100000000000000000000000000->xxreg0 > > > set_field:0x200000000000000000000000000/0x200000000000000000000000000->xxreg0 > > resubmit(,46) > > 46. metadata=0x12, priority 0, cookie 0x267ae0e3 > > resubmit(,47) > > 47. metadata=0x12, priority 0, cookie 0x914392c3 > > set_field:0/0x1000000000000->xreg4 > > set_field:0/0x2000000000000->xreg4 > > set_field:0/0x4000000000000->xreg4 > > resubmit(,48) > > 48. metadata=0x12, priority 0, cookie 0xc4f6a97f > > resubmit(,49) > > 49. metadata=0x12, priority 0, cookie 0x8ebb0c71 > > resubmit(,50) > > 50. metadata=0x12, priority 0, cookie 0x61e385f4 > > resubmit(,51) > > 51. metadata=0x12, priority 0, cookie 0x4c722ce1 > > set_field:0/0x1000->reg10 > > resubmit(,75) > > 75. No match. > > drop > > move:NXM_NX_REG10[12]->NXM_NX_XXREG0[111] > > -> NXM_NX_XXREG0[111] is now 0 > > resubmit(,52) > > 52. metadata=0x12, priority 0, cookie 0x3c01b89d > > resubmit(,64) > > 64. priority 0 > > resubmit(,65) > > 65. reg15=0x1,metadata=0x12, priority 100, cookie 0x95c785db > > > clone(ct_clear,set_field:0->reg11,set_field:0->reg12,set_field:0->reg13,set_field:0x4cc->reg11,set_field:0x383->reg12,set_field:0x13->metadata,set_field:0x1->reg14,set_field:0->reg10,set_field:0->reg15,set_field:0->reg0,set_field:0->reg1,set_field:0->reg2,set_field:0->reg3,set_field:0->reg4,set_field:0->reg5,set_field:0->reg6,set_field:0->reg7,set_field:0->reg8,set_field:0->reg9,resubmit(,8)) > > ct_clear > > set_field:0->reg11 > > set_field:0->reg12 > > set_field:0->reg13 > > set_field:0x4cc->reg11 > > set_field:0x383->reg12 > > set_field:0x13->metadata > > set_field:0x1->reg14 > > set_field:0->reg10 > > set_field:0->reg15 > > set_field:0->reg0 > > set_field:0->reg1 > > set_field:0->reg2 > > set_field:0->reg3 > > set_field:0->reg4 > > set_field:0->reg5 > > set_field:0->reg6 > > set_field:0->reg7 > > set_field:0->reg8 > > set_field:0->reg9 > > resubmit(,8) > > 8. reg14=0x1,metadata=0x13,dl_dst=1a:16:b1:58:e1:cd, priority 50, > > cookie 0x8522832c > > > set_field:0x1a16b158e1cd0000000000000000/0xffffffffffff0000000000000000->xxreg0 > > resubmit(,9) > > 9. metadata=0x13, priority 0, cookie 0xef4dd8a9 > > set_field:0x4/0x4->xreg4 > > resubmit(,10) > > 10. reg9=0x4/0x4,metadata=0x13, priority 100, cookie 0x37718dd7 > > resubmit(,79) > > 79. > ip,reg14=0x1,metadata=0x13,dl_src=4e:42:14:a1:2a:fb,nw_src=172.27.18.244, > > priority 100, cookie 0x2e544767 > > drop > > resubmit(,11) > > 11. metadata=0x13, priority 0, cookie 0x3c2eb1b > > resubmit(,12) > > 12. metadata=0x13, priority 0, cookie 0x44f62446 > > resubmit(,13) > > 13. metadata=0x13, priority 0, cookie 0x3842bec6 > > resubmit(,14) > > 14. metadata=0x13, priority 0, cookie 0xfd7b2ab9 > > resubmit(,15) > > 15. metadata=0x13, priority 0, cookie 0xefbd7e27 > > resubmit(,16) > > 16. metadata=0x13, priority 0, cookie 0xf439d853 > > resubmit(,17) > > 17. metadata=0x13, priority 0, cookie 0x123f01f0 > > resubmit(,18) > > 18. metadata=0x13, priority 0, cookie 0x142cd59b > > resubmit(,19) > > 19. metadata=0x13, priority 0, cookie 0x297e0190 > > resubmit(,20) > > 20. metadata=0x13, priority 0, cookie 0x61e9e3c7 > > set_field:0/0xffffffff->xxreg1 > > resubmit(,21) > > 21. ip,reg7=0,metadata=0x13, priority 1, cookie 0x9f114f3d > > dec_ttl() > > set_field:0/0xffff00000000->xreg4 > > > set_field:0x64646401000000000000000000000000/0xffffffff000000000000000000000000->xxreg0 > > > set_field:0x646464030000000000000000/0xffffffff0000000000000000->xxreg0 > > set_field:2a:d5:d1:e8:89:cc->eth_src > > set_field:0x2->reg15 > > set_field:0x1/0x1->reg10 > > resubmit(,22) > > 22. reg8=0/0xffff,metadata=0x13, priority 150, cookie 0x6ece899a > > resubmit(,23) > > 23. metadata=0x13, priority 0, cookie 0x72437536 > > set_field:0/0xffff00000000->xreg4 > > resubmit(,24) > > 24. reg8=0/0xffff,metadata=0x13, priority 150, cookie 0xce06f1 > > resubmit(,25) > > 25. ip,metadata=0x13, priority 1, cookie 0xf690342c > > push:NXM_NX_REG0[] > > push:NXM_NX_XXREG0[96..127] > > pop:NXM_NX_REG0[] > > -> NXM_NX_REG0[] is now 0x64646401 > > set_field:00:00:00:00:00:00->eth_dst > > resubmit(,66) > > 66. reg0=0x64646401,reg15=0x2,metadata=0x13, priority 100, cookie > 0x97b4353d > > set_field:00:00:5e:00:01:ff->eth_dst > > set_field:0x40/0x40->reg10 > > pop:NXM_NX_REG0[] > > -> NXM_NX_REG0[] is now 0x64646401 > > resubmit(,26) > > 26. metadata=0x13, priority 0, cookie 0x6c2ad4cc > > resubmit(,27) > > 27. metadata=0x13, priority 0, cookie 0xadc9fb4f > > resubmit(,28) > > 28. ip,reg15=0x2,metadata=0x13,nw_src=172.27.18.244, priority 100, > > cookie 0xfa923492 > > set_field:4e:42:14:a1:2a:fb->eth_src > > > set_field:0xcc3418740000000000000000/0xffffffff0000000000000000->xxreg0 > > resubmit(,29) > > 29. metadata=0x13, priority 0, cookie 0x571bbf77 > > resubmit(,37) > > 37. priority 0 > > resubmit(,39) > > 39. priority 0 > > resubmit(,40) > > 40. reg15=0x2,metadata=0x13, priority 100, cookie 0x407086d0 > > set_field:0x4cc->reg11 > > set_field:0x383->reg12 > > resubmit(,41) > > 41. priority 0 > > set_field:0->reg0 > > set_field:0->reg1 > > set_field:0->reg2 > > set_field:0->reg3 > > set_field:0->reg4 > > set_field:0->reg5 > > set_field:0->reg6 > > set_field:0->reg7 > > set_field:0->reg8 > > set_field:0->reg9 > > resubmit(,42) > > 42. metadata=0x13, priority 0, cookie 0x4c3b7fcc > > set_field:0/0x10->xreg4 > > resubmit(,43) > > 43. ip,reg15=0x2,metadata=0x13,nw_src=172.27.18.244, priority 100, > > cookie 0x540f90b2 > > set_field:4e:42:14:a1:2a:fb->eth_src > > ct(table=44,zone=NXM_NX_REG11[0..15],nat) > > nat > > -> A clone of the packet is forked to recirculate. The forked > > pipeline will be resumed at table 44. > > -> Sets the packet to an untracked state, and clears all the > > conntrack fields. > > > > Final flow: > recirc_id=0x13b5a,eth,tcp,reg0=0x300,reg11=0x155,reg12=0x1cf,reg13=0x16d,reg14=0xca,reg15=0x1,metadata=0x12,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=32,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > Megaflow: > recirc_id=0x13b5a,ct_state=+new-est-rel-rpl-inv+trk,ct_mark=0/0x1,eth,tcp,in_port=7753,dl_src=4e:42:14:a1:2a:fb,dl_dst=1a:16:b1:58:e1:cd,nw_src=172.27.18.244,nw_dst= > 104.0.0.0/5,nw_ttl=32,nw_frag=no,tp_src=0x8000/0x8000,tp_dst=0x180/0xffc0 > > Datapath actions: > > > ct(commit,zone=365,mark=0/0x1,nat(src)),set(eth(dst=00:00:5e:00:01:ff)),set(ipv4(ttl=31)),ct(zone=1228,nat),recirc(0x13b5c) > > > > > =============================================================================== > > recirc(0x13b5c) - resume conntrack with default ct_state=trk|new (use > > --ct-next to customize) > > Replacing src/dst IP/ports to simulate NAT: > > Initial flow: > > Modified flow: > > > =============================================================================== > > > > Flow: > recirc_id=0x13b5c,ct_state=new|trk,ct_zone=1228,eth,tcp,reg10=0x41,reg11=0x4cc,reg12=0x383,reg14=0x1,reg15=0x2,metadata=0x13,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=00:00:5e:00:01:ff,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=31,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > > > bridge("br-int") > > ---------------- > > thaw > > Resuming from table 44 > > 44. metadata=0x13, priority 0, cookie 0x48ef0c42 > > resubmit(,45) > > 45. ct_state=-rpl+trk,ip,reg15=0x2,metadata=0x13,nw_src=172.27.18.244, > > priority 161, cookie 0x5414262a > > set_field:4e:42:14:a1:2a:fb->eth_src > > ct(commit,table=46,zone=NXM_NX_REG12[0..15],nat(src=204.52.24.116)) > > nat(src=204.52.24.116) > > -> A clone of the packet is forked to recirculate. The forked > > pipeline will be resumed at table 46. > > -> Sets the packet to an untracked state, and clears all the > > conntrack fields. > > > > Final flow: > recirc_id=0x13b5c,eth,tcp,reg10=0x41,reg11=0x4cc,reg12=0x383,reg14=0x1,reg15=0x2,metadata=0x13,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=00:00:5e:00:01:ff,nw_src=172.27.18.244,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=31,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > Megaflow: > recirc_id=0x13b5c,ct_state=-rpl+trk,eth,ip,in_port=7753,dl_src=4e:42:14:a1:2a:fb,nw_src=172.27.18.244,nw_frag=no > > Datapath actions: > ct(commit,zone=899,nat(src=204.52.24.116)),recirc(0x25f9b) > > > > > =============================================================================== > > recirc(0x25f9b) - resume conntrack with default ct_state=trk|new (use > > --ct-next to customize) > > Replacing src/dst IP/ports to simulate NAT: > > Initial flow: > nw_src=172.27.18.244,tp_src=52776,nw_dst=104.18.3.35,tp_dst=443 > > Modified flow: > nw_src=204.52.24.116,tp_src=52776,nw_dst=104.18.3.35,tp_dst=443 > > > =============================================================================== > > > > Flow: > recirc_id=0x25f9b,ct_state=new|trk,ct_zone=899,eth,tcp,reg10=0x41,reg11=0x4cc,reg12=0x383,reg14=0x1,reg15=0x2,metadata=0x13,in_port=7753,vlan_tci=0x0000,dl_src=4e:42:14:a1:2a:fb,dl_dst=00:00:5e:00:01:ff,nw_src=204.52.24.116,nw_dst=104.18.3.35,nw_tos=0,nw_ecn=0,nw_ttl=31,nw_frag=no,tp_src=52776,tp_dst=443,tcp_flags=syn > > > > bridge("br-int") > > ---------------- > > thaw > > Resuming from table 46 > > 46. metadata=0x13, priority 0, cookie 0x3800cb88 > > resubmit(,47) > > 47. metadata=0x13, priority 0, cookie 0xc2575be2 > > resubmit(,48) > > 48. reg15=0x2,metadata=0x13, priority 100, cookie 0xaf19a22a > > resubmit(,64) > > 64. reg10=0x1/0x1,reg15=0x2,metadata=0x13, priority 100, cookie > 0x407086d0 > > push:NXM_OF_IN_PORT[] > > set_field:ANY->in_port > > resubmit(,65) > > 65. reg15=0x2,metadata=0x13, priority 100, cookie 0x407086d0 > > > clone(ct_clear,set_field:0->reg11,set_field:0->reg12,set_field:0->reg13,set_field:0x2c2->reg11,set_field:0x327->reg12,set_field:0x1->metadata,set_field:0xa->reg14,set_field:0->reg10,set_field:0->reg15,set_field:0->reg0,set_field:0->reg1,set_field:0->reg2,set_field:0->reg3,set_field:0->reg4,set_field:0->reg5,set_field:0->reg6,set_field:0->reg7,set_field:0->reg8,set_field:0->reg9,resubmit(,8)) > > ct_clear > > set_field:0->reg11 > > set_field:0->reg12 > > set_field:0->reg13 > > set_field:0x2c2->reg11 > > set_field:0x327->reg12 > > set_field:0x1->metadata > > set_field:0xa->reg14 > > set_field:0->reg10 > > set_field:0->reg15 > > set_field:0->reg0 > > set_field:0->reg1 > > set_field:0->reg2 > > set_field:0->reg3 > > set_field:0->reg4 > > set_field:0->reg5 > > set_field:0->reg6 > > set_field:0->reg7 > > set_field:0->reg8 > > set_field:0->reg9 > > resubmit(,8) > > 8. metadata=0x1, priority 50, cookie 0x1645d3f2 > > set_field:0/0x1000->reg10 > > resubmit(,73) > > 73. No match. > > drop > > move:NXM_NX_REG10[12]->NXM_NX_XXREG0[111] > > -> NXM_NX_XXREG0[111] is now 0 > > resubmit(,9) > > 9. metadata=0x1, priority 0, cookie 0xcc4fd106 > > resubmit(,10) > > 10. metadata=0x1, priority 0, cookie 0x9e10ad0e > > resubmit(,11) > > 11. metadata=0x1, priority 0, cookie 0x557f3249 > > resubmit(,12) > > 12. metadata=0x1, priority 0, cookie 0x915c56b1 > > resubmit(,13) > > 13. ip,reg14=0xa,metadata=0x1, priority 110, cookie 0xccdcd3e1 > > resubmit(,14) > > 14. metadata=0x1, priority 0, cookie 0x134ce32f > > resubmit(,15) > > 15. metadata=0x1, priority 65535, cookie 0x49627e5f > > resubmit(,16) > > 16. metadata=0x1, priority 65535, cookie 0xc947843d > > resubmit(,17) > > 17. metadata=0x1, priority 0, cookie 0xd2fb4d2 > > resubmit(,18) > > 18. metadata=0x1, priority 0, cookie 0x62454929 > > resubmit(,19) > > 19. metadata=0x1, priority 0, cookie 0x3bb47080 > > resubmit(,20) > > 20. metadata=0x1, priority 0, cookie 0x73face9d > > resubmit(,21) > > 21. metadata=0x1, priority 0, cookie 0x8e46634d > > resubmit(,22) > > 22. metadata=0x1, priority 0, cookie 0xbddac461 > > resubmit(,23) > > 23. metadata=0x1, priority 0, cookie 0x9a32c0de > > resubmit(,24) > > 24. metadata=0x1, priority 0, cookie 0x79b1c074 > > resubmit(,25) > > 25. metadata=0x1, priority 0, cookie 0xc02374d3 > > resubmit(,26) > > 26. metadata=0x1, priority 0, cookie 0x218dc750 > > resubmit(,27) > > 27. metadata=0x1, priority 0, cookie 0x8db4eebc > > resubmit(,28) > > 28. metadata=0x1, priority 0, cookie 0x6deecbbe > > resubmit(,29) > > 29. metadata=0x1, priority 0, cookie 0x34e5fac7 > > resubmit(,30) > > 30. metadata=0x1, priority 0, cookie 0xac3f53bd > > resubmit(,31) > > 31. metadata=0x1, priority 0, cookie 0x54000bd5 > > resubmit(,32) > > 32. metadata=0x1, priority 0, cookie 0x29ab49f3 > > resubmit(,33) > > 33. metadata=0x1, priority 0, cookie 0xae6aefcb > > resubmit(,34) > > 34. metadata=0x1, priority 0, cookie 0x632dc93b > > resubmit(,35) > > 35. metadata=0x1, priority 0, cookie 0x9961115c > > set_field:0->reg15 > > resubmit(,71) > > 71. No match. > > drop > > resubmit(,36) > > 36. reg15=0,metadata=0x1, priority 50, cookie 0x62255993 > > set_field:0x8001->reg15 > > resubmit(,37) > > 37. priority 0 > > resubmit(,39) > > 39. priority 0 > > resubmit(,40) > > 40. reg15=0x8001,metadata=0x1, priority 100, cookie 0x11699429 > > set_field:0x14->reg13 > > set_field:0x1->reg15 > > resubmit(,41) > > 41. priority 0 > > set_field:0->reg0 > > set_field:0->reg1 > > set_field:0->reg2 > > set_field:0->reg3 > > set_field:0->reg4 > > set_field:0->reg5 > > set_field:0->reg6 > > set_field:0->reg7 > > set_field:0->reg8 > > set_field:0->reg9 > > resubmit(,42) > > 42. metadata=0x1, priority 0, cookie 0x9ad1480d > > resubmit(,43) > > 43. ip,reg15=0x1,metadata=0x1, priority 110, cookie > 0xca4977d3 > > ct_clear > > resubmit(,44) > > 44. metadata=0x1, priority 0, cookie 0xa1ddb4f6 > > resubmit(,45) > > 45. metadata=0x1, priority 65535, cookie 0x91b4350e > > resubmit(,46) > > 46. metadata=0x1, priority 65535, cookie 0xfc9c351d > > resubmit(,47) > > 47. metadata=0x1, priority 0, cookie 0xb984696e > > resubmit(,48) > > 48. metadata=0x1, priority 0, cookie 0xc4f6a97f > > resubmit(,49) > > 49. metadata=0x1, priority 0, cookie 0x8ebb0c71 > > resubmit(,50) > > 50. metadata=0x1, priority 0, cookie 0x61e385f4 > > resubmit(,51) > > 51. metadata=0x1, priority 0, cookie 0x4c722ce1 > > set_field:0/0x1000->reg10 > > resubmit(,75) > > 75. No match. > > drop > > move:NXM_NX_REG10[12]->NXM_NX_XXREG0[111] > > -> NXM_NX_XXREG0[111] is now 0 > > resubmit(,52) > > 52. metadata=0x1, priority 0, cookie 0x3c01b89d > > resubmit(,64) > > 64. priority 0 > > resubmit(,65) > > 65. reg15=0x1,metadata=0x1, priority 100, cookie > 0x4581da82 > > push_vlan:0x8100 > > set_field:4216->vlan_vid > > output:7754 > > > > bridge("br-provider") > > --------------------- > > 0. priority 0 > > NORMAL > > -> forwarding to learned port > > pop_vlan > > set_field:0x8001->reg15 > > pop:NXM_OF_IN_PORT[] > > -> NXM_OF_IN_PORT[] is now 7753 > > > > Final flow: unchanged > > Megaflow: > recirc_id=0x25f9b,ct_state=+new-est-rel-rpl-inv+trk,ct_mark=0/0x1,eth,ip,in_port=7753,dl_src=4e:42:14:a1:2a:fb,dl_dst=00:00:5e:00:01:ff,nw_dst= > 0.0.0.0/1,nw_frag=no > > Datapath actions: ct_clear,push_vlan(vid=120,pcp=0),5 > > > > On Wed, 17 Apr 2024 at 17:32, Gavin McKee <gavmcke...@googlemail.com> > wrote: > >> > >> That information is all in the email > >> > >> The openflow trace is showing that the pipeline is fine . This is why > I’m worried about a deeper issue with the kernal / openvswitch kernal > module / connection tracking > >> > >> > >> > >> On Wed, Apr 17, 2024 at 16:33 Flavio Leitner <f...@sysclose.org> wrote: > >>> > >>> On Wed, 17 Apr 2024 12:26:27 -0700 > >>> Gavin McKee <gavmcke...@googlemail.com> wrote: > >>> > >>>> Hi Flavio, > >>>> > >>>> I had to restart the Open vSwitch across 16 machines to resolve the > >>>> issue for a customer . I think it will occur again and when it does > >>>> I'll use that command to gather the tc information. > >>>> > >>>> Until then I think I have found why the issue is occurring . > >>>> > >>>> Take a look at the output below (this is a packet capture from the > >>>> physical interface on the compute node , so traffic that has gone > >>>> through the OVS Openflow pipeline) - we make a 3 way handshake with R2 > >>>> , and establish the connection. A packet goes missing - TLS > >>>> handshake, it then appears that it hasn't gone through NAT as it's > >>>> using the Private IP of the VM . > >>> > >>> > >>> If that is the case, you might be able to see if the data path > >>> is matching correctly and the actions are using NAT with > >>> ovs-appctl ofproto/trace. > >>> > >>> fbl > >>> > >>> > >>>> > >>>> Take a look at frame 14 > >>>> > >>>> No. Time Source Destination > >>>> Protocol Length Info > >>>> Delta > >>>> 14 09:24:08.064432 172.27.18.244 104.18.2.35 > >>>> TLSv1 502 Client Hello, Alert (Level: Fatal, Description: Decode > >>>> Error) 2.362983 > >>>> > >>>> Frame 14: 502 bytes on wire (4016 bits), 502 bytes captured (4016 > >>>> bits) Ethernet II, Src: 4e:42:14:a1:2a:fb (4e:42:14:a1:2a:fb), Dst: > >>>> IETF-VRRP-VRID_ff (00:00:5e:00:01:ff) > >>>> 802.1Q Virtual LAN, PRI: 0, DEI: 0, ID: 120 > >>>> Internet Protocol Version 4, Src: 172.27.18.244, Dst: 104.18.2.35 > >>>> Transmission Control Protocol, Src Port: 57394, Dst Port: 443, Seq: 1, > >>>> Ack: 1, Len: 444 > >>>> Transport Layer Security > >>>> TLSv1 Record Layer: Handshake Protocol: Client Hello > >>>> Content Type: Handshake (22) > >>>> Version: TLS 1.0 (0x0301) > >>>> Length: 432 > >>>> Handshake Protocol: Client Hello > >>>> TLSv1 Record Layer: Alert (Level: Fatal, Description: Decode > >>>> Error) Content Type: Alert (21) > >>>> Version: TLS 1.0 (0x0301) > >>>> Length: 2 > >>>> Alert Message > >>>> Level: Fatal (2) > >>>> Description: Decode Error (50) > >>>> > >>>> > >>>> > >>>> > ------------------------------------------------------------------------------------------------------------ > >>>> > >>>> On Wed, 17 Apr 2024 at 11:28, Flavio Leitner <f...@sysclose.org> > wrote: > >>>>> > >>>>> > >>>>> Hi Gavin, > >>>>> > >>>>> It would be helpful if you can provide some TC dumps from the > >>>>> "good" state to the "bad" state to see how it was and what changes. > >>>>> Something like: > >>>>> > >>>>> # tc -s filter show dev enp148s0f0_1 ingress > >>>>> > >>>>> I haven't checked the attached files, but one suggestion is to > >>>>> check if this is not a csum issue. > >>>>> > >>>>> Thanks, > >>>>> fbl > >>>>> > >>>>> > >>>>> On Tue, 16 Apr 2024 13:17:10 -0700 > >>>>> Gavin McKee via discuss <ovs-discuss@openvswitch.org> wrote: > >>>>> > >>>>>> Adding information relating to the Open VSwitch kernal module > >>>>>> @Ilya Maximets @Numan Siddique Can either of you help out here? > >>>>>> > >>>>>> > >>>>>> modinfo openvswitch > >>>>>> filename: > >>>>>> > /lib/modules/5.14.0-362.8.1.el9_3.x86_64/kernel/net/openvswitch/openvswitch.ko.xz > >>>>>> alias: net-pf-16-proto-16-family-ovs_ct_limit > >>>>>> alias: net-pf-16-proto-16-family-ovs_meter > >>>>>> alias: net-pf-16-proto-16-family-ovs_packet > >>>>>> alias: net-pf-16-proto-16-family-ovs_flow > >>>>>> alias: net-pf-16-proto-16-family-ovs_vport > >>>>>> alias: net-pf-16-proto-16-family-ovs_datapath > >>>>>> license: GPL > >>>>>> description: Open vSwitch switching datapath > >>>>>> rhelversion: 9.3 > >>>>>> srcversion: 8A2159D727C8BADC82261B8 > >>>>>> depends: nf_conntrack,nf_conncount,libcrc32c,nf_nat > >>>>>> retpoline: Y > >>>>>> intree: Y > >>>>>> name: openvswitch > >>>>>> vermagic: 5.14.0-362.8.1.el9_3.x86_64 SMP preempt mod_unload > >>>>>> modversions sig_id: PKCS#7 > >>>>>> signer: Rocky kernel signing key > >>>>>> sig_key: > >>>>>> 17:CA:DE:1F:EC:D1:59:2D:9F:52:34:C6:7C:09:06:81:3D:74:7C:F7 > >>>>>> sig_hashalgo: sha256 signature: > >>>>>> 67:31:56:70:86:DB:57:69:8D:4A:9B:A7:ED:17:F3:67:65:98:97:08: > >>>>>> 1F:FB:4D:F8:A8:2D:7C:A7:7D:3A:57:85:CA:67:9D:82:72:EB:54:14: > >>>>>> F2:BB:40:78:AD:85:56:2D:EF:D5:00:95:38:A4:86:9F:5F:29:1A:81: > >>>>>> 32:94:B4:87:41:94:A0:3E:71:A5:97:44:2E:42:DD:F7:42:6B:69:94: > >>>>>> E3:AB:6E:E5:4F:C9:60:57:70:07:5F:CA:C7:83:7A:2F:C7:81:62:FF: > >>>>>> 53:AF:AC:2B:06:D8:08:D3:1D:A7:F0:43:10:98:DE:B1:62:AE:89:A5: > >>>>>> FE:EF:74:09:0F:2D:0F:D9:73:A5:59:75:D0:87:1E:EA:3A:40:86:1E: > >>>>>> 76:E5:E7:3B:59:2E:3A:7E:65:F3:92:A1:B4:84:48:3F:43:A0:D7:1C: > >>>>>> 21:29:E0:B6:D1:10:36:15:88:43:6A:11:8F:55:EE:1B:F9:53:3B:86: > >>>>>> EF:81:71:17:81:08:EC:53:30:D6:69:8E:13:11:D5:DF:15:75:88:50: > >>>>>> 69:19:51:3B:41:6B:6F:E0:7A:30:33:32:E6:60:18:02:A6:0C:63:9B: > >>>>>> C5:D7:2F:6A:D0:BA:45:03:19:0E:21:E8:18:FB:E8:D1:C1:33:05:36: > >>>>>> 1F:9B:0F:29:3F:05:51:7A:30:86:88:B7:C7:44:2E:2B:50:F9:EF:4F: > >>>>>> D4:70:EA:1B:33:E2:F0:E3:E2:88:00:E5:BF:06:E2:D4:B7:81:EE:6E: > >>>>>> 89:02:18:65:8B:1C:84:42:2F:89:14:63:1D:51:70:37:42:C5:68:DD: > >>>>>> 4D:12:7B:07:33:2B:C6:BC:8F:7F:23:D7:58:DF:47:AC:DE:08:67:FE: > >>>>>> CB:E8:E6:4D:95:2F:6B:F5:07:4D:32:92:80:0A:7C:D1:B6:81:EE:AB: > >>>>>> 26:C3:C6:22:77:00:5E:64:DE:96:0E:9F:A4:A0:F0:45:9F:19:73:EB: > >>>>>> CC:60:AE:E9:63:E2:6D:2E:BA:65:9B:BD:04:CC:13:C2:55:88:05:03: > >>>>>> 1B:30:18:8B > >>>>>> > >>>>>> On Tue, 16 Apr 2024 at 11:12, Gavin McKee > >>>>>> <gavmcke...@googlemail.com> wrote: > >>>>>>> > >>>>>>> Hi, > >>>>>>> > >>>>>>> I need some help with strange OVS behaviours. > >>>>>>> > >>>>>>> ovs-vsctl (Open vSwitch) 3.2.2 > >>>>>>> ovn-controller 23.09.1 > >>>>>>> Open vSwitch Library 3.2.2 > >>>>>>> > >>>>>>> TLDR: We need to restart Open VSwitch in order for TLS traffic > >>>>>>> to work between a VM and Cloudflare R2. After restarting Open > >>>>>>> VSwitch the TLS connection works fine. > >>>>>>> (see attached pcap tls-error.txt) > >>>>>>> > >>>>>>> See the attached openflow traces - they show a flow trace from > >>>>>>> Open Vswitch. > >>>>>>> > >>>>>>> Also there is a retis trace (retis tool discussed at Open > >>>>>>> VSwitch conference 2023). > >>>>>>> > >>>>>>> Note the drop (TC_INGRESS) in this file > >>>>>>> + 1702601116185568 [swapper/140] 0 [tp] skb:kfree_skb > >>>>>>> #60c81b6b91e2cff284fb3a3d65800 (skb 18386033671255367680) n 3 > >>>>>>> drop (TC_INGRESS) > >>>>>>> if 21 (enp148s0f0_1) rxif 21 172.27.18.244.57394 > > >>>>>>> 104.18.2.35.443 ttl 63 tos 0x0 id 26162 off 0 [DF] len 477 proto > >>>>>>> TCP (6) flags [P.] seq 792060930:792061367 ack 951229219 win 11 > >>>>>>> > >>>>>>> Again , once I restart Open vSwitch the problem goes away for a > >>>>>>> time and comes back sometime later (not sure what that time > >>>>>>> frame is but its a recurring issue.) > >>>>>> _______________________________________________ > >>>>>> discuss mailing list > >>>>>> disc...@openvswitch.org > >>>>>> https://mail.openvswitch.org/mailman/listinfo/ovs-discuss > >>>>> > >>>>> > >>>>> > >>>>> -- > >>>>> fbl > >>> > >>> > >>> > >>> -- > >>> fbl > > _______________________________________________ > > discuss mailing list > > disc...@openvswitch.org > > https://mail.openvswitch.org/mailman/listinfo/ovs-discuss > >
_______________________________________________ discuss mailing list disc...@openvswitch.org https://mail.openvswitch.org/mailman/listinfo/ovs-discuss