Hi Jiayu, Please find comments inline > -----Original Message----- > From: Hu, Jiayu [mailto:jiayu...@intel.com] > Sent: Monday, April 23, 2018 7:14 AM > To: Ophir Munk <ophi...@mellanox.com>; dev@dpdk.org; Ananyev, > Konstantin <konstantin.anan...@intel.com> > Cc: Thomas Monjalon <tho...@monjalon.net>; Olga Shern > <ol...@mellanox.com>; Pascal Mazon <pascal.ma...@6wind.com>; > sta...@dpdk.org > Subject: RE: [PATCH v1] gso: fix marking TCP checksum flag in TCP segments > > Hi Ophir, > > In the GSO design, the GSO library doesn't care about checksums, which > means it doesn't check if input packets have correct checksums, and it > doesn't do any checksum related work for the output GSO segments. It > depends on the callers to use HW or SW checksum calculation for output > packets. This is why the GSO library doesn't set PKT_TX_TCP_CKSUM. So I > don't think it's a bug. >
Can you please reconsider this design? I think the GSO library should imitate the HW behavior where TCP segments checksum is automatically calculated without explicitly requesting it. I am not saying that GSO library itself should calculate the checksums - but at least it should mark each segment as requiring this calculation. > In my opinion, it's not a good idea to enable HW TCP checksum calculation > silently, and without the aware of the caller. In fact, the caller always > know it > does SW TSO (i.e. GSO), instead of real HW TSO. This is not correct. Consider net_failsafe with 2 sub-devices: one is a HW PCI device, the other one is a SW TAP device. Failsafe must work transparently with these two sub-devices and the caller cannot tell if TSO is done in SW or HW. > If the caller wants HW > checksum calculation, it can add PKT_TX_TCP_CKSUM to ol_flags before or > after calling the GSO library. > FYI - TAP TSO patches were submitted to dpdk.org mailing list. These patches use the GSO library. https://dpdk.org/dev/patchwork/patch/38666/ https://dpdk.org/dev/patchwork/patch/38667/ Running testpmd with TAP TSO is currently broken without the suggested librte_gso patch. Please note testpmd implementation (app/test-pmd/csumonly.c b/app/test-pmd/csumonly.c) in case *both* TSO and TCP CKSUM are configured: if (tso_segsz) ol_flags |= PKT_TX_TCP_SEG; // *** if TSO is applicable - the packet flags are only marked with PKT_TX_TCP_SEG and no PKT_TX_TCP_CKSUM *** else if (tx_offloads & DEV_TX_OFFLOAD_TCP_CKSUM) ol_flags |= PKT_TX_TCP_CKSUM; // *** PKT_TX_TCP_CKSUM is marked only if TSO is not applicable *** else { tcp_hdr->cksum = get_udptcp_checksum(l3_hdr, tcp_hdr, In other words - testpmd does not set TCP_CKSUM along with TCP_SEG therefore using testpmd with TAP/TSO will result in TCP segments with 0 (incorrect) TCP checksums. In addition - please note the comments in lib/librte_mbuf/rte_mbuf.h which specify that PKT_TX_TCP_SEG flag implies the PKT_TX_TCP_CKSUM (hence it is not required to be explicitly set by the caller) /** * TCP segmentation offload. To enable this offload feature for a * packet to be transmitted on hardware supporting TSO: * - set the PKT_TX_TCP_SEG flag in mbuf->ol_flags (this flag implies * PKT_TX_TCP_CKSUM) ... > Add Konstantin for more suggestions. > > Thanks, > Jiayu > > > -----Original Message----- > > From: Ophir Munk [mailto:ophi...@mellanox.com] > > Sent: Sunday, April 22, 2018 10:21 PM > > To: dev@dpdk.org; Hu, Jiayu <jiayu...@intel.com> > > Cc: Thomas Monjalon <tho...@monjalon.net>; Olga Shern > > <ol...@mellanox.com>; Pascal Mazon <pascal.ma...@6wind.com>; > Ophir > > Munk <ophi...@mellanox.com>; sta...@dpdk.org > > Subject: [PATCH v1] gso: fix marking TCP checksum flag in TCP segments > > > > Large TCP packets which are marked with PKT_TX_TCP_SEG flag are > > segmented and the flag is cleared in the resulting segments, however, > > the segments checksum is not updated. It is therefore required to set > > the PKT_TX_TCP_CKSUM flag in each TCP segment in order to mark for the > > sending driver the need to update the TCP checksum before transmitting > > the segment. > > > > Fixes: 119583797b6a ("gso: support TCP/IPv4 GSO") > > Cc: sta...@dpdk.org > > > > Signed-off-by: Ophir Munk <ophi...@mellanox.com> > > --- > > lib/librte_gso/rte_gso.c | 2 ++ > > 1 file changed, 2 insertions(+) > > > > diff --git a/lib/librte_gso/rte_gso.c b/lib/librte_gso/rte_gso.c index > > a44e3d4..e9ce9ce 100644 > > --- a/lib/librte_gso/rte_gso.c > > +++ b/lib/librte_gso/rte_gso.c > > @@ -50,12 +50,14 @@ rte_gso_segment(struct rte_mbuf *pkt, > > ((IS_IPV4_GRE_TCP4(pkt->ol_flags) && > > (gso_ctx->gso_types & > > DEV_TX_OFFLOAD_GRE_TNL_TSO)))) { > > pkt->ol_flags &= (~PKT_TX_TCP_SEG); > > + pkt->ol_flags |= PKT_TX_TCP_CKSUM; > > ret = gso_tunnel_tcp4_segment(pkt, gso_size, ipid_delta, > > direct_pool, indirect_pool, > > pkts_out, nb_pkts_out); > > } else if (IS_IPV4_TCP(pkt->ol_flags) && > > (gso_ctx->gso_types & > > DEV_TX_OFFLOAD_TCP_TSO)) { > > pkt->ol_flags &= (~PKT_TX_TCP_SEG); > > + pkt->ol_flags |= PKT_TX_TCP_CKSUM; > > ret = gso_tcp4_segment(pkt, gso_size, ipid_delta, > > direct_pool, indirect_pool, > > pkts_out, nb_pkts_out); > > -- > > 2.7.4