For Konstantin

> -----Original Message-----
> From: Feifei Wang
> Sent: Saturday, September 23, 2023 1:52 PM
> To: Konstantin Ananyev <konstantin.anan...@huawei.com>; Konstantin
> Ananyev <konstantin.v.anan...@yandex.ru>
> Cc: dev@dpdk.org; nd <n...@arm.com>; Honnappa Nagarahalli
> <honnappa.nagaraha...@arm.com>; Ruifeng Wang
> <ruifeng.w...@arm.com>; Yuying Zhang <yuying.zh...@intel.com>; Beilei
> Xing <beilei.x...@intel.com>; nd <n...@arm.com>; nd <n...@arm.com>; nd
> <n...@arm.com>; nd <n...@arm.com>; nd <n...@arm.com>; nd <n...@arm.com>;
> nd <n...@arm.com>; nd <n...@arm.com>
> Subject: RE: [PATCH v11 2/4] net/i40e: implement mbufs recycle mode
> 
> 
> 
> > -----Original Message-----
> > From: Konstantin Ananyev <konstantin.anan...@huawei.com>
> > Sent: Saturday, September 23, 2023 12:41 AM
> > To: Feifei Wang <feifei.wa...@arm.com>; Konstantin Ananyev
> > <konstantin.v.anan...@yandex.ru>
> > Cc: dev@dpdk.org; nd <n...@arm.com>; Honnappa Nagarahalli
> > <honnappa.nagaraha...@arm.com>; Ruifeng Wang
> <ruifeng.w...@arm.com>;
> > Yuying Zhang <yuying.zh...@intel.com>; Beilei Xing
> > <beilei.x...@intel.com>; nd <n...@arm.com>; nd <n...@arm.com>; nd
> > <n...@arm.com>; nd <n...@arm.com>; nd <n...@arm.com>; nd
> <n...@arm.com>; nd
> > <n...@arm.com>
> > Subject: RE: [PATCH v11 2/4] net/i40e: implement mbufs recycle mode
> >
> >
> > Hi Feifei,
> >
> > > > > -----Original Message-----
> > > > > From: Feifei Wang
> > > > > Sent: Tuesday, September 5, 2023 11:11 AM
> > > > > To: Konstantin Ananyev <konstantin.anan...@huawei.com>;
> > > > > Konstantin Ananyev <konstantin.v.anan...@yandex.ru>
> > > > > Cc: dev@dpdk.org; nd <n...@arm.com>; Honnappa Nagarahalli
> > > > > <honnappa.nagaraha...@arm.com>; Ruifeng Wang
> > > > <ruifeng.w...@arm.com>;
> > > > > Yuying Zhang <yuying.zh...@intel.com>; Beilei Xing
> > > > > <beilei.x...@intel.com>; nd <n...@arm.com>; nd <n...@arm.com>; nd
> > > > > <n...@arm.com>; nd <n...@arm.com>; nd <n...@arm.com>
> > > > > Subject: RE: [PATCH v11 2/4] net/i40e: implement mbufs recycle
> > > > > mode
> > > > >
> > > > >
> > > > >
> > > > > > -----Original Message-----
> > > > > > From: Konstantin Ananyev <konstantin.anan...@huawei.com>
> > > > > > Sent: Monday, September 4, 2023 6:22 PM
> > > > > > To: Feifei Wang <feifei.wa...@arm.com>; Konstantin Ananyev
> > > > > > <konstantin.v.anan...@yandex.ru>
> > > > > > Cc: dev@dpdk.org; nd <n...@arm.com>; Honnappa Nagarahalli
> > > > > > <honnappa.nagaraha...@arm.com>; Ruifeng Wang
> > > > > <ruifeng.w...@arm.com>;
> > > > > > Yuying Zhang <yuying.zh...@intel.com>; Beilei Xing
> > > > > > <beilei.x...@intel.com>; nd <n...@arm.com>; nd <n...@arm.com>; nd
> > > > > > <n...@arm.com>; nd <n...@arm.com>
> > > > > > Subject: RE: [PATCH v11 2/4] net/i40e: implement mbufs recycle
> > > > > > mode
> > > > > >
> > > > > >
> > > > > >
> > > > > > > > > > > > > > Define specific function implementation for i40e 
> > > > > > > > > > > > > > driver.
> > > > > > > > > > > > > > Currently, mbufs recycle mode can support
> > > > > > > > > > > > > > 128bit vector path and
> > > > > > > > > > > > > > avx2
> > > > > > > > > > > > path.
> > > > > > > > > > > > > > And can be enabled both in fast free and no
> > > > > > > > > > > > > > fast free
> > mode.
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > Suggested-by: Honnappa Nagarahalli
> > > > > > > > > > > > > > <honnappa.nagaraha...@arm.com>
> > > > > > > > > > > > > > Signed-off-by: Feifei Wang
> > > > > > > > > > > > > > <feifei.wa...@arm.com>
> > > > > > > > > > > > > > Reviewed-by: Ruifeng Wang
> > > > > > > > > > > > > > <ruifeng.w...@arm.com>
> > > > > > > > > > > > > > Reviewed-by: Honnappa Nagarahalli
> > > > > > > > > > <honnappa.nagaraha...@arm.com>
> > > > > > > > > > > > > > ---
> > > > > > > > > > > > > >  drivers/net/i40e/i40e_ethdev.c                |   
> > > > > > > > > > > > > > 1 +
> > > > > > > > > > > > > >  drivers/net/i40e/i40e_ethdev.h                |   
> > > > > > > > > > > > > > 2 +
> > > > > > > > > > > > > >  .../net/i40e/i40e_recycle_mbufs_vec_common.c
> > > > > > > > > > > > > > |
> > > > > > > > > > > > > > 147
> > > > > > > > > > > > > > ++++++++++++++++++
> > > > > > > > > > > > > >  drivers/net/i40e/i40e_rxtx.c                  |  
> > > > > > > > > > > > > > 32 ++++
> > > > > > > > > > > > > >  drivers/net/i40e/i40e_rxtx.h                  |   
> > > > > > > > > > > > > > 4 +
> > > > > > > > > > > > > >  drivers/net/i40e/meson.build                  |   
> > > > > > > > > > > > > > 1 +
> > > > > > > > > > > > > >  6 files changed, 187 insertions(+)  create
> > > > > > > > > > > > > > mode
> > > > > > > > > > > > > > 100644
> > > > > > > > > > > > > > drivers/net/i40e/i40e_recycle_mbufs_vec_common
> > > > > > > > > > > > > > .c
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > diff --git a/drivers/net/i40e/i40e_ethdev.c
> > > > > > > > > > > > > > b/drivers/net/i40e/i40e_ethdev.c index
> > > > > > > > > > > > > > 8271bbb394..50ba9aac94
> > > > > > > > > > > > > > 100644
> > > > > > > > > > > > > > --- a/drivers/net/i40e/i40e_ethdev.c
> > > > > > > > > > > > > > +++ b/drivers/net/i40e/i40e_ethdev.c
> > > > > > > > > > > > > > @@ -496,6 +496,7 @@ static const struct
> > > > > > > > > > > > > > eth_dev_ops i40e_eth_dev_ops
> > > > > > > > > > > > = {
> > > > > > > > > > > > > >     .flow_ops_get                 =
> i40e_dev_flow_ops_get,
> > > > > > > > > > > > > >     .rxq_info_get                 = 
> > > > > > > > > > > > > > i40e_rxq_info_get,
> > > > > > > > > > > > > >     .txq_info_get                 = 
> > > > > > > > > > > > > > i40e_txq_info_get,
> > > > > > > > > > > > > > +   .recycle_rxq_info_get         =
> > > > > i40e_recycle_rxq_info_get,
> > > > > > > > > > > > > >     .rx_burst_mode_get            =
> > > > > i40e_rx_burst_mode_get,
> > > > > > > > > > > > > >     .tx_burst_mode_get            =
> > > > > i40e_tx_burst_mode_get,
> > > > > > > > > > > > > >     .timesync_enable              =
> i40e_timesync_enable,
> > > > > > > > > > > > > > diff --git a/drivers/net/i40e/i40e_ethdev.h
> > > > > > > > > > > > > > b/drivers/net/i40e/i40e_ethdev.h index
> > > > > > > > > > > > > > 6f65d5e0ac..af758798e1
> > > > > > > > > > > > > > 100644
> > > > > > > > > > > > > > --- a/drivers/net/i40e/i40e_ethdev.h
> > > > > > > > > > > > > > +++ b/drivers/net/i40e/i40e_ethdev.h
> > > > > > > > > > > > > > @@ -1355,6 +1355,8 @@ void
> > > > > > > > > > > > > > i40e_rxq_info_get(struct rte_eth_dev *dev,
> > > > > > > > > > > > > > uint16_t
> > queue_id,
> > > > > > > > > > > > > >     struct rte_eth_rxq_info *qinfo);  void
> > > > > > > > > > > > > > i40e_txq_info_get(struct rte_eth_dev *dev,
> > > > > > > > > > > > > > uint16_t
> > > > queue_id,
> > > > > > > > > > > > > >     struct rte_eth_txq_info *qinfo);
> > > > > > > > > > > > > > +void i40e_recycle_rxq_info_get(struct
> > > > > > > > > > > > > > +rte_eth_dev *dev, uint16_t
> > > > > > > > > > > > queue_id,
> > > > > > > > > > > > > > +   struct rte_eth_recycle_rxq_info
> > > > > > > > > > > > > > +*recycle_rxq_info);
> > > > > > > > > > > > > >  int i40e_rx_burst_mode_get(struct rte_eth_dev
> > > > > > > > > > > > > > *dev, uint16_t
> > > > > > > > > > queue_id,
> > > > > > > > > > > > > >                        struct rte_eth_burst_mode
> *mode);
> > > > > int
> > > > > > > > > > > > > > i40e_tx_burst_mode_get(struct rte_eth_dev
> > > > > > > > > > > > > > *dev, uint16_t queue_id, diff -- git
> > > > > > > > > > > > > > a/drivers/net/i40e/i40e_recycle_mbufs_vec_comm
> > > > > > > > > > > > > > on
> > > > > > > > > > > > > > .c
> > > > > > > > > > > > > > b/drivers/net/i40e/i40e_recycle_mbufs_vec_comm
> > > > > > > > > > > > > > on .c new file mode 100644 index
> > > > > > > > > > > > > > 0000000000..5663ecccde
> > > > > > > > > > > > > > --- /dev/null
> > > > > > > > > > > > > > +++ b/drivers/net/i40e/i40e_recycle_mbufs_vec_
> > > > > > > > > > > > > > +++ co
> > > > > > > > > > > > > > +++ mmon
> > > > > > > > > > > > > > +++ .c
> > > > > > > > > > > > > > @@ -0,0 +1,147 @@
> > > > > > > > > > > > > > +/* SPDX-License-Identifier: BSD-3-Clause
> > > > > > > > > > > > > > + * Copyright (c) 2023 Arm Limited.
> > > > > > > > > > > > > > + */
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +#include <stdint.h> #include
> > > > > > > > > > > > > > +<ethdev_driver.h>
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +#include "base/i40e_prototype.h"
> > > > > > > > > > > > > > +#include "base/i40e_type.h"
> > > > > > > > > > > > > > +#include "i40e_ethdev.h"
> > > > > > > > > > > > > > +#include "i40e_rxtx.h"
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +#pragma GCC diagnostic ignored "-Wcast-qual"
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +void
> > > > > > > > > > > > > > +i40e_recycle_rx_descriptors_refill_vec(void
> > > > > > > > > > > > > > +*rx_queue, uint16_t
> > > > > > > > > > > > > > +nb_mbufs) {
> > > > > > > > > > > > > > +   struct i40e_rx_queue *rxq = rx_queue;
> > > > > > > > > > > > > > +   struct i40e_rx_entry *rxep;
> > > > > > > > > > > > > > +   volatile union i40e_rx_desc *rxdp;
> > > > > > > > > > > > > > +   uint16_t rx_id;
> > > > > > > > > > > > > > +   uint64_t paddr;
> > > > > > > > > > > > > > +   uint64_t dma_addr;
> > > > > > > > > > > > > > +   uint16_t i;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   rxdp = rxq->rx_ring + rxq->rxrearm_start;
> > > > > > > > > > > > > > +   rxep = &rxq->sw_ring[rxq->rxrearm_start];
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   for (i = 0; i < nb_mbufs; i++) {
> > > > > > > > > > > > > > +           /* Initialize rxdp descs. */
> > > > > > > > > > > > > > +           paddr = (rxep[i].mbuf)->buf_iova +
> > > > > > > > > > > > > > RTE_PKTMBUF_HEADROOM;
> > > > > > > > > > > > > > +           dma_addr = rte_cpu_to_le_64(paddr);
> > > > > > > > > > > > > > +           /* flush desc with pa dma_addr */
> > > > > > > > > > > > > > +           rxdp[i].read.hdr_addr = 0;
> > > > > > > > > > > > > > +           rxdp[i].read.pkt_addr = dma_addr;
> > > > > > > > > > > > > > +   }
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   /* Update the descriptor initializer index */
> > > > > > > > > > > > > > +   rxq->rxrearm_start += nb_mbufs;
> > > > > > > > > > > > > > +   rx_id = rxq->rxrearm_start - 1;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   if (unlikely(rxq->rxrearm_start >= rxq-
> >nb_rx_desc)) {
> > > > > > > > > > > > > > +           rxq->rxrearm_start = 0;
> > > > > > > > > > > > > > +           rx_id = rxq->nb_rx_desc - 1;
> > > > > > > > > > > > > > +   }
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   rxq->rxrearm_nb -= nb_mbufs;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   rte_io_wmb();
> > > > > > > > > > > > > > +   /* Update the tail pointer on the NIC */
> > > > > > > > > > > > > > +   I40E_PCI_REG_WRITE_RELAXED(rxq->qrx_tail,
> > > > > > > > > > > > > > +rx_id); }
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +uint16_t
> > > > > > > > > > > > > > +i40e_recycle_tx_mbufs_reuse_vec(void *tx_queue,
> > > > > > > > > > > > > > +   struct rte_eth_recycle_rxq_info
> *recycle_rxq_info) {
> > > > > > > > > > > > > > +   struct i40e_tx_queue *txq = tx_queue;
> > > > > > > > > > > > > > +   struct i40e_tx_entry *txep;
> > > > > > > > > > > > > > +   struct rte_mbuf **rxep;
> > > > > > > > > > > > > > +   int i, n;
> > > > > > > > > > > > > > +   uint16_t nb_recycle_mbufs;
> > > > > > > > > > > > > > +   uint16_t avail = 0;
> > > > > > > > > > > > > > +   uint16_t mbuf_ring_size = recycle_rxq_info-
> > > > > >mbuf_ring_size;
> > > > > > > > > > > > > > +   uint16_t mask = recycle_rxq_info-
> >mbuf_ring_size - 1;
> > > > > > > > > > > > > > +   uint16_t refill_requirement =
> > > > > > > > > > > > > > +recycle_rxq_info-
> > > > > > > > > > >refill_requirement;
> > > > > > > > > > > > > > +   uint16_t refill_head = *recycle_rxq_info-
> >refill_head;
> > > > > > > > > > > > > > +   uint16_t receive_tail =
> > > > > > > > > > > > > > +*recycle_rxq_info->receive_tail;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   /* Get available recycling Rx buffers. */
> > > > > > > > > > > > > > +   avail = (mbuf_ring_size - (refill_head -
> > > > > > > > > > > > > > +receive_tail)) & mask;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   /* Check Tx free thresh and Rx available
> space. */
> > > > > > > > > > > > > > +   if (txq->nb_tx_free > txq->tx_free_thresh ||
> > > > > > > > > > > > > > +avail <=
> > > > > > > > > > > > > > +txq-
> > > > > > > > > > >tx_rs_thresh)
> > > > > > > > > > > > > > +           return 0;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   /* check DD bits on threshold descriptor */
> > > > > > > > > > > > > > +   if
> > > > > > > > > > > > > > +((txq->tx_ring[txq->tx_next_dd].cmd_type_offs
> > > > > > > > > > > > > > +et
> > > > > > > > > > > > > > +_bsz
> > > > > > > > > > > > > > +&
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > >     rte_cpu_to_le_64(I40E_TXD_QW1_DTYPE_MASK)) !=
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > rte_cpu_to_le_64(I40E_TX_DESC_DTYPE_DESC_DONE)
> > > > > > > > > > > > > > )
> > > > > > > > > > > > > > +           return 0;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   n = txq->tx_rs_thresh;
> > > > > > > > > > > > > > +   nb_recycle_mbufs = n;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   /* Mbufs recycle mode can only support no
> > > > > > > > > > > > > > +ring buffer
> > > > > > > > > > wrapping
> > > > > > > > > > > > > > around.
> > > > > > > > > > > > > > +    * Two case for this:
> > > > > > > > > > > > > > +    *
> > > > > > > > > > > > > > +    * case 1: The refill head of Rx buffer ring
> > > > > > > > > > > > > > +needs to be aligned
> > > > > > > > > > with
> > > > > > > > > > > > > > +    * mbuf ring size. In this case, the number
> > > > > > > > > > > > > > +of Tx
> > > > > freeing buffers
> > > > > > > > > > > > > > +    * should be equal to refill_requirement.
> > > > > > > > > > > > > > +    *
> > > > > > > > > > > > > > +    * case 2: The refill head of Rx ring buffer
> > > > > > > > > > > > > > +does not need to be
> > > > > > > > > > aligned
> > > > > > > > > > > > > > +    * with mbuf ring size. In this case, the
> > > > > > > > > > > > > > +update of refill head
> > > > > > > > > > can not
> > > > > > > > > > > > > > +    * exceed the Rx mbuf ring size.
> > > > > > > > > > > > > > +    */
> > > > > > > > > > > > > > +   if (refill_requirement != n ||
> > > > > > > > > > > > > > +           (!refill_requirement && (refill_head +
> n >
> > > > > > > > > > mbuf_ring_size)))
> > > > > > > > > > > > > > +           return 0;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   /* First buffer to free from S/W ring is at 
> > > > > > > > > > > > > > index
> > > > > > > > > > > > > > +    * tx_next_dd - (tx_rs_thresh-1).
> > > > > > > > > > > > > > +    */
> > > > > > > > > > > > > > +   txep = &txq->sw_ring[txq->tx_next_dd - (n -
> 1)];
> > > > > > > > > > > > > > +   rxep = recycle_rxq_info->mbuf_ring;
> > > > > > > > > > > > > > +   rxep += refill_head;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   if (txq->offloads &
> > > > > RTE_ETH_TX_OFFLOAD_MBUF_FAST_FREE) {
> > > > > > > > > > > > > > +           /* Avoid txq contains buffers from
> > > > > > > > > > > > > > +unexpected
> > > > > > > > > > mempool. */
> > > > > > > > > > > > > > +           if (unlikely(recycle_rxq_info->mp
> > > > > > > > > > > > > > +                                   !=
> txep[0].mbuf-
> > > > > >pool))
> > > > > > > > > > > > > > +                   return 0;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +           /* Directly put mbufs from Tx to Rx. */
> > > > > > > > > > > > > > +           for (i = 0; i < n; i++)
> > > > > > > > > > > > > > +                   rxep[i] = txep[i].mbuf;
> > > > > > > > > > > > > > +   } else {
> > > > > > > > > > > > > > +           for (i = 0; i < n; i++) {
> > > > > > > > > > > > > > +                   rxep[i] =
> > > > > > > > > > rte_pktmbuf_prefree_seg(txep[i].mbuf);
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +                   /* If Tx buffers are not the 
> > > > > > > > > > > > > > last
> > > > > reference or
> > > > > > > > > > from
> > > > > > > > > > > > > > +                    * unexpected mempool,
> previous
> > > > > copied
> > > > > > > > > > buffers are
> > > > > > > > > > > > > > +                    * considered as invalid.
> > > > > > > > > > > > > > +                    */
> > > > > > > > > > > > > > +                   if (unlikely((rxep[i] == NULL
> &&
> > > > > > > > > > refill_requirement) ||
> > > > > > > > > > > > > [Konstantin]
> > > > > > > > > > > > > Could you pls remind me why it is ok to have
> > > > > > > > > > > > > rxep[i]==NULL when refill_requirement is not set?
> > > > > > > > > > > > >
> > > > > > > > > > > > > If reill_requirement is not zero, it means each
> > > > > > > > > > > > > tx freed buffer must be valid and can be put into Rx
> sw_ring.
> > > > > > > > > > > > > Then the refill head of Rx buffer ring can be
> > > > > > > > > > > > > aligned with mbuf ring size. Briefly speaking
> > > > > > > > > > > > > the number
> > > > > > > > > > > > of Tx valid freed buffer must be equal to Rx
> > refill_requirement.
> > > > > > > > > > > > For example, i40e driver.
> > > > > > > > > > > > >
> > > > > > > > > > > > > If reill_requirement is zero, it means that the
> > > > > > > > > > > > > refill head of Rx buffer ring does not need to
> > > > > > > > > > > > > be aligned with mbuf ring size, thus if Tx have
> > > > > > > > > > > > > n valid freed buffers, we just need to put these
> > > > > > > > > > > > > n buffers into Rx
> > > > > > > > > > > > > sw-
> > > > > > > > > > > > ring, and not to be equal to the Rx setting rearm 
> > > > > > > > > > > > number.
> > > > > > > > > > > > For example, mlx5 driver.
> > > > > > > > > > > > >
> > > > > > > > > > > > > In conclusion, above difference is due to pmd
> > > > > > > > > > > > > drivers have different
> > > > > > > > > > > > strategies to update their Rx rearm(refill) head.
> > > > > > > > > > > > > For i40e driver, if rearm_head exceed 1024, it
> > > > > > > > > > > > > will be set as
> > > > > > > > > > > > > 0 due to  the
> > > > > > > > > > > > number of each rearm is a fixed value by default.
> > > > > > > > > > > > > For mlx5 driver. Its rearm_head can exceed 1024,
> > > > > > > > > > > > > and use mask to achieve
> > > > > > > > > > > > real index. Thus its rearm number can be a different 
> > > > > > > > > > > > value.
> > > > > > > > > > > >
> > > > > > > > > > > > Ok, but if rte_pktmbuf_prefree_seg(txep[i].mbuf),
> > > > > > > > > > > > it means that this mbuf is not free yet and can't be
> reused.
> > > > > > > > > > > > Shouldn't we then set nb_recycle_mbufs = 0 in that
> > > > > > > > > > > > case
> > too?
> > > > > > > > > > > > Or probably would be enough to skip that mbuf?
> > > > > > > > > > > > Might be something like that:
> > > > > > > > > > > >
> > > > > > > > > > > > for (i = 0, j = 0; i < n; i++) {
> > > > > > > > > > > >
> > > > > > > > > > > >         rxep[j] = rte_pktmbuf_prefree_seg(txep[i].mbuf);
> > > > > > > > > > > >         if (rxep[j] == NULL || recycle_rxq_info->mp !=
> > > > > > > > > > > > rxep[j].mbuf-
> > > > > > >pool))  {
> > > > > > > > > > > >                 if (refill_requirement) {
> > > > > > > > > > > >                         nb_recycle_mbufs = 0;
> > > > > > > > > > > >                         break;
> > > > > > > > > > > >                 }
> > > > > > > > > > > >         } else
> > > > > > > > > > > >                 j++;
> > > > > > > > > > > > }
> > > > > > > > > > > >
> > > > > > > > > > > > /* now j contains actual number of recycled mbufs
> > > > > > > > > > > > */
> > > > > > > > > > > >
> > > > > > > > > > > > ?
> > > > > > > > > > > >
> > > > > > > > > > > > >
> > > > > > > > > > > > > > +
>       recycle_rxq_info-
> > > > > >mp !=
> > > > > > > > > > txep[i].mbuf-
> > > > > > > > > > > > > > >pool))
> > > > > > > > > > > > > > +                           nb_recycle_mbufs = 0;
> > > > > > > > > > > > > > +           }
> > > > > > > > > > > > > > +           /* If Tx buffers are not the last
> reference or
> > > > > > > > > > > > > > +            * from unexpected mempool, all
> recycled
> > > > > buffers
> > > > > > > > > > > > > > +            * are put into mempool.
> > > > > > > > > > > > > > +            */
> > > > > > > > > > > > > > +           if (nb_recycle_mbufs == 0)
> > > > > > > > > > > > > > +                   for (i = 0; i < n; i++) {
> > > > > > > > > > > > > > +                           if (rxep[i] != NULL)
> > > > > > > > > > > > > > +
> > > > >       rte_mempool_put(rxep[i]-
> > > > > > > > > > >pool,
> > > > > > > > > > > > > > rxep[i]);
> > > > > > > > > > > > > > +                   }
> > > > > > > > > > > > > > +   }
> > > > > > > > > > > > > > +
> > > > > > > > > > > [Konstantin] After another thought, it might be
> > > > > > > > > > > easier and cleaner
> > > > > > just to:
> > > > > > > > > > > if (rxep[j] == NULL || recycle_rxq_info->mp !=
> > > > > > > > > > > rxep[j].mbuf-
> > >pool)
> > > > > > > > > > >   nb_recycle_mbufs = 0;
> > > > > > > > > > >
> > > > > > > > > > > Anyway, from my understanding - if
> > > > > > > > > > > rte_pktmbuf_prefree_seg(mbuf) returns NULL, then we
> > > > > > > > > > > can't recycle
> > > > > > that mbuf.
> > > > > > > > > > >
> > > > > > > > > > > [Feifei] Agree with that
> > > > > > > > > > > 'rte_pktmbuf_prefree_seg(mbuf) returns NULL, then
> > > > > > > > > > we can't recycle that mbuf'.
> > > > > > > > > > >
> > > > > > > > > > > Firstly, we should know for i40e driver,  the number
> > > > > > > > > > > of free mbufs is fixed, it
> > > > > > > > > > must equal to 'tx_rs_thresh'
> > > > > > > > > > > This means if we start to free Tx mbufs, it cannot
> > > > > > > > > > > be interrupted until the
> > > > > > > > > > given amount of mbufs are freed.
> > > > > > > > > > > In the meanwhile, doing prefree operation for a Tx
> > > > > > > > > > > mbuf can be looked at this mbuf is freed from this
> > > > > > > > > > > TX sw-ring if the API returns NULL. This is due
> > > > > > > > > > to that call 'prefree_seg' function means update the
> > > > > > > > > > mbuf
> > refcnt.
> > > > > > > > > > >
> > > > > > > > > > > So let's come back our recycle mbuf case.
> > > > > > > > > > > Here we consider that the case if we have 32 mbufs
> > > > > > > > > > > need to be freed, and
> > > > > > > > > > we firstly do the pre-free.
> > > > > > > > > > > And then first 8 mbufs is good and return value is not 
> > > > > > > > > > > none.
> > > > > > > > > > > But the 9th
> > > > > > > > > > mbuf is bad, its refcnt is more than 1.
> > > > > > > > > > > So we just update its refcnt and cannot put it into Rx sw-
> ring.
> > > > > > > > > > > For Tx sw-ring,
> > > > > > > > > > this mbuf has been freed.
> > > > > > > > > > > Then we should continue to do pre-free operation for
> > > > > > > > > > > the next Tx mbufs to ensure the given amount of
> > > > > > > > > > > mbufs are
> > freed.
> > > > > > > > > > >
> > > > > > > > > > > Do a conclusion for this, this is because if we
> > > > > > > > > > > start to do pre-free operation,  the Tx mbuf refcnt
> > > > > > > > > > > value maybe changed, so we cannot stop or
> > > > > > > > > > break until finish all the pre-free operation.
> > > > > > > > > > >
> > > > > > > > > > > Finally, in the case that Rx refill_request is not
> > > > > > > > > > > zero, but the valid mbuf amount is less than this
> > > > > > > > > > > value, we must put back this Tx mbufs into
> > > > > > > > > > mempool.
> > > > > > > > > > >
> > > > > > > > > > > Above is the reason why I do not directly jump out
> > > > > > > > > > > of the loop if some mbuf
> > > > > > > > > > return value is NULL.
> > > > > > > > > >
> > > > > > > > > > Yep, I already realized that it is a bit more
> > > > > > > > > > complicated and we need to continue with prefree() for
> > > > > > > > > > all packets even when we get NULL in
> > > > > > > > the middle.
> > > > > > > > > > Anyway the code has to be changed, right?
> > > > > > > > > >
> > > > > > > > > Sorry, I think for the code, it is unnecessary to be changed.
> > > > > > > > > For no fast free path, currently:
> > > > > > > > > 1.   We check whether each mbuf is Ok and call pre_free
> function
> > > > > > > > > --------------------------------------------------------
> > > > > > > > > --
> > > > > > > > > ----
> > > > > > > > > --
> > > > > > > > > --
> > > > > > > > > ----
> > > > > > > > > --------------------------------------------------------
> > > > > > > > > --
> > > > > > > > > ----
> > > > > > > > > 2.1 For the mbuf return value is not NULL, it is put
> > > > > > > > > into Rx sw-
> > ring.
> > > > > > > > > 2.2 For the mbuf return value is zero and
> > > > > > > > > refill-request, it will also firstly put into Rx
> > > > > > > > > sw-ring, and we set nb_recycle =
> > > > > > > > > 0
> > > > > > > > > --------------------------------------------------------
> > > > > > > > > --
> > > > > > > > > ----
> > > > > > > > > --
> > > > > > > > > --
> > > > > > > > > ----
> > > > > > > > > --------------------------------------------------------
> > > > > > > > > --
> > > > > > > > > ----
> > > > > > > > > 3.1 We check nb_recycle, if it is not 0, we will
> > > > > > > > > continue to rearm Rx descs
> > > > > > > > and update its index by call descs_refill function.
> > > > > > > > > 3.2 if nb_recycle is 0, we will put valid recycle mbufs
> > > > > > > > > back into mempool as general path. This is necessary,
> > > > > > > > > because we need to ensure the freed Tx number is
> > > > > > > > > fixed.(Some buffers return is null can be seen as freed,
> > > > > > > > > others need to be put into mempool)
> > > > > > > > >
> > > > > > > > > Or maybe I ignore something?
> > > > > > > >
> > > > > > > >
> > > > > > > > I am talking about the case when both refill_requirement
> > > > > > > > and mbuf return values iare zero:
> > > > > > > > if (unlikely((rxep[i] == NULL && refill_requirement) ||         
> > > > > > > > // ???
> > > > rxep[i]
> > > > > ==
> > > > > > 0
> > > > > > > > AND refill_requirement == 0 ???
> > > > > > > >                 recycle_rxq_info->mp != txep[i].mbuf->pool))
> > > > > > > >         nb_recycle_mbufs = 0;
> > > > > > > >
> > > > > > > > As I can read the code you treat such situation as valid,
> > > > > > > > while I think we should reset nb_recycle_mbufs to zero
> > > > > > > > when rxep[i] == NULL, no matter what value refill_requirement 
> > > > > > > > is.
> > > > > > >
> > > > > > > So this means for maybe MLX5 driver, its refill_request = 0.
> > > > > > > And if some mbufs return value is zero, the other mbufs can
> > > > > > > not be recycled into Rx sw-ring? Because nb_recycle=0, and
> > > > > > > they need to be put into
> > > > > > mempool.
> > > > > > >
> > > > > > > I think for such as MLX5 driver, we can allow recycle some
> > > > > > > valid mbufs into
> > > > > > Rx ring.
> > > > > > > Because no constraint for its refill number. Is my
> > > > > > > suggestion
> > reasonable?
> > > > > >
> > > > > > I suppose yes: if refill_request is zero we potentially can skip 
> > > > > > 'bad'
> > > > > > mbufs and continue with recycling for remaining ones.
> > > > > > It would probably require more changes in current code, but
> > > > > > sounds ok to me in general.
> > > > > That's Ok. Thanks for your careful reviewing.
> > > >
> > > > I'm very sorry not to receive your e-mail and until now I realize
> > > > we need to do some code change for i40e driver. Also thanks ferruh
> > > > to kindly
> > remind this.
> >
> > No worries at all and thanks for you and Ferruh fro fast reaction.
> >
> > > >
> > > > Agree with you we need some operation for the case that
> > > > (refill_requirement == 0 &&  rxep[i] == 0).
> > > > Thus maybe we can do a change as follows:
> > > >
> > > > for (i = 0; i < n; i++) {
> > > >         rxep[0] = rte_pktmbuf_prefree_seg(txep[i].mbuf);
> > > >         if (unlikely((rxep[0] == NULL && refill_requirement) ||
> > > >                         recycle_rxq_info->mp != txep[i].mbuf->pool))
> > > >                 nb_recycle_mbufs = 0;
> > > >         if (likely(rxep[0]))
> > > >                 rxep++;
> > > > }
> > > >
> > > > Is above change is Ok?
> > > >
> > >
> > > Just do a change for the above code:
> > > --------------------------------------------------------------------
> > > --
> > > -------------------------------
> > > int j = 0;
> > > ......
> > >
> > >  for (i = 0; i < n; i++) {
> > >   rxep[j] = rte_pktmbuf_prefree_seg(txep[i].mbuf);
> > >   if (unlikely(rxep[j] == NULL  && !refill_requirement))
> > >           continue;
> > >   if (unlikely((rxep[j] == NULL && refill_requirement) ||
> > >                   recycle_rxq_info->mp != txep[i].mbuf->pool))
> > >           nb_recycle_mbufs  = 0;
> > >   j++;
> > >  }
> > >
> > > if (nb_recycle_mbufs  == 0)
> > >   for (i = 0; i < j; i++) {
> > >           if (rxep[i] != NULL)
> > >                   rte_mempool_put(rxep[i]->pool, rxep[i]);
> > >   }
> >
> > I think it should work...
> > One more thing, shouldn't we reset
> > nb_recycle_mbufs  = j;
> >
> > Something like:
> >  if (nb_recycle_mbufs  == 0) {
> >     for (i = 0; i < j; i++) {
> >             if (rxep[i] != NULL)
> >                     rte_mempool_put(rxep[i]->pool, rxep[i]);
> >     }
> > } else
> >   nb_recycle_mbufs  = j;
> >
> > ?
> >
> >
> Yes, we should reset it.
> 
> I just do a performance test with the new code and find the performance
> decrease with  8%, comparing with the old version.
> 
> Thus I think we should come back your previous idea:
> "we should reset nb_recycle_mbufs to zero when rxep[i] == NULL, no matter
> what value refill_requirement is":
> 
> for (i = 0; i < n; i++) {
>       rxep[i] = rte_pktmbuf_prefree_seg(txep[i].mbuf);
>       if (unlikely((rxep[i] == NULL) ||
>                       recycle_rxq_info->mp != txep[i].mbuf->pool))
>               nb_recycle_mbufs  = 0;
> }
> 
> Thus we drop the case that putting the remaining valid mbuf into Rx sw-ring
> when no refill_request, but maintain good performance.

[Konstantin] I am ok with that way too.
Again that way, it looks simpler and probaly less error-prone.

Thanks
Konstantin
[Feifei] Agree with this, I will update a new version to the community.
> 
> >
> >
> >
> > > --------------------------------------------------------------------
> > > --
> > > -----------------------------------
> 
> > > > > >
> > > > > > > >
> > > > > > > >
> > > > > > > > > > >
> > > > > > > > > > > > > > +   /* Update counters for Tx. */
> > > > > > > > > > > > > > +   txq->nb_tx_free = (uint16_t)(txq->nb_tx_free
> > > > > > > > > > > > > > ++
> > > > > > > > > > > > > > +txq-
> > > > > > > > > > >tx_rs_thresh);
> > > > > > > > > > > > > > +   txq->tx_next_dd = (uint16_t)(txq->tx_next_dd
> > > > > > > > > > > > > > ++
> > > > > > > > > > > > > > +txq-
> > > > > > > > > > >tx_rs_thresh);
> > > > > > > > > > > > > > +   if (txq->tx_next_dd >= txq->nb_tx_desc)
> > > > > > > > > > > > > > +           txq->tx_next_dd = (uint16_t)(txq-
> > > > > >tx_rs_thresh -
> > > > > > > > > > > > > > +1);
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   return nb_recycle_mbufs; }
> > > > > > > > > > > > > > diff --git a/drivers/net/i40e/i40e_rxtx.c
> > > > > > > > > > > > > > b/drivers/net/i40e/i40e_rxtx.c index
> > > > > > > > > > > > > > b4f65b58fa..a9c9eb331c
> > > > > > > > > > > > > > 100644
> > > > > > > > > > > > > > --- a/drivers/net/i40e/i40e_rxtx.c
> > > > > > > > > > > > > > +++ b/drivers/net/i40e/i40e_rxtx.c
> > > > > > > > > > > > > > @@ -3199,6 +3199,30 @@
> > > > > > > > > > > > > > i40e_txq_info_get(struct rte_eth_dev *dev, uint16_t
> queue_id,
> > > > > > > > > > > > > >     qinfo->conf.offloads = txq->offloads;  }
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > +void
> > > > > > > > > > > > > > +i40e_recycle_rxq_info_get(struct rte_eth_dev
> > > > > > > > > > > > > > +*dev, uint16_t
> > > > > > > > > > queue_id,
> > > > > > > > > > > > > > +   struct rte_eth_recycle_rxq_info
> *recycle_rxq_info) {
> > > > > > > > > > > > > > +   struct i40e_rx_queue *rxq;
> > > > > > > > > > > > > > +   struct i40e_adapter *ad =
> > > > > > > > > > > > > > +           I40E_DEV_PRIVATE_TO_ADAPTER(dev-
> >data-
> > > > > > > > > > >dev_private);
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   rxq = dev->data->rx_queues[queue_id];
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   recycle_rxq_info->mbuf_ring = (void *)rxq-
> >sw_ring;
> > > > > > > > > > > > > > +   recycle_rxq_info->mp = rxq->mp;
> > > > > > > > > > > > > > +   recycle_rxq_info->mbuf_ring_size = rxq-
> >nb_rx_desc;
> > > > > > > > > > > > > > +   recycle_rxq_info->receive_tail =
> > > > > > > > > > > > > > +&rxq->rx_tail;
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > > +   if (ad->rx_vec_allowed) {
> > > > > > > > > > > > > > +           recycle_rxq_info->refill_requirement =
> > > > > > > > > > > > > > RTE_I40E_RXQ_REARM_THRESH;
> > > > > > > > > > > > > > +           recycle_rxq_info->refill_head = &rxq-
> > > > > >rxrearm_start;
> > > > > > > > > > > > > > +   } else {
> > > > > > > > > > > > > > +           recycle_rxq_info->refill_requirement =
> rxq-
> > > > > > > > > > >rx_free_thresh;
> > > > > > > > > > > > > > +           recycle_rxq_info->refill_head = &rxq-
> > > > > >rx_free_trigger;
> > > > > > > > > > > > > > +   }
> > > > > > > > > > > > > > +}
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > >  #ifdef RTE_ARCH_X86  static inline bool
> > > > > > > > > > > > > > get_avx_supported(bool request_avx512) @@
> > > > > > > > > > > > > > -3293,6
> > > > > > > > > > > > > > +3317,8
> > > > > > > > @@
> > > > > > > > > > > > > > i40e_set_rx_function(struct rte_eth_dev *dev)
> > > > > > > > > > > > > >                             dev->rx_pkt_burst =
> ad-
> > > > > > > > >rx_use_avx2 ?
> > > > > > > > > > > > > >
> > > > > > > >         i40e_recv_scattered_pkts_vec_avx2 :
> > > > > > > > > > > > > >
> > > > >       i40e_recv_scattered_pkts_vec;
> > > > > > > > > > > > > > +                           dev-
> > > > > >recycle_rx_descriptors_refill =
> > > > > > > > > > > > > > +
> > > > > > > > > >     i40e_recycle_rx_descriptors_refill_vec;
> > > > > > > > > > > > > >                     }
> > > > > > > > > > > > > >             } else {
> > > > > > > > > > > > > >                     if (ad->rx_use_avx512) { @@ -
> 3311,9
> > > > > +3337,12
> > > > > > @@
> > > > > > > > > > > > > > i40e_set_rx_function(struct rte_eth_dev
> > > > > > > > > > *dev)
> > > > > > > > > > > > > >                             dev->rx_pkt_burst =
> ad-
> > > > > > > > >rx_use_avx2 ?
> > > > > > > > > > > > > >
> > > > >       i40e_recv_pkts_vec_avx2 :
> > > > > > > > > > > > > >
>       i40e_recv_pkts_vec;
> > > > > > > > > > > > > > +                           dev-
> > > > > >recycle_rx_descriptors_refill =
> > > > > > > > > > > > > > +
> > > > > > > > > >     i40e_recycle_rx_descriptors_refill_vec;
> > > > > > > > > > > > > >                     }
> > > > > > > > > > > > > >             }
> > > > > > > > > > > > > >  #else /* RTE_ARCH_X86 */
> > > > > > > > > > > > > > +           dev->recycle_rx_descriptors_refill =
> > > > > > > > > > > > > > +i40e_recycle_rx_descriptors_refill_vec;
> > > > > > > > > > > > > >             if (dev->data->scattered_rx) {
> > > > > > > > > > > > > >                     PMD_INIT_LOG(DEBUG,
> > > > > > > > > > > > > >                                  "Using Vector
> Scattered Rx
> > > > > > > > (port %d).", @@
> > > > > > > > > > > > > > -3481,15 +3510,18 @@
> > > > > > > > > > > > > > i40e_set_tx_function(struct rte_eth_dev
> > > > > > > > *dev)
> > > > > > > > > > > > > >                             dev->tx_pkt_burst =
> ad-
> > > > > > > > >tx_use_avx2 ?
> > > > > > > > > > > > > >
> > > > > > > > i40e_xmit_pkts_vec_avx2 :
> > > > > > > > > > > > > >
> > > > > > > > i40e_xmit_pkts_vec;
> > > > > > > > > > > > > > +                           dev-
> >recycle_tx_mbufs_reuse
> > > > > =
> > > > > > > > > > > > > > i40e_recycle_tx_mbufs_reuse_vec;
> > > > > > > > > > > > > >                     }
> > > > > > > > > > > > > >  #else /* RTE_ARCH_X86 */
> > > > > > > > > > > > > >                     PMD_INIT_LOG(DEBUG,
> "Using Vector
> > > > > Tx
> > > > > > > > (port %d).",
> > > > > > > > > > > > > >                                  dev->data-
> >port_id);
> > > > > > > > > > > > > >                     dev->tx_pkt_burst =
> > > > > i40e_xmit_pkts_vec;
> > > > > > > > > > > > > > +                   dev->recycle_tx_mbufs_reuse
> =
> > > > > > > > > > > > > > i40e_recycle_tx_mbufs_reuse_vec;  #endif /*
> > > > > > > > > > > > > > RTE_ARCH_X86
> > > > > */
> > > > > > > > > > > > > >             } else {
> > > > > > > > > > > > > >                     PMD_INIT_LOG(DEBUG,
> "Simple tx
> > > > > finally be
> > > > > > > > used.");
> > > > > > > > > > > > > >                     dev->tx_pkt_burst =
> > > > > i40e_xmit_pkts_simple;
> > > > > > > > > > > > > > +                   dev->recycle_tx_mbufs_reuse
> =
> > > > > > > > > > > > > > i40e_recycle_tx_mbufs_reuse_vec;
> > > > > > > > > > > > > >             }
> > > > > > > > > > > > > >             dev->tx_pkt_prepare =
> > > > > i40e_simple_prep_pkts;
> > > > > > > > > > > > > >     } else {
> > > > > > > > > > > > > > diff --git a/drivers/net/i40e/i40e_rxtx.h
> > > > > > > > > > > > > > b/drivers/net/i40e/i40e_rxtx.h index
> > > > > > > > > > > > > > a8686224e5..b191f23e1f
> > > > > > > > > > > > > > 100644
> > > > > > > > > > > > > > --- a/drivers/net/i40e/i40e_rxtx.h
> > > > > > > > > > > > > > +++ b/drivers/net/i40e/i40e_rxtx.h
> > > > > > > > > > > > > > @@ -236,6 +236,10 @@ uint32_t
> > > > > > > > > > > > > > i40e_dev_rx_queue_count(void *rx_queue);  int
> > > > > > > > > > > > > > i40e_dev_rx_descriptor_status(void *rx_queue,
> > > > > > > > > > > > > > uint16_t offset);  int
> > > > > > > > > > > > > > i40e_dev_tx_descriptor_status(void
> > > > > > > > > > > > > > *tx_queue, uint16_t offset);
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > +uint16_t i40e_recycle_tx_mbufs_reuse_vec(void
> > > > *tx_queue,
> > > > > > > > > > > > > > +           struct rte_eth_recycle_rxq_info
> > > > > *recycle_rxq_info);
> > > > > > > > > > void
> > > > > > > > > > > > > > +i40e_recycle_rx_descriptors_refill_vec(void
> > > > > > > > > > > > > > +*rx_queue, uint16_t nb_mbufs);
> > > > > > > > > > > > > > +
> > > > > > > > > > > > > >  uint16_t i40e_recv_pkts_vec(void *rx_queue,
> > > > > > > > > > > > > > struct rte_mbuf
> > > > > > > > > > **rx_pkts,
> > > > > > > > > > > > > >                         uint16_t nb_pkts);  uint16_t
> > > > > > > > > > > > > > i40e_recv_scattered_pkts_vec(void *rx_queue,
> > > > > > > > > > > > > > diff --git a/drivers/net/i40e/meson.build
> > > > > > > > > > > > > > b/drivers/net/i40e/meson.build index
> > > > > > > > > > > > > > 8e53b87a65..3b1a233c84 100644
> > > > > > > > > > > > > > --- a/drivers/net/i40e/meson.build
> > > > > > > > > > > > > > +++ b/drivers/net/i40e/meson.build
> > > > > > > > > > > > > > @@ -34,6 +34,7 @@ sources = files(
> > > > > > > > > > > > > >          'i40e_tm.c',
> > > > > > > > > > > > > >          'i40e_hash.c',
> > > > > > > > > > > > > >          'i40e_vf_representor.c',
> > > > > > > > > > > > > > +   'i40e_recycle_mbufs_vec_common.c',
> > > > > > > > > > > > > >          'rte_pmd_i40e.c',
> > > > > > > > > > > > > >  )
> > > > > > > > > > > > > >
> > > > > > > > > > > > > > --
> > > > > > > > > > > > > > 2.25.1

Reply via email to