[PATCH] crypto/qat: fix cleanup function default behaviour

2022-06-15 Thread Arek Kusztal
Full cookie data should be cleared by default if not algorithm
specified. This is protection in case algorithm specific cleanup
is not implemented.

Fixes: 3b78aa7b2317 ("crypto/qat: refactor asymmetric crypto functions")

Signed-off-by: Arek Kusztal 
---
 drivers/crypto/qat/qat_asym.c | 4 
 1 file changed, 4 insertions(+)

diff --git a/drivers/crypto/qat/qat_asym.c b/drivers/crypto/qat/qat_asym.c
index e4ab4bf33d..6e43a6cac8 100644
--- a/drivers/crypto/qat/qat_asym.c
+++ b/drivers/crypto/qat/qat_asym.c
@@ -144,6 +144,10 @@ cleanup(struct qat_asym_op_cookie *cookie,
cleanup_arrays(cookie, QAT_ASYM_RSA_NUM_IN_PARAMS,
QAT_ASYM_RSA_NUM_OUT_PARAMS, alg_size);
}
+   } else {
+   cleanup_arrays(cookie, QAT_ASYM_MAX_PARAMS,
+   QAT_ASYM_MAX_PARAMS,
+   QAT_PKE_MAX_LN_SIZE);
}
 }
 
-- 
2.13.6



Re: [PATCH 2/2] common/cnxk: add include for macro definition

2022-06-15 Thread Thomas Monjalon
14/06/2022 14:29, Bruce Richardson:
> The header file "roc_io.h" uses the "__plt_always_inline" macro but
> don't include "roc_platform.h" to get the definition of it. This
> inclusion is not necessary for compilation, but the lack of it can
> confuse some indexers - such as those in eclipse, which reports the
> lines:
> 
> "static __plt_always_inline uint64_t"
> 
> as possible definitions of a variable called "uint64_t". This confusion
> leads to uint64_t being flagged as an unknown type in all other parts of
> the project being indexed, e.g. across all of DPDK code.
> 
> Adding in the include of rte_common.h makes it clear to the indexer that
> those lines are  part of a function definition, and that allows eclipse
> to correctly recognise uint64_t as a type from stdint.h

It is not rte_common.h
Looks like a copy past of the first patch.





[PATCH v2] vdpa/ifc: fix null pointer dereference

2022-06-15 Thread Andy Pei
Fix null pointer dereference reported in coverity scan.

Coverity issue: 378882
Fixes: 5d75517beffe ("vdpa/ifc/base: access correct register for blk device")

Signed-off-by: Andy Pei 
---
 drivers/vdpa/ifc/base/ifcvf.c | 9 +
 1 file changed, 9 insertions(+)

diff --git a/drivers/vdpa/ifc/base/ifcvf.c b/drivers/vdpa/ifc/base/ifcvf.c
index dd475a7..0a9f71a 100644
--- a/drivers/vdpa/ifc/base/ifcvf.c
+++ b/drivers/vdpa/ifc/base/ifcvf.c
@@ -255,6 +255,10 @@
u32 ring_state;
 
cfg = hw->common_cfg;
+   if (!cfg) {
+   DEBUGOUT("common_cfg in HW is NULL.\n");
+   return;
+   }
 
IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg->msix_config);
for (i = 0; i < hw->nr_vring; i++) {
@@ -262,6 +266,11 @@
IFCVF_WRITE_REG16(0, &cfg->queue_enable);
IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg->queue_msix_vector);
 
+   if (!hw->lm_cfg) {
+   DEBUGOUT("live migration cfg in HW is NULL.\n");
+   continue;
+   }
+
if (hw->device_type == IFCVF_BLK)
ring_state = *(u32 *)(hw->lm_cfg +
IFCVF_LM_RING_STATE_OFFSET +
-- 
1.8.3.1



RE: [PATCH v7 2/5] raw/ifpga: add N3000 AFU driver

2022-06-15 Thread Xu, Rosen
Hi Wei,

Some comments.

Thanks,
Rosen

> -Original Message-
> From: Huang, Wei 
> Sent: Thursday, June 09, 2022 15:37
> To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com;
> hemant.agra...@nxp.com
> Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei
> ; Zhang, Qi Z ; Huang, Wei
> 
> Subject: [PATCH v7 2/5] raw/ifpga: add N3000 AFU driver
> 
> N3000 AFU includes NLB0 and DMA modules, NLB0 is used to test PCI bus
> and DMA is used to test local memory.
> This driver initialize the modules and report test result.
> 
> Signed-off-by: Wei Huang 
> ---
> v2: move source files to ifpga and rename, refine code
> ---
> v3: fix Ubuntu 20.04 ARM build
> ---
>  drivers/raw/ifpga/afu_pmd_core.h  |   19 +
>  drivers/raw/ifpga/afu_pmd_n3000.c | 2019
> +
>  drivers/raw/ifpga/afu_pmd_n3000.h |  339 +++
>  drivers/raw/ifpga/meson.build |3 +-
>  drivers/raw/ifpga/rte_pmd_afu.h   |   97 ++
>  5 files changed, 2476 insertions(+), 1 deletion(-)
>  create mode 100644 drivers/raw/ifpga/afu_pmd_n3000.c
>  create mode 100644 drivers/raw/ifpga/afu_pmd_n3000.h
>  create mode 100644 drivers/raw/ifpga/rte_pmd_afu.h
> 
> diff --git a/drivers/raw/ifpga/afu_pmd_core.h
> b/drivers/raw/ifpga/afu_pmd_core.h
> index 4fad2c7..a938172 100644
> --- a/drivers/raw/ifpga/afu_pmd_core.h
> +++ b/drivers/raw/ifpga/afu_pmd_core.h
> @@ -14,6 +14,7 @@
>  #include 
> 
>  #include 
> +#include 
>  #include 
>  #include 
> 
> @@ -60,6 +61,24 @@ struct afu_rawdev {
>   return rawdev ? (struct afu_rawdev *)rawdev->dev_private : NULL;
>  }
> 
> +#define CLS_TO_SIZE(n)  ((n) << 6)  /* get size of n cache lines */
> +#define SIZE_TO_CLS(s)  ((s) >> 6)  /* convert size to number of cache lines
> */
> +#define MHZ(f)  ((f) * 100)
> +
> +#define dsm_poll_timeout(addr, val, cond, invl, timeout) \
> +({   \
> + uint64_t __wait = 0; \
> + uint64_t __invl = (invl);\
> + uint64_t __timeout = (timeout);  \
> + for (; __wait <= __timeout; __wait += __invl) {  \
> + (val) = *(addr); \
> + if (cond)\
> + break;   \
> + rte_delay_ms(__invl);\
> + }\
> + (cond) ? 0 : 1;  \
> +})
> +
>  void afu_pmd_register(struct afu_rawdev_drv *driver);
>  void afu_pmd_unregister(struct afu_rawdev_drv *driver);
> 
> diff --git a/drivers/raw/ifpga/afu_pmd_n3000.c
> b/drivers/raw/ifpga/afu_pmd_n3000.c
> new file mode 100644
> index 000..14f7fe0
> --- /dev/null
> +++ b/drivers/raw/ifpga/afu_pmd_n3000.c
> @@ -0,0 +1,2019 @@
> +/* SPDX-License-Identifier: BSD-3-Clause
> + * Copyright(c) 2022 Intel Corporation
> + */
> +
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +
> +#include "afu_pmd_core.h"
> +#include "afu_pmd_n3000.h"
> +
> +static int nlb_afu_config(struct afu_rawdev *dev)
> +{
> + struct n3000_afu_priv *priv = NULL;
> + struct rte_pmd_afu_nlb_cfg *cfg = NULL;
> + struct nlb_csr_cfg v;
> +
> + if (!dev)
> + return -EINVAL;
> +
> + if (!dev->priv)
> + return -ENOENT;
> +
> + priv = (struct n3000_afu_priv *)dev->priv;
> + cfg = &priv->nlb_cfg;
> +
> + v.csr = 0;
> +
> + if (cfg->cont)
> + v.cont = 1;
> +
> + if (cfg->cache_policy == NLB_WRPUSH_I)
> + v.wrpush_i = 1;
> + else
> + v.wrthru_en = cfg->cache_policy;
> +
> + if (cfg->cache_hint == NLB_RDLINE_MIXED)
> + v.rdsel = 3;
> + else
> + v.rdsel = cfg->cache_hint;
> +
> + v.mode = cfg->mode;
> + v.chsel = cfg->read_vc;
> + v.wr_chsel = cfg->write_vc;
> + v.wrfence_chsel = cfg->wrfence_vc;
> + v.wrthru_en = cfg->cache_policy;
> + v.multicl_len = cfg->multi_cl - 1;
> +
> + IFPGA_RAWDEV_PMD_DEBUG("cfg: 0x%08x", v.csr);
> + rte_write32(v.csr, priv->nlb_ctx.addr + CSR_CFG);
> +
> + return 0;
> +}
> +
> +static void nlb_afu_report(struct afu_rawdev *dev, uint32_t cl)
> +{
> + struct n3000_afu_priv *priv = NULL;
> + struct rte_pmd_afu_nlb_cfg *cfg = NULL;
> + struct nlb_dsm_status *stat = NULL;
> + uint64_t ticks = 0;
> + double num, rd_bw, wr_bw;
> +
> + if (!dev || !dev->priv)
> + return;
> +
> + priv = (struct n3000_afu_priv *)dev->priv;
> +
> + cfg = &priv->nlb_cfg;
> + stat = priv->nlb_ctx.status_ptr;
> +
> + if (cfg->cont)
> + ticks = stat->num_clocks - stat->start_overhead;
> + else
> + ticks = st

RE: [PATCH] vdpa/ifc: fix null pointer dereference

2022-06-15 Thread Pei, Andy
Hi Chenbo,

Thanks for your reply.
I will fix this in next version.

> -Original Message-
> From: Xia, Chenbo 
> Sent: Wednesday, June 15, 2022 10:04 AM
> To: Pei, Andy ; dev@dpdk.org
> Cc: maxime.coque...@redhat.com; Wang, Xiao W ;
> Xu, Rosen ; Xiao, QimaiX 
> Subject: RE: [PATCH] vdpa/ifc: fix null pointer dereference
> 
> Hi Andy,
> 
> > -Original Message-
> > From: Pei, Andy 
> > Sent: Wednesday, June 8, 2022 3:34 PM
> > To: dev@dpdk.org
> > Cc: Xia, Chenbo ; maxime.coque...@redhat.com;
> > Wang, Xiao W ; Xu, Rosen ;
> > Xiao, QimaiX 
> > Subject: [PATCH] vdpa/ifc: fix null pointer dereference
> >
> > Fix null pointer dereference reported in coverity scan.
> >
> > Coverity issue: 378882
> > Fixes: 8162a4a9 ("vdpa/ifc/base: access correct register for blk
> > device")
> 
> Commit id need to be 12 digits. Also pls leave a blank line between Fixes tag
> and signed-off
> 
> Thanks,
> Chenbo
> 
> > Signed-off-by: Andy Pei 
> > ---
> >  drivers/vdpa/ifc/base/ifcvf.c | 9 +
> >  1 file changed, 9 insertions(+)
> >
> > diff --git a/drivers/vdpa/ifc/base/ifcvf.c
> > b/drivers/vdpa/ifc/base/ifcvf.c index dd475a7..0a9f71a 100644
> > --- a/drivers/vdpa/ifc/base/ifcvf.c
> > +++ b/drivers/vdpa/ifc/base/ifcvf.c
> > @@ -255,6 +255,10 @@
> > u32 ring_state;
> >
> > cfg = hw->common_cfg;
> > +   if (!cfg) {
> > +   DEBUGOUT("common_cfg in HW is NULL.\n");
> > +   return;
> > +   }
> >
> > IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg->msix_config);
> > for (i = 0; i < hw->nr_vring; i++) { @@ -262,6 +266,11 @@
> > IFCVF_WRITE_REG16(0, &cfg->queue_enable);
> > IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg-
> > >queue_msix_vector);
> >
> > +   if (!hw->lm_cfg) {
> > +   DEBUGOUT("live migration cfg in HW is NULL.\n");
> > +   continue;
> > +   }
> > +
> > if (hw->device_type == IFCVF_BLK)
> > ring_state = *(u32 *)(hw->lm_cfg +
> > IFCVF_LM_RING_STATE_OFFSET +
> > --
> > 1.8.3.1



RE: [PATCH v7 3/5] raw/ifpga: add HE-LPBK AFU driver

2022-06-15 Thread Xu, Rosen
Hi Wei,

> -Original Message-
> From: Huang, Wei 
> Sent: Thursday, June 09, 2022 15:37
> To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com;
> hemant.agra...@nxp.com
> Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei
> ; Zhang, Qi Z ; Huang, Wei
> 
> Subject: [PATCH v7 3/5] raw/ifpga: add HE-LPBK AFU driver
> 
> HE-LPBK and HE-MEM-LPBK are host exerciser modules in OFS FPGA, HE-
> LPBK is used to test PCI bus and HE-MEM-LPBK is used to test local memory.
> This driver initialize the modules and report test result.
> 
> Signed-off-by: Wei Huang 
> ---
> v2: move source files to ifpga and rename, refine code
> ---
>  drivers/raw/ifpga/afu_pmd_he_lpbk.c | 436
> 
>  drivers/raw/ifpga/afu_pmd_he_lpbk.h | 126 +++
>  drivers/raw/ifpga/meson.build   |   2 +-
>  drivers/raw/ifpga/rte_pmd_afu.h |  14 ++
>  4 files changed, 577 insertions(+), 1 deletion(-)  create mode 100644
> drivers/raw/ifpga/afu_pmd_he_lpbk.c
>  create mode 100644 drivers/raw/ifpga/afu_pmd_he_lpbk.h
> 
> diff --git a/drivers/raw/ifpga/afu_pmd_he_lpbk.c
> b/drivers/raw/ifpga/afu_pmd_he_lpbk.c
> new file mode 100644
> index 000..8b2c85b
> --- /dev/null
> +++ b/drivers/raw/ifpga/afu_pmd_he_lpbk.c
> @@ -0,0 +1,436 @@
> +/* SPDX-License-Identifier: BSD-3-Clause
> + * Copyright(c) 2022 Intel Corporation
> + */
> +
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +
> +#include "afu_pmd_core.h"
> +#include "afu_pmd_he_lpbk.h"
> +
> +static int he_lpbk_afu_config(struct afu_rawdev *dev) {
> + struct he_lpbk_priv *priv = NULL;
> + struct rte_pmd_afu_he_lpbk_cfg *cfg = NULL;
> + struct he_lpbk_csr_cfg v;
> +
> + if (!dev)
> + return -EINVAL;
> +
> + priv = (struct he_lpbk_priv *)dev->priv;
> + if (!priv)
> + return -ENOENT;
> +
> + cfg = &priv->he_lpbk_cfg;
> +
> + v.csr = 0;
> +
> + if (cfg->cont)
> + v.cont = 1;
> +
> + v.mode = cfg->mode;
> + v.trput_interleave = cfg->trput_interleave;
> + if (cfg->multi_cl == 4)
> + v.multicl_len = 2;
> + else
> + v.multicl_len = cfg->multi_cl - 1;
> +
> + IFPGA_RAWDEV_PMD_DEBUG("cfg: 0x%08x", v.csr);
> + rte_write32(v.csr, priv->he_lpbk_ctx.addr + CSR_CFG);
> +
> + return 0;
> +}
> +
> +static void he_lpbk_report(struct afu_rawdev *dev, uint32_t cl) {
> + struct he_lpbk_priv *priv = NULL;
> + struct rte_pmd_afu_he_lpbk_cfg *cfg = NULL;
> + struct he_lpbk_ctx *ctx = NULL;
> + struct he_lpbk_dsm_status *stat = NULL;
> + struct he_lpbk_status0 stat0;
> + struct he_lpbk_status1 stat1;
> + uint64_t swtest_msg = 0;
> + uint64_t ticks = 0;
> + uint64_t info = 0;
> + double num, rd_bw, wr_bw;
> +
> + if (!dev || !dev->priv)
> + return;
> +
> + priv = (struct he_lpbk_priv *)dev->priv;
> + cfg = &priv->he_lpbk_cfg;
> + ctx = &priv->he_lpbk_ctx;
> +
> + stat = ctx->status_ptr;
> +
> + swtest_msg = rte_read64(ctx->addr + CSR_SWTEST_MSG);
> + stat0.csr = rte_read64(ctx->addr + CSR_STATUS0);
> + stat1.csr = rte_read64(ctx->addr + CSR_STATUS1);
> +
> + if (cfg->cont)
> + ticks = stat->num_clocks - stat->start_overhead;
> + else
> + ticks = stat->num_clocks -
> + (stat->start_overhead + stat->end_overhead);
> +
> + if (cfg->freq_mhz == 0) {
> + info = rte_read64(ctx->addr + CSR_HE_INFO0);
> + IFPGA_RAWDEV_PMD_INFO("API version: %"PRIx64, info >>
> 16);
> + cfg->freq_mhz = info & 0x;
> + if (cfg->freq_mhz == 0) {
> + IFPGA_RAWDEV_PMD_INFO("Frequency of AFU
> clock is unknown."
> + " Assuming 350 MHz.");
> + cfg->freq_mhz = 350;
> + }
> + }
> +
> + num = (double)stat0.num_reads;
> + rd_bw = (num * CLS_TO_SIZE(1) * MHZ(cfg->freq_mhz)) / ticks;
> + num = (double)stat0.num_writes;
> + wr_bw = (num * CLS_TO_SIZE(1) * MHZ(cfg->freq_mhz)) / ticks;
> +
> + printf("Cachelines  Read_Count Write_Count Pend_Read
> Pend_Write "
> + "Clocks@%uMHz   Rd_Bandwidth   Wr_Bandwidth\n",
> + cfg->freq_mhz);
> + printf("%10u  %10u %10u %10u %10u  %12"PRIu64
> + "   %7.3f GB/s   %7.3f GB/s\n",
> + cl, stat0.num_reads, stat0.num_writes,
> + stat1.num_pend_reads, stat1.num_pend_writes,
> + ticks, rd_bw / 1e9, wr_bw / 1e9);
> + printf("Test Message: 0x%"PRIx64"\n", swtest_msg); }
> +
> +static int he_lpbk_test(struct afu_rawdev *dev) {
> + struct he_lpbk_priv *priv = NULL;
> + struct rte_pmd_afu_he_lpbk_cfg *cfg = NULL;
> + struct he_lpbk_ctx *ctx = NULL;
> + struct he_lpbk_csr_ctl ctl;
> + uin

RE: [PATCH v7 4/5] raw/ifpga: add HE-MEM AFU driver

2022-06-15 Thread Xu, Rosen
Hi Wei,


> -Original Message-
> From: Huang, Wei 
> Sent: Thursday, June 09, 2022 15:37
> To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com;
> hemant.agra...@nxp.com
> Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei
> ; Zhang, Qi Z ; Huang, Wei
> 
> Subject: [PATCH v7 4/5] raw/ifpga: add HE-MEM AFU driver
> 
> HE-MEM is one of the host exerciser modules in OFS FPGA, which is used to
> test local memory with built-in traffic generator.
> This driver initialize the module and report test result.
> 
> Signed-off-by: Wei Huang 
> Acked-by: Tianfei Zhang 
> ---
> v2: move source files to ifpga and rename
> ---
>  drivers/raw/ifpga/afu_pmd_he_mem.c | 183
> +
>  drivers/raw/ifpga/afu_pmd_he_mem.h |  46 ++
>  drivers/raw/ifpga/meson.build  |   2 +-
>  drivers/raw/ifpga/rte_pmd_afu.h|   7 ++
>  4 files changed, 237 insertions(+), 1 deletion(-)  create mode 100644
> drivers/raw/ifpga/afu_pmd_he_mem.c
>  create mode 100644 drivers/raw/ifpga/afu_pmd_he_mem.h
> 
> diff --git a/drivers/raw/ifpga/afu_pmd_he_mem.c
> b/drivers/raw/ifpga/afu_pmd_he_mem.c
> new file mode 100644
> index 000..0f57a03
> --- /dev/null
> +++ b/drivers/raw/ifpga/afu_pmd_he_mem.c
> @@ -0,0 +1,183 @@
> +/* SPDX-License-Identifier: BSD-3-Clause
> + * Copyright(c) 2022 Intel Corporation
> + */
> +
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +
> +#include "afu_pmd_core.h"
> +#include "afu_pmd_he_mem.h"
> +
> +static int he_mem_tg_test(struct afu_rawdev *dev) {
> + struct he_mem_tg_priv *priv = NULL;
> + struct rte_pmd_afu_he_mem_tg_cfg *cfg = NULL;
> + struct he_mem_tg_ctx *ctx = NULL;
> + uint64_t value = 0x12345678;
> + uint64_t cap = 0;
> + uint64_t channel_mask = 0;
> + int i, t = 0;
> +
> + if (!dev)
> + return -EINVAL;
> +
> + priv = (struct he_mem_tg_priv *)dev->priv;
> + if (!priv)
> + return -ENOENT;
> +
> + cfg = &priv->he_mem_tg_cfg;
> + ctx = &priv->he_mem_tg_ctx;
> +
> + IFPGA_RAWDEV_PMD_DEBUG("Channel mask: 0x%x", cfg-
> >channel_mask);
> +
> + rte_write64(value, ctx->addr + MEM_TG_SCRATCHPAD);
> + cap = rte_read64(ctx->addr + MEM_TG_SCRATCHPAD);
> + IFPGA_RAWDEV_PMD_DEBUG("Scratchpad value: 0x%"PRIx64, cap);
> + if (cap != value) {
> + IFPGA_RAWDEV_PMD_ERR("Test scratchpad register failed");
> + return -EIO;
> + }
> +
> + cap = rte_read64(ctx->addr + MEM_TG_CTRL);
> + IFPGA_RAWDEV_PMD_DEBUG("Capability: 0x%"PRIx64, cap);
> +
> + channel_mask = cfg->channel_mask & cap;
> + /* start traffic generators */
> + rte_write64(channel_mask, ctx->addr + MEM_TG_CTRL);
> +
> + /* check test status */
> + while (t < MEM_TG_TIMEOUT_MS) {
> + value = rte_read64(ctx->addr + MEM_TG_STAT);
> + for (i = 0; i < NUM_MEM_TG_CHANNELS; i++) {
> + if (channel_mask & (1 << i)) {
> + if (TGACTIVE(value, i))
> + continue;
> + printf("TG channel %d test %s\n", i,
> + TGPASS(value, i) ? "pass" :
> + TGTIMEOUT(value, i) ? "timeout" :
> + TGFAIL(value, i) ? "fail" : "error");
> + channel_mask &= ~(1 << i);
> + }
> + }
> + if (!channel_mask)
> + break;
> + rte_delay_ms(MEM_TG_POLL_INTERVAL_MS);
> + t += MEM_TG_POLL_INTERVAL_MS;
> + }
> +
> + if (channel_mask) {
> + IFPGA_RAWDEV_PMD_ERR("Timeout 0x%04lx", (unsigned
> long)value);
> + return channel_mask;
> + }
> +
> + return 0;
> +}
> +
> +static int he_mem_tg_init(struct afu_rawdev *dev) {
> + struct he_mem_tg_priv *priv = NULL;
> + struct he_mem_tg_ctx *ctx = NULL;
> +
> + if (!dev)
> + return -EINVAL;
> +
> + priv = (struct he_mem_tg_priv *)dev->priv;
> + if (!priv) {
> + priv = rte_zmalloc(NULL, sizeof(struct he_mem_tg_priv), 0);
> + if (!priv)
> + return -ENOMEM;
> + dev->priv = priv;
> + }
> +
> + ctx = &priv->he_mem_tg_ctx;
> + ctx->addr = (uint8_t *)dev->addr;
> +
> + return 0;
> +}
> +
> +static int he_mem_tg_config(struct afu_rawdev *dev, void *config,
> + size_t config_size)
> +{
> + struct he_mem_tg_priv *priv = NULL;
> +
> + if (!dev || !config || !config_size)
> + return -EINVAL;
> +
> + priv = (struct he_mem_tg_priv *)dev->priv;
> + if (!priv)
> + return -ENOENT;
> +
> + if (config_size != sizeof(struct rte_pmd_afu_he_mem_tg_cfg))
> + return -EINVAL;
> +
>

[PATCH v2] vdpa/ifc: fix null pointer dereference

2022-06-15 Thread Andy Pei
Fix null pointer dereference reported in coverity scan.

Coverity issue: 378882
Fixes: 5d75517beffe ("vdpa/ifc/base: access correct register for blk device")

Signed-off-by: Andy Pei 
Acked-by: Xiao Wang 
---
 drivers/vdpa/ifc/base/ifcvf.c | 9 +
 1 file changed, 9 insertions(+)

diff --git a/drivers/vdpa/ifc/base/ifcvf.c b/drivers/vdpa/ifc/base/ifcvf.c
index dd475a7..0a9f71a 100644
--- a/drivers/vdpa/ifc/base/ifcvf.c
+++ b/drivers/vdpa/ifc/base/ifcvf.c
@@ -255,6 +255,10 @@
u32 ring_state;
 
cfg = hw->common_cfg;
+   if (!cfg) {
+   DEBUGOUT("common_cfg in HW is NULL.\n");
+   return;
+   }
 
IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg->msix_config);
for (i = 0; i < hw->nr_vring; i++) {
@@ -262,6 +266,11 @@
IFCVF_WRITE_REG16(0, &cfg->queue_enable);
IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg->queue_msix_vector);
 
+   if (!hw->lm_cfg) {
+   DEBUGOUT("live migration cfg in HW is NULL.\n");
+   continue;
+   }
+
if (hw->device_type == IFCVF_BLK)
ring_state = *(u32 *)(hw->lm_cfg +
IFCVF_LM_RING_STATE_OFFSET +
-- 
1.8.3.1



Re: [PATCH] config: remove explicit undef of unset values

2022-06-15 Thread David Marchand
On Thu, Dec 16, 2021 at 12:15 PM Bruce Richardson
 wrote:
>
> Rather than explicitly clearing any setting of undefined values in our
> rte_config.h file, it's better to instead just add a comment that the
> value is not set. Using a comment allows the user to set the value using
> CFLAGS or similar mechanism without the config file clearing the value
> again.
>
> The text used " is not set" is modelled after the kernel approach
> of doing the same thing.
>
> Signed-off-by: Bruce Richardson 
Acked-by: Morten Brørup 
Acked-by: Tyler Retzlaff 

Rebased, added check as agreed, and applied.
Thanks Bruce.


-- 
David Marchand



RE: [PATCH v7 5/5] raw/ifpga: add HE-HSSI AFU driver

2022-06-15 Thread Xu, Rosen
Hi Wei,

> -Original Message-
> From: Huang, Wei 
> Sent: Thursday, June 09, 2022 15:37
> To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com;
> hemant.agra...@nxp.com
> Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei
> ; Zhang, Qi Z ; Huang, Wei
> 
> Subject: [PATCH v7 5/5] raw/ifpga: add HE-HSSI AFU driver
> 
> HE-HSSI is one of the host exerciser modules in OFS FPGA, which is used to
> test HSSI (High Speed Serial Interface).
> This driver initialize the module and report test result.
> 
> Signed-off-by: Wei Huang 
> ---
> v2: move source files to ifpga and rename, refine code
> ---
>  drivers/raw/ifpga/afu_pmd_he_hssi.c | 371
> 
>  drivers/raw/ifpga/afu_pmd_he_hssi.h | 109 +++
>  drivers/raw/ifpga/meson.build   |   3 +-
>  drivers/raw/ifpga/rte_pmd_afu.h |  18 ++
>  4 files changed, 500 insertions(+), 1 deletion(-)  create mode 100644
> drivers/raw/ifpga/afu_pmd_he_hssi.c
>  create mode 100644 drivers/raw/ifpga/afu_pmd_he_hssi.h
> 
> diff --git a/drivers/raw/ifpga/afu_pmd_he_hssi.c
> b/drivers/raw/ifpga/afu_pmd_he_hssi.c
> new file mode 100644
> index 000..102de50
> --- /dev/null
> +++ b/drivers/raw/ifpga/afu_pmd_he_hssi.c
> @@ -0,0 +1,371 @@
> +/* SPDX-License-Identifier: BSD-3-Clause
> + * Copyright(c) 2022 Intel Corporation
> + */
> +
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +#include 
> +
> +#include "afu_pmd_core.h"
> +#include "afu_pmd_he_hssi.h"
> +
> +static int he_hssi_indirect_write(struct he_hssi_ctx *ctx, uint32_t addr,
> + uint32_t value)
> +{
> + struct traffic_ctrl_cmd cmd;
> + struct traffic_ctrl_data data;
> + uint32_t i = 0;
> +
> + IFPGA_RAWDEV_PMD_DEBUG("Indirect write 0x%x, value 0x%08x",
> addr,
> +value);
> +
> + if (!ctx)
> + return -EINVAL;
> +
> + data.write_data = value;
> + rte_write64(data.csr, ctx->addr + TRAFFIC_CTRL_DATA);
> +
> + cmd.csr = 0;
> + cmd.write_cmd = 1;
> + cmd.afu_cmd_addr = addr;
> + rte_write64(cmd.csr, ctx->addr + TRAFFIC_CTRL_CMD);
> +
> + while (i < MAILBOX_TIMEOUT_MS) {
> + rte_delay_ms(MAILBOX_POLL_INTERVAL_MS);
> + cmd.csr = rte_read64(ctx->addr + TRAFFIC_CTRL_CMD);
> + if (cmd.ack_trans)
> + break;
> + i += MAILBOX_POLL_INTERVAL_MS;
> + }
> + if (i >= MAILBOX_TIMEOUT_MS)
> + return -ETIMEDOUT;
> +
> + i = 0;
> + cmd.csr = 0;
> + while (i < MAILBOX_TIMEOUT_MS) {
> + cmd.ack_trans = 1;
> + rte_write64(cmd.csr, ctx->addr + TRAFFIC_CTRL_CMD);
> + rte_delay_ms(MAILBOX_POLL_INTERVAL_MS);
> + cmd.csr = rte_read64(ctx->addr + TRAFFIC_CTRL_CMD);
> + if (!cmd.ack_trans)
> + break;
> + i += MAILBOX_POLL_INTERVAL_MS;
> + }
> + if (i >= MAILBOX_TIMEOUT_MS)
> + return -ETIMEDOUT;
> +
> + return 0;
> +}
> +
> +static int he_hssi_indirect_read(struct he_hssi_ctx *ctx, uint32_t addr,
> + uint32_t *value)
> +{
> + struct traffic_ctrl_cmd cmd;
> + struct traffic_ctrl_data data;
> + uint32_t i = 0;
> +
> + if (!ctx)
> + return -EINVAL;
> +
> + cmd.csr = 0;
> + cmd.read_cmd = 1;
> + cmd.afu_cmd_addr = addr;
> + rte_write64(cmd.csr, ctx->addr + TRAFFIC_CTRL_CMD);
> +
> + while (i < MAILBOX_TIMEOUT_MS) {
> + rte_delay_ms(MAILBOX_POLL_INTERVAL_MS);
> + cmd.csr = rte_read64(ctx->addr + TRAFFIC_CTRL_CMD);
> + if (cmd.ack_trans) {
> + data.csr = rte_read64(ctx->addr +
> TRAFFIC_CTRL_DATA);
> + *value = data.read_data;
> + break;
> + }
> + i += MAILBOX_POLL_INTERVAL_MS;
> + }
> + if (i >= MAILBOX_TIMEOUT_MS)
> + return -ETIMEDOUT;
> +
> + i = 0;
> + cmd.csr = 0;
> + while (i < MAILBOX_TIMEOUT_MS) {
> + cmd.ack_trans = 1;
> + rte_write64(cmd.csr, ctx->addr + TRAFFIC_CTRL_CMD);
> + rte_delay_ms(MAILBOX_POLL_INTERVAL_MS);
> + cmd.csr = rte_read64(ctx->addr + TRAFFIC_CTRL_CMD);
> + if (!cmd.ack_trans)
> + break;
> + i += MAILBOX_POLL_INTERVAL_MS;
> + }
> + if (i >= MAILBOX_TIMEOUT_MS)
> + return -ETIMEDOUT;
> +
> + IFPGA_RAWDEV_PMD_DEBUG("Indirect read 0x%x, value 0x%08x",
> addr, *value);
> + return 0;
> +}
> +
> +static void he_hssi_report(struct he_hssi_ctx *ctx) {
> + uint32_t val = 0;
> + uint64_t v64 = 0;
> + int ret = 0;
> +
> + ret = he_hssi_indirect_read(ctx, TM_PKT_GOOD, &val);
> + if (ret)
> + return;
> + printf("Number of good packets received: %u\n", val);
> +
> + ret = he_hssi_indir

[Bug 1035] __rte_raw_cksum() crash with misaligned pointer

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=1035

Bug ID: 1035
   Summary: __rte_raw_cksum() crash with misaligned pointer
   Product: DPDK
   Version: 21.11
  Hardware: All
OS: All
Status: UNCONFIRMED
  Severity: normal
  Priority: Normal
 Component: ethdev
  Assignee: dev@dpdk.org
  Reporter: emil.b...@ericsson.com
  Target Milestone: ---

See rte_raw_cksum() in rte_ip.h, which is part of the public API. See also the
subfunction __rte_raw_cksum().

_rte_raw_cksum assumes that the buffer over which the checksum is calculated is
an even address (divisible by two). See for example this stack overflow post:
https://stackoverflow.com/questions/46790550/c-undefined-behavior-strict-aliasing-rule-or-incorrect-alignment

The post explains that there is undefined behavior in C11 when "conversion
between two pointer types produces a result that is incorrectly aligned". When
the buf argument starts on an odd address we thus have undefined behavior,
since a pointer is cast from void* to uint16_t*.

In most cases (at least on x86) that isn't a problem, but with higher
optimization levels it may break due to vector instructions. This new function
seems to be easier to optimize by the compiler, resulting in a crash when the
buf argument is odd. Please note that the undefined behavior is present in
earlier versions of dpdk as well.

Now you're probably thinking: "Just align your buffers". The problem is that we
have a packet buffer which is aligned. The checksum is calculated on a subset
of that aligned packet buffer, and that sometimes lies on odd addresses.

The question remains if this is an issue with dpdk or not. Maybe you do the
assumption that odd addresses are never passed in when calculating the
checksum, and that's all right. But perhaps a public API comment should be
added in that case. Or perhaps you use it incorrectly as well, and in that case
something should be done about it.

Triggering this bug seems to require high optimization levels as well as odd
values of buf.

Of interest may be that in older version of dpdk you added the a comment: /*
workaround gcc strict-aliasing warning */ showing that you willfully ignored
the warning.

-- 
You are receiving this mail because:
You are the assignee for the bug.

RE: [PATCH v1] raw/ifpga: free file handle before function return

2022-06-15 Thread Xu, Rosen
Hi Wei,

> -Original Message-
> From: Huang, Wei 
> Sent: Thursday, June 09, 2022 16:50
> To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com;
> hemant.agra...@nxp.com
> Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei
> ; Zhang, Qi Z ; Huang, Wei
> 
> Subject: [PATCH v1] raw/ifpga: free file handle before function return
> 
> Coverity issue: 379064
> Fixes: 673c897f4d73 ("raw/ifpga: support OFS card probing")
> 
> Signed-off-by: Wei Huang 
> ---
>  drivers/raw/ifpga/base/ifpga_enumerate.c | 28 +++
> -
>  1 file changed, 19 insertions(+), 9 deletions(-)
> 
> diff --git a/drivers/raw/ifpga/base/ifpga_enumerate.c
> b/drivers/raw/ifpga/base/ifpga_enumerate.c
> index 7a5d264..61eb660 100644
> --- a/drivers/raw/ifpga/base/ifpga_enumerate.c
> +++ b/drivers/raw/ifpga/base/ifpga_enumerate.c
> @@ -837,8 +837,10 @@ static int find_dfls_by_vsec(struct
> dfl_fpga_enum_info *info)
>   vndr_hdr = 0;
>   ret = pread(fd, &vndr_hdr, sizeof(vndr_hdr),
>   voff + PCI_VNDR_HEADER);
> - if (ret < 0)
> - return -EIO;
> + if (ret < 0) {
> + ret = -EIO;
> + goto free_handle;
> + }
>   if (PCI_VNDR_HEADER_ID(vndr_hdr) ==
> PCI_VSEC_ID_INTEL_DFLS &&
>   pci_data->vendor_id == PCI_VENDOR_ID_INTEL)
>   break;
> @@ -846,19 +848,23 @@ static int find_dfls_by_vsec(struct
> dfl_fpga_enum_info *info)
> 
>   if (!voff) {
>   dev_debug(hw, "%s no DFL VSEC found\n", __func__);
> - return -ENODEV;
> + ret = -ENODEV;
> + goto free_handle;
>   }
> 
>   dfl_cnt = 0;
>   ret = pread(fd, &dfl_cnt, sizeof(dfl_cnt), voff +
> PCI_VNDR_DFLS_CNT);
> - if (ret < 0)
> - return -EIO;
> + if (ret < 0) {
> + ret = -EIO;
> + goto free_handle;
> + }
> 
>   dfl_res_off = voff + PCI_VNDR_DFLS_RES;
>   if (dfl_res_off + (dfl_cnt * sizeof(u32)) > PCI_CFG_SPACE_EXP_SIZE)
> {
>   dev_err(hw, "%s DFL VSEC too big for PCIe config space\n",
>   __func__);
> - return -EINVAL;
> + ret = -EINVAL;
> + goto free_handle;
>   }
> 
>   for (i = 0; i < dfl_cnt; i++, dfl_res_off += sizeof(u32)) { @@ -868,7
> +874,8 @@ static int find_dfls_by_vsec(struct dfl_fpga_enum_info *info)
>   if (bir >= PCI_MAX_RESOURCE) {
>   dev_err(hw, "%s bad bir number %d\n",
>   __func__, bir);
> - return -EINVAL;
> + ret = -EINVAL;
> + goto free_handle;
>   }
> 
>   len = pci_data->region[bir].len;
> @@ -876,7 +883,8 @@ static int find_dfls_by_vsec(struct
> dfl_fpga_enum_info *info)
>   if (offset >= len) {
>   dev_err(hw, "%s bad offset %u >= %"PRIu64"\n",
>   __func__, offset, len);
> - return -EINVAL;
> + ret = -EINVAL;
> + goto free_handle;
>   }
> 
>   dev_debug(hw, "%s BAR %d offset 0x%x\n", __func__, bir,
> offset); @@ -886,7 +894,9 @@ static int find_dfls_by_vsec(struct
> dfl_fpga_enum_info *info)
>   dfl_fpga_enum_info_add_dfl(info, start, len, addr);
>   }
> 
> - return 0;
> +free_handle:
> + close(fd);
> + return ret;
>  }
> 
>  /* default method of finding dfls starting at offset 0 of bar 0 */
> --
> 1.8.3.1

It looks good for me.
Reviewed-by: Rosen Xu 


Re: [PATCH 0/3] Fix xmm_t to rte_xmm_t scalar conversion

2022-06-15 Thread David Marchand
On Thu, Jun 9, 2022 at 2:17 PM Stanislaw Kardach  wrote:
>
> As David noticed in [1] there is an issue with C++ compilation of the
> rte_vect.h header in RISC-V. Upon closer inspection, the problem appears on
> all architectures due to the type conversion rules in C++.
> More precisely a union type rte_xmm_t requires a conversion constructor
> from xmm_t type.
> The most obvious fix is to use a structure initializer for such copies
> (since rte_xmm_t union contains xmm_t anyway). The generated assembly
> at -O2 is exactly the same, so there's no real impact.
>
> The bigger question is whether accessing bits of the architecture specific
> xmm_t type in an array fashion is always correct? All current architectures
> define rte_xmm_t in the same manner implying that.

Copying other arch maintainers.

>
> Additionally change RISC-V CI settings to use crossbuild-essential-riscv64
> package which provides tools that enable C++ checks.
>
> [1] http://mails.dpdk.org/archives/dev/2022-June/243683.html
>
> Stanislaw Kardach (3):
>   eal/riscv: fix xmm_t casting for C++
>   lpm: fix xmm_t casting for C++ in scalar version
>   ci: use crossbuild-essential-riscv64 for compiling

In any case, this series looks good.
Series applied, thanks Stanislaw.


-- 
David Marchand



[PATCH] net/igc: fix segmentation fault in secondary dpdk-symmetric_mp

2022-06-15 Thread zhichaox . zeng
From: Zhichao Zeng 

In the secondary dpdk-symmetric_mp process, the
"smp_port_init" was skipped, which cause some function
pointers not to be initialized, and a segmentation fault
occurred when calling these function pointers.

This patch assigns initial values to rx_pkt_burst,
tx_pkt_burst and tx_pkt_prepare pointers to avoid
calling null function pointer.

Signed-off-by: Zhichao Zeng 
---
 drivers/net/igc/igc_ethdev.c | 3 +++
 drivers/net/igc/igc_ethdev.h | 5 +
 drivers/net/igc/igc_txrx.c   | 6 +++---
 3 files changed, 11 insertions(+), 3 deletions(-)

diff --git a/drivers/net/igc/igc_ethdev.c b/drivers/net/igc/igc_ethdev.c
index b9933b395d..dd4a028ea1 100644
--- a/drivers/net/igc/igc_ethdev.c
+++ b/drivers/net/igc/igc_ethdev.c
@@ -1234,6 +1234,9 @@ eth_igc_dev_init(struct rte_eth_dev *dev)
dev->rx_queue_count = eth_igc_rx_queue_count;
dev->rx_descriptor_status = eth_igc_rx_descriptor_status;
dev->tx_descriptor_status = eth_igc_tx_descriptor_status;
+   dev->rx_pkt_burst = &igc_recv_pkts;
+   dev->tx_pkt_burst = &igc_xmit_pkts;
+   dev->tx_pkt_prepare = ð_igc_prep_pkts;
 
/*
 * for secondary processes, we don't initialize any further as primary
diff --git a/drivers/net/igc/igc_ethdev.h b/drivers/net/igc/igc_ethdev.h
index f56cad79e9..2fa3d51bba 100644
--- a/drivers/net/igc/igc_ethdev.h
+++ b/drivers/net/igc/igc_ethdev.h
@@ -263,6 +263,11 @@ struct igc_adapter {
 #define IGC_DEV_PRIVATE_FLOW_LIST(_dev) \
(&((struct igc_adapter *)(_dev)->data->dev_private)->flow_list)
 
+uint16_t igc_recv_pkts(void *rx_queue, struct rte_mbuf **rx_pkts, uint16_t 
nb_pkts);
+uint16_t igc_xmit_pkts(void *tx_queue, struct rte_mbuf **tx_pkts, uint16_t 
nb_pkts);
+uint16_t eth_igc_prep_pkts(__rte_unused void *tx_queue, struct rte_mbuf 
**tx_pkts,
+   uint16_t 
nb_pkts);
+
 static inline void
 igc_read_reg_check_set_bits(struct igc_hw *hw, uint32_t reg, uint32_t bits)
 {
diff --git a/drivers/net/igc/igc_txrx.c b/drivers/net/igc/igc_txrx.c
index e48d5df11a..753ac8a411 100644
--- a/drivers/net/igc/igc_txrx.c
+++ b/drivers/net/igc/igc_txrx.c
@@ -345,7 +345,7 @@ rx_desc_get_pkt_info(struct igc_rx_queue *rxq, struct 
rte_mbuf *rxm,
rxm->packet_type = rx_desc_pkt_info_to_pkt_type(pkt_info);
 }
 
-static uint16_t
+uint16_t
 igc_recv_pkts(void *rx_queue, struct rte_mbuf **rx_pkts, uint16_t nb_pkts)
 {
struct igc_rx_queue * const rxq = rx_queue;
@@ -1397,7 +1397,7 @@ eth_igc_rx_queue_setup(struct rte_eth_dev *dev,
 }
 
 /* prepare packets for transmit */
-static uint16_t
+uint16_t
 eth_igc_prep_pkts(__rte_unused void *tx_queue, struct rte_mbuf **tx_pkts,
uint16_t nb_pkts)
 {
@@ -1604,7 +1604,7 @@ tx_desc_cksum_flags_to_olinfo(uint64_t ol_flags)
return tmp;
 }
 
-static uint16_t
+uint16_t
 igc_xmit_pkts(void *tx_queue, struct rte_mbuf **tx_pkts, uint16_t nb_pkts)
 {
struct igc_tx_queue * const txq = tx_queue;
-- 
2.25.1



RE: [PATCH V2] vdpa/mlx5: return correct error code after rte_intr_instance_alloc failed

2022-06-15 Thread Matan Azrad


Hi Shiqi

From: 835703...@qq.com <835703...@qq.com>
> From: Shiqi Liu <835703...@qq.com>
> 
> To Maxime,
> I am so sorry too late to reply your comment.My name is Shiqi Liu.
> I got stuck in how to reply the comment before.Now, it seenms to be
> ok!Maybe there will be some mistakes I have taken , when I reply,please tell
> me how to do it correctly!
> Thanks,
> Shiqi Liu
> 
> diff --git a/drivers/vdpa/mlx5/mlx5_vdpa_event.c
> b/drivers/vdpa/mlx5/mlx5_vdpa_event.c
> index 7167a98db0..6223afaae8 100644
> --- a/drivers/vdpa/mlx5/mlx5_vdpa_event.c
> +++ b/drivers/vdpa/mlx5/mlx5_vdpa_event.c
> @@ -395,6 +395,7 @@ mlx5_vdpa_err_event_setup(struct mlx5_vdpa_priv
> *priv)
> rte_intr_instance_alloc(RTE_INTR_INSTANCE_F_SHARED);
> if (priv->err_intr_handle == NULL) {
> DRV_LOG(ERR, "Fail to allocate intr_handle");
> +   rte_errno = ENOMEM;

Shouldn't rte_intr_instance_alloc set it?

> goto error;
> }
> if (rte_intr_fd_set(priv->err_intr_handle, priv->err_chnl->fd))
> --


RE: [PATCH v7] app/testpmd: add Host Shaper command

2022-06-15 Thread Matan Azrad



From: Spike Du
> Add command line options to support host shaper configure.
> - Command syntax:
>   mlx5 set port  host_shaper avail_thresh_triggered <0|1> rate
> 
> 
> - Example commands:
> To enable avail_thresh_triggered on port 1 and disable current host
> shaper:
> testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0
> 
> To disable avail_thresh_triggered and current host shaper on port 1:
> testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0
> 
> The rate unit is 100Mbps.
> To disable avail_thresh_triggered and configure a shaper of 5Gbps on port 1:
> testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50
> 
> Add sample code to handle rxq available descriptor threshold event, it delays
> a while so that rxq empties, then disables host shaper and rearms available
> descriptor threshold event.
> 
> Signed-off-by: Spike Du 
Acked-by: Matan Azrad 


DPDK Compilation consuming too much memory

2022-06-15 Thread Raslan Darawsheh
Hi,

I've been facing an issue while building DPDK (mostly noticeable when building 
cross compilation on arm64) as follows:

meson --werror --buildtype=debugoptimized --cross-file 
config/arm/arm64_armv8_linux_gcc -Dexamples=all /tmp/build-aarch64-gcc;
ninja -C /tmp/build-aarch64-gcc/
I can see the memory footprint for it is around 52G which basically took all my 
system memory.

It happens when linking CNXK drivers.

Kindest regards,
Raslan Darawsheh



Re: [PATCH v2] net/mlx5: fix build with clang 14

2022-06-15 Thread Thomas Monjalon
11/05/2022 18:41, Ali Alnubani:
> Use fgets instead of fscanf to resolve the following warning
> reported by clang 14.0.0 in Fedora 37 (Rawhide):
> 
> drivers/net/mlx5/linux/mlx5_ethdev_os.c:1137:52: error:
>   'fscanf' may overflow; destination buffer in argument 3 has size 16,
>   but the corresponding specifier may require size 17
>   [-Werror,-Wfortify-source]
>   ret = fscanf(file, "%" RTE_STR(IF_NAMESIZE) "s", port_name);

Some other patches are proposing to declare the variable
of size IF_NAMESIZE+1 but I think it's wrong because
IF_NAMESIZE includes a terminating null byte.

> Fixes: 63d1db710fbc ("net/mlx5: fix unlimited parsing of switch info")
> Cc: michae...@mellanox.com
> Cc: sta...@dpdk.org
> 
> Signed-off-by: Ali Alnubani 
[...]
> - ret = fscanf(file, "%" RTE_STR(IF_NAMESIZE) "s", port_name);
> - fclose(file);
> - if (ret == 1)
> + if (fgets(port_name, IF_NAMESIZE, file) != NULL)
>   mlx5_translate_port_name(port_name, &data);
> + fclose(file);

Acked-by: Thomas Monjalon 




event_eth_rx_intr_adapter_autotest failing

2022-06-15 Thread Kundapura, Ganapati
Hi dpdk-dev,

  Test command: DPDK_TEST=event_eth_rx_intr_adapter_autotest ./dpdk-test

  event_eth_rx_intr_adapter_autotest is failing while adding interrupt queue to 
adapter instance.
It's failing in rte_eth_dev_rx_intr_ctl_q(), at dev->intr_handle == NULL check

5248 if (!dev->intr_handle) {
5249 RTE_ETHDEV_LOG(ERR, "Rx Intr handle unset\n");
5250 return -ENOTSUP;
5251 }

"net_null" PMD is used for the test. Does "net-null" supports interrupt queue?

I'm able to run event_eth_rx_adapter_autotest with 
'DPDK_TEST=event_eth_rx_adapter_autotest ./dpdk-test'

Could you please let me know how to run event_eth_rx_intr_adapter_autotest?

Thanks,
Ganapati


RE: [PATCH v2] net/mlx5: fix build with clang 14

2022-06-15 Thread Slava Ovsiienko
> -Original Message-
> From: Ali Alnubani 
> Sent: Wednesday, May 11, 2022 19:41
> To: dev@dpdk.org
> Cc: Michael Baum ; sta...@dpdk.org
> Subject: [PATCH v2] net/mlx5: fix build with clang 14
> 
> Use fgets instead of fscanf to resolve the following warning reported by clang
> 14.0.0 in Fedora 37 (Rawhide):
> 
> drivers/net/mlx5/linux/mlx5_ethdev_os.c:1137:52: error:
>   'fscanf' may overflow; destination buffer in argument 3 has size 16,
>   but the corresponding specifier may require size 17
>   [-Werror,-Wfortify-source]
>   ret = fscanf(file, "%" RTE_STR(IF_NAMESIZE) "s", port_name);
> 
> Fixes: 63d1db710fbc ("net/mlx5: fix unlimited parsing of switch info")
> Cc: michae...@mellanox.com
> Cc: sta...@dpdk.org
> 
> Signed-off-by: Ali Alnubani 
Acked-by: Viacheslav Ovsiienko 



[Bug 857] gcc12 build error, lib/mbuf

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=857

David Marchand (david.march...@redhat.com) changed:

   What|Removed |Added

 CC||david.march...@redhat.com
 Resolution|--- |WORKSFORME
 Status|UNCONFIRMED |RESOLVED

--- Comment #4 from David Marchand (david.march...@redhat.com) ---
I can't reproduce with released gcc 12.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Bug 858] gcc12 build error, net/failsafe

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=858

David Marchand (david.march...@redhat.com) changed:

   What|Removed |Added

 Status|UNCONFIRMED |RESOLVED
 Resolution|--- |WORKSFORME
 CC||david.march...@redhat.com

--- Comment #1 from David Marchand (david.march...@redhat.com) ---
Same as 822, I can't reproduce with released gcc 12.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Bug 860] gcc12 build error, net/ngbe

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=860

David Marchand (david.march...@redhat.com) changed:

   What|Removed |Added

 CC||david.march...@redhat.com
 Status|UNCONFIRMED |RESOLVED
 Resolution|--- |WORKSFORME

--- Comment #2 from David Marchand (david.march...@redhat.com) ---
I can't reproduce with released gcc 12.

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Bug 1016] [dpdk-22.07*] drivers/net/ice meson build failed with gcc12.1.1 on Fedora36/64

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=1016

David Marchand (david.march...@redhat.com) changed:

   What|Removed |Added

 Status|UNCONFIRMED |RESOLVED
 CC||david.march...@redhat.com
 Resolution|--- |DUPLICATE

--- Comment #4 from David Marchand (david.march...@redhat.com) ---
Please, don't mix several issues in the same report.
I'll focus on the initial report, about net/ice which is a duplicate of bug
852.

*** This bug has been marked as a duplicate of bug 852 ***

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Bug 849] gcc12 build error, net/ena

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=849

David Marchand (david.march...@redhat.com) changed:

   What|Removed |Added

 Status|UNCONFIRMED |RESOLVED
 CC||david.march...@redhat.com
 Resolution|--- |FIXED

--- Comment #1 from David Marchand (david.march...@redhat.com) ---
Fixed in https://git.dpdk.org/dpdk/commit/?id=244994958466

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Bug 855] gcc12 build error, vdpa/ifc

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=855

David Marchand (david.march...@redhat.com) changed:

   What|Removed |Added

 CC||david.march...@redhat.com
 Status|UNCONFIRMED |RESOLVED
 Resolution|--- |FIXED

--- Comment #2 from David Marchand (david.march...@redhat.com) ---
Fixed in https://git.dpdk.org/dpdk/commit/?id=2a213b794fdd

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Bug 856] gcc12 build error, app/test-flow-perf

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=856

David Marchand (david.march...@redhat.com) changed:

   What|Removed |Added

 CC||david.march...@redhat.com
 Status|UNCONFIRMED |RESOLVED
 Resolution|--- |FIXED

--- Comment #2 from David Marchand (david.march...@redhat.com) ---
Fixed in https://git.dpdk.org/dpdk/commit/?id=5fc2eece8d45

-- 
You are receiving this mail because:
You are the assignee for the bug.

Re: [PATCH v2] net/mlx5: fix build with clang 14

2022-06-15 Thread David Marchand
On Wed, Jun 15, 2022 at 10:17 AM Thomas Monjalon  wrote:
>
> 11/05/2022 18:41, Ali Alnubani:
> > Use fgets instead of fscanf to resolve the following warning
> > reported by clang 14.0.0 in Fedora 37 (Rawhide):
> >
> > drivers/net/mlx5/linux/mlx5_ethdev_os.c:1137:52: error:
> >   'fscanf' may overflow; destination buffer in argument 3 has size 16,
> >   but the corresponding specifier may require size 17
> >   [-Werror,-Wfortify-source]
> >   ret = fscanf(file, "%" RTE_STR(IF_NAMESIZE) "s", port_name);
>
> Some other patches are proposing to declare the variable
> of size IF_NAMESIZE+1 but I think it's wrong because
> IF_NAMESIZE includes a terminating null byte.

Ack, I rejected my patch accordingly.


-- 
David Marchand



dev@dpdk.org

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=1033

David Marchand (david.march...@redhat.com) changed:

   What|Removed |Added

 CC||david.march...@redhat.com
 Status|UNCONFIRMED |RESOLVED
 Resolution|--- |DUPLICATE

--- Comment #2 from David Marchand (david.march...@redhat.com) ---
Already reported earlier.

*** This bug has been marked as a duplicate of bug 856 ***

-- 
You are receiving this mail because:
You are the assignee for the bug.

[Bug 1008] Building DPDK with gcc 12.1.1 throws warnings on Fedora 36

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=1008

David Marchand (david.march...@redhat.com) changed:

   What|Removed |Added

 Resolution|--- |INVALID
 CC||david.march...@redhat.com
 Status|UNCONFIRMED |RESOLVED

--- Comment #1 from David Marchand (david.march...@redhat.com) ---
Thanks, there were several other reports for the various issues, I prefer to
close this one. Feel free to open a bugzilla for any remaining issue that has
no assigned bz.

-- 
You are receiving this mail because:
You are the assignee for the bug.

RE: [EXT] Re: [PATCH v2] devtools: ensure proper tag sequence

2022-06-15 Thread Jakub Palider
> -Original Message-
> From: Thomas Monjalon 
> Sent: Wednesday, June 15, 2022 8:21 AM
> To: Jakub Palider 
> Cc: dev@dpdk.org; david.march...@redhat.com
> Subject: [EXT] Re: [PATCH v2] devtools: ensure proper tag sequence
> 
> External Email
> 
> --
> 14/06/2022 00:21, jpali...@marvell.com:
> > +   SEQ[0] = "Coverity issue";
> > +   SEQ[1] = "Bugzilla ID";
> > +   SEQ[2] = "Fixes";
> > +   SEQ[3] = "Cc";
> > +   SEQ[4] = "^$";
> > +   SEQ[5] = "Suggested-by";
> > +   SEQ[6] = "Reported-by";
> > +   SEQ[7] = "Signed-off-by";
> > +   latest = 0;
> 
> Do you think you could check that Review, Ack and Test are added after the
> first Signed-off?

Ok, thank you for clarification. So any further Signed-offs need not to be in a 
particular 
sequence as long as the first one precedes Review/Ack/Test (these 3 in any 
order).

> 
> > +Tag order
> > +~
> > +
> > +There is a pattern indicating how certain tags should relate to each other.
> > +
> > +Example of proper tag sequence::
> > +
> > + Coverity issue:
> > + Bugzilla ID:
> > + Fixes:
> > + Cc:
> > +
> > + Suggested-by:
> > + Reported-by:
> > + Signed-off-by:
> 
> Given it is an example, you could add Reviewed-by, Acked-by and Tested-by.

Sure, I will restore it.

> > +
> > +Between first and second tag section there is and empty line.
> > +
> > +While ``Signed-off-by:`` is an obligatory tag and must exists in each
> > +commit, all other tags are optional. Any tag, as long as it is in
> > +proper location to other adjacent tags (if present), may occur multiple
> times.
> > +
> > +Other tags shall be laid out in a chronological order.
> 
> Yes, after the first Signed-off-by.

Side question: what about Sponsored-by, I noticed it appeared just recently?




Re: [PATCH 00/12] Fix compilation with gcc 12

2022-06-15 Thread David Marchand
On Wed, May 18, 2022 at 12:17 PM David Marchand
 wrote:
>
> Fedora 36 is out since early may and comes with gcc 12.
> This series fixes compilation or waives some checks.
>
> There might be something fishy with rte_memcpy on x86 but, for now,
> the rte_memcpy related fixes are on the caller side.
>
> Some "base" drivers have issues, I chose the simple solution of waiving
> the checks for them.
>
> Compilation is the only thing checked.
> Please driver maintainers, check nothing got broken.

I applied the patches that got acked and that had no objection or
comment from maintainers (i.e. patch 3, 4, 5, 6, 9, 11).
I also cleaned the mess in bugzilla where we had multiple reports of
the same issues, or stale bugs that I can't reproduce with released
gcc 12.

I'll respin separately the patches for which I have clear comments,
and drop my patches waiving the compiler checks.

We still need to agree on the best approach to handle the new checks.
We have two rfc series from Stephen, how do we move forward?


-- 
David Marchand



RE: [PATCH v7 1/5] raw/ifpga: introduce AFU driver framework

2022-06-15 Thread Huang, Wei



> -Original Message-
> From: Xu, Rosen 
> Sent: Wednesday, June 15, 2022 14:55
> To: Huang, Wei ; dev@dpdk.org;
> tho...@monjalon.net; nipun.gu...@nxp.com; hemant.agra...@nxp.com
> Cc: sta...@dpdk.org; Zhang, Tianfei ; Zhang, Qi Z
> 
> Subject: RE: [PATCH v7 1/5] raw/ifpga: introduce AFU driver framework
> 
> Hi Wei,
> 
> Some comments.
> 
> Thanks,
> Rosen
> 
> > -Original Message-
> > From: Huang, Wei 
> > Sent: Thursday, June 09, 2022 15:37
> > To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com;
> > hemant.agra...@nxp.com
> > Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei
> > ; Zhang, Qi Z ; Huang,
> > Wei 
> > Subject: [PATCH v7 1/5] raw/ifpga: introduce AFU driver framework
> >
> > AFU (Acceleration Function Unit) is part of FPGA and enumerated by
> > ifpga driver.
> > This driver implements common AFU device interfaces and exposes them
> > to application as standard raw device APIs.
> > Normally application can operate specified AFU as below, 1. call
> > rte_rawdev_pmd_get_named_dev() to find AFU device.
> > 2. call rte_rawdev_configure() to initialize AFU device.
> > 3. call rte_rawdev_selftest() to test AFU device.
> >
> > Signed-off-by: Wei Huang 
> > ---
> > v2: fix typo
> > ---
> > v3: fix build error in FreeBSD13-64, UB2004-32 and UB2204-32
> > ---
> > v4: fix coding style issue and build error in FreeBSD13-64
> > ---
> > v5: split patch into several patches
> > ---
> > v6: move source files to ifpga and rename, use spinlock
> > ---
> >  drivers/raw/ifpga/afu_pmd_core.c | 453
> > +++
> >  drivers/raw/ifpga/afu_pmd_core.h |  76 +++
> >  drivers/raw/ifpga/meson.build|   2 +-
> >  3 files changed, 530 insertions(+), 1 deletion(-)  create mode 100644
> > drivers/raw/ifpga/afu_pmd_core.c  create mode 100644
> > drivers/raw/ifpga/afu_pmd_core.h
> >
> > diff --git a/drivers/raw/ifpga/afu_pmd_core.c
> > b/drivers/raw/ifpga/afu_pmd_core.c
> > new file mode 100644
> > index 000..a24b517
> > --- /dev/null
> > +++ b/drivers/raw/ifpga/afu_pmd_core.c
> > @@ -0,0 +1,453 @@
> > +/* SPDX-License-Identifier: BSD-3-Clause
> > + * Copyright 2022 Intel Corporation
> > + *
> > + * AFU is Acceleration Function Unit in FPGA, it is enumerated by ifpga
> pmd.
> > + * Suppose AFU is found in FPGA at PCI address 31:00.0, then you can
> > +create
> > + * and test a AFU device by following steps in application.
> > + * 1. rte_vdev_init("ifpga_rawdev_cfg0", "ifpga=31:00.0,port=0")
> > + * 2. rawdev = rte_rawdev_pmd_get_named_dev("afu_0|31:00.0")
> > + * 3. rte_rawdev_configure(rawdev->dev_id, &cfg, sizeof(cfg))
> > + * 4. rte_rawdev_selftest(rawdev->dev_id)
> > + * 5. rte_vdev_uninit("ifpga_rawdev_cfg0")
> > + *
> > + * AFU device name format used in rte_rawdev_pmd_get_named_dev is
> > + * "afu_[port]|[BDF]". Please refer to OPAE documentation for the
> > +meaning of
> > + * port. Each AFU device has specific configuration data, they are
> > +defined
> > + * in rte_pmd_afu.h.
> > + *
> > + */
> > +
> Could you split these notes? General speaking we just announce license in
> head of source file.
> 
Agree
> > +#include 
> > +#include 
> > +#include 
> > +#include 
> > +#include 
> > +#include 
> > +#include 
> > +#include 
> > +#include 
> > +
> > +#include 
> > +#include 
> > +#include 
> > +#include 
> > +
> > +#include "afu_pmd_core.h"
> > +
> > +static struct rte_afu_uuid
> > afu_pmd_uuid_map[AFU_RAWDEV_MAX_DRVS+1];
> > +TAILQ_HEAD(afu_drv_list, afu_rawdev_drv); static struct afu_drv_list
> > +afu_pmd_list = TAILQ_HEAD_INITIALIZER(afu_pmd_list);
> > +
> > +static inline int afu_rawdev_trylock(struct afu_rawdev *dev) {
> > +   if (!dev || !dev->sd)
> > +   return 0;
> > +
> > +   return rte_spinlock_trylock(&dev->sd->lock);
> > +}
> > +
> > +static inline void afu_rawdev_unlock(struct afu_rawdev *dev) {
> > +   if (!dev || !dev->sd)
> > +   return;
> > +
> > +   rte_spinlock_unlock(&dev->sd->lock);
> > +}
> > +
> > +static int afu_rawdev_configure(const struct rte_rawdev *rawdev,
> > +   rte_rawdev_obj_t config, size_t config_size) {
> > +   struct afu_rawdev *dev = NULL;
> > +   int ret = 0;
> > +
> > +   IFPGA_RAWDEV_PMD_FUNC_TRACE();
> > +
> > +   dev = afu_rawdev_get_priv(rawdev);
> > +   if (!dev)
> > +   return -ENODEV;
> > +
> > +   if (dev->ops && dev->ops->config)
> > +   ret = (*dev->ops->config)(dev, config, config_size);
> > +
> > +   return ret;
> > +}
> > +
> > +static int afu_rawdev_start(struct rte_rawdev *rawdev) {
> > +   struct afu_rawdev *dev = NULL;
> > +   int ret = 0;
> > +
> > +   IFPGA_RAWDEV_PMD_FUNC_TRACE();
> > +
> > +   dev = afu_rawdev_get_priv(rawdev);
> > +   if (!dev)
> > +   return -ENODEV;
> > +
> > +   ret = afu_rawdev_trylock(dev);
> > +   if (!ret) {
> > +   IFPGA_RAWDEV_PMD_WARN("AFU is busy, please start it
> > later");
> > +   return ret;
> > +   }
> > +
> > +   if (dev->ops && dev->ops->start)
> > +   ret = (*dev->ops->start)(dev);
> > +
> > + 

[PATCH v3] vdpa/mlx5: workaround var offset within page

2022-06-15 Thread Yajun Wu
vDPA driver first uses kernel driver to allocate doorbell(VAR) area for
each device. Then uses var->mmap_off and var->length to mmap uverbs device
file as doorbell userspace virtual address.

Current kernel driver provides var->mmap_off equal to page start of VAR.
It's fine with x86 4K page server, because VAR physical address is only 4K
aligned thus locate in 4K page start.

But with aarch64 64K page server, the actual VAR physical address has
offset within page(not locate in 64K page start). So vDPA driver need add
this within page offset(caps.doorbell_bar_offset) to get right VAR virtual
address.

Fixes: 62c813706e4 ("vdpa/mlx5: map doorbell")
Cc: sta...@dpdk.org

Signed-off-by: Yajun Wu 
Acked-by: Matan Azrad 
Reviewed-by: Maxime Coquelin 
---
 drivers/vdpa/mlx5/mlx5_vdpa.c | 8 +++-
 1 file changed, 7 insertions(+), 1 deletion(-)

diff --git a/drivers/vdpa/mlx5/mlx5_vdpa.c b/drivers/vdpa/mlx5/mlx5_vdpa.c
index 76fa5d4299..8a33a0c9a1 100644
--- a/drivers/vdpa/mlx5/mlx5_vdpa.c
+++ b/drivers/vdpa/mlx5/mlx5_vdpa.c
@@ -14,6 +14,7 @@
 #include 
 #include 
 #include 
+#include 
 
 #include 
 #include 
@@ -560,6 +561,9 @@ mlx5_vdpa_create_dev_resources(struct mlx5_vdpa_priv *priv)
rte_errno = errno;
return -rte_errno;
}
+   /* Add within page offset for 64K page system. */
+   priv->virtq_db_addr = (char *)priv->virtq_db_addr +
+   ((rte_mem_page_size() - 1) & priv->caps.doorbell_bar_offset);
DRV_LOG(DEBUG, "VAR address of doorbell mapping is %p.",
priv->virtq_db_addr);
priv->td = mlx5_devx_cmd_create_td(ctx);
@@ -705,7 +709,9 @@ mlx5_vdpa_release_dev_resources(struct mlx5_vdpa_priv *priv)
if (priv->td)
claim_zero(mlx5_devx_cmd_destroy(priv->td));
if (priv->virtq_db_addr)
-   claim_zero(munmap(priv->virtq_db_addr, priv->var->length));
+   /* Mask out the within page offset for munmap. */
+   claim_zero(munmap((void *)((uintptr_t)priv->virtq_db_addr &
+   ~(rte_mem_page_size() - 1)), priv->var->length));
if (priv->var)
mlx5_glue->dv_free_var(priv->var);
 }
-- 
2.30.2



RE: [PATCH v2] net/mlx5: fix build with clang 14

2022-06-15 Thread Raslan Darawsheh
Hi,

> -Original Message-
> From: Ali Alnubani 
> Sent: Wednesday, May 11, 2022 7:41 PM
> To: dev@dpdk.org
> Cc: Michael Baum ; sta...@dpdk.org
> Subject: [PATCH v2] net/mlx5: fix build with clang 14
> 
> Use fgets instead of fscanf to resolve the following warning reported by
> clang 14.0.0 in Fedora 37 (Rawhide):
> 
> drivers/net/mlx5/linux/mlx5_ethdev_os.c:1137:52: error:
>   'fscanf' may overflow; destination buffer in argument 3 has size 16,
>   but the corresponding specifier may require size 17
>   [-Werror,-Wfortify-source]
>   ret = fscanf(file, "%" RTE_STR(IF_NAMESIZE) "s", port_name);
> 
> Fixes: 63d1db710fbc ("net/mlx5: fix unlimited parsing of switch info")
> Cc: michae...@mellanox.com
> Cc: sta...@dpdk.org
> 
> Signed-off-by: Ali Alnubani 
> ---
> Changes in v2:
>   - Removed unnecessary variable.
> 

Patch applied to next-net-mlx,

Kindest regards,
Raslan Darawsheh


[PATCH v2] net/nfp: initialize the flbufsz

2022-06-15 Thread Peng Zhang
When the testpmd start-up, it will check MTU range,
if MTU > flubfsz, it will lead testpmd start fail.
Because the hw->flbufsz doesn't have the initialized
value, so it will lead the bug.

Fixes: 97a30c4d4c7b ("net/nfp: make sure MTU is never larger than mbuf size")
Cc: sta...@dpdk.org

Signed-off-by: Peng Zhang 
Signed-off-by: Chaoyong He 
---
 drivers/net/nfp/nfp_ethdev.c| 1 +
 drivers/net/nfp/nfp_ethdev_vf.c | 1 +
 2 files changed, 2 insertions(+)

diff --git a/drivers/net/nfp/nfp_ethdev.c b/drivers/net/nfp/nfp_ethdev.c
index 9166f65da3..1a9f7581a7 100644
--- a/drivers/net/nfp/nfp_ethdev.c
+++ b/drivers/net/nfp/nfp_ethdev.c
@@ -502,6 +502,7 @@ nfp_net_init(struct rte_eth_dev *eth_dev)
hw->cap = nn_cfg_readl(hw, NFP_NET_CFG_CAP);
hw->max_mtu = nn_cfg_readl(hw, NFP_NET_CFG_MAX_MTU);
hw->mtu = RTE_ETHER_MTU;
+   hw->flbufsz = RTE_ETHER_MTU;
 
/* VLAN insertion is incompatible with LSOv2 */
if (hw->cap & NFP_NET_CFG_CTRL_LSO2)
diff --git a/drivers/net/nfp/nfp_ethdev_vf.c b/drivers/net/nfp/nfp_ethdev_vf.c
index 0034d68ea6..0781f34764 100644
--- a/drivers/net/nfp/nfp_ethdev_vf.c
+++ b/drivers/net/nfp/nfp_ethdev_vf.c
@@ -369,6 +369,7 @@ nfp_netvf_init(struct rte_eth_dev *eth_dev)
hw->cap = nn_cfg_readl(hw, NFP_NET_CFG_CAP);
hw->max_mtu = nn_cfg_readl(hw, NFP_NET_CFG_MAX_MTU);
hw->mtu = RTE_ETHER_MTU;
+   hw->flbufsz = RTE_ETHER_MTU;
 
/* VLAN insertion is incompatible with LSOv2 */
if (hw->cap & NFP_NET_CFG_CTRL_LSO2)
-- 
2.27.0



Re: [PATCH 2/2] common/cnxk: add include for macro definition

2022-06-15 Thread Bruce Richardson
On Wed, Jun 15, 2022 at 09:06:26AM +0200, Thomas Monjalon wrote:
> 14/06/2022 14:29, Bruce Richardson:
> > The header file "roc_io.h" uses the "__plt_always_inline" macro but
> > don't include "roc_platform.h" to get the definition of it. This
> > inclusion is not necessary for compilation, but the lack of it can
> > confuse some indexers - such as those in eclipse, which reports the
> > lines:
> > 
> > "static __plt_always_inline uint64_t"
> > 
> > as possible definitions of a variable called "uint64_t". This confusion
> > leads to uint64_t being flagged as an unknown type in all other parts of
> > the project being indexed, e.g. across all of DPDK code.
> > 
> > Adding in the include of rte_common.h makes it clear to the indexer that
> > those lines are  part of a function definition, and that allows eclipse
> > to correctly recognise uint64_t as a type from stdint.h
> 
> It is not rte_common.h
> Looks like a copy past of the first patch.
> 
Yes, it's a copy paste error. I got most instances replaced but missed one
(or at least I don't see a second miss).


Re: [PATCH v7] app/testpmd: add Host Shaper command

2022-06-15 Thread Thomas Monjalon
Few comments below,

14/06/2022 14:01, Spike Du:
> Add command line options to support host shaper configure.
> - Command syntax:
>   mlx5 set port  host_shaper avail_thresh_triggered <0|1> rate
> 
> 
> - Example commands:
> To enable avail_thresh_triggered on port 1 and disable current host
> shaper:
> testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0
> 
> To disable avail_thresh_triggered and current host shaper on port 1:
> testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0
> 
> The rate unit is 100Mbps.
> To disable avail_thresh_triggered and configure a shaper of 5Gbps on
> port 1:
> testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50
> 
> Add sample code to handle rxq available descriptor threshold event, it
> delays a while so that rxq empties, then disables host shaper and
> rearms available descriptor threshold event.
> 
> Signed-off-by: Spike Du 
[...]
> @@ -3659,6 +3662,10 @@ struct pmd_test_command {
>   break;
>   printf("Received avail_thresh event, port:%d 
> rxq_id:%d\n",
>  port_id, rxq_id);
> +
> +#ifdef RTE_NET_MLX5
> + mlx5_test_avail_thresh_event_handler(port_id, rxq_id);
> +#endif

This function includes a check of the driver in use for the port,
so I assume it is OK to call it unconditionally here.

[...]
> +How to use available descriptor threshold and Host Shaper
> +--

Underlining should be of same size as the title.

> +
> +There are sample command lines to configure available descriptor threshold 
> in testpmd.

Simpler:
There is a command to configure available descriptor threshold in testpmd.

You could add a link here to the doc of the command.

> +Testpmd also contains sample logic to handle available descriptor threshold 
> event.
> +The typical workflow is: testpmd configure available descriptor threshold 
> for Rx queues, enable
> +avail_thresh_triggered in host shaper and register a callback, when traffic 
> from host is
> +too high and Rx queue emptiness is below available descriptor threshold, PMD 
> receives an event and
> +firmware configures a 100Mbps shaper on host port automatically, then PMD 
> call
> +the callback registered previously, which will delay a while to let Rx queue
> +empty, then disable host shaper.
> +
> +Let's assume we have a simple Blue Field 2 setup: port 0 is uplink, port 1

There's no space in BlueField.

> +is VF representor. Each port has 2 Rx queues.
> +In order to control traffic from host to ARM, we can enable available 
> descriptor threshold in testpmd by:
> +
> +.. code-block:: console
> +
> +   testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0
> +   testpmd> set port 1 rxq 0 avail_thresh 70
> +   testpmd> set port 1 rxq 1 avail_thresh 70
> +
> +The first command disables current host shaper, and enables available 
> descriptor threshold triggered mode.
> +The left commands configure available descriptor threshold to 70% of Rx 
> queue size for both Rx queues,

left -> other

> +When traffic from host is too high, you can see testpmd console prints log
> +about available descriptor threshold event receiving, then host shaper is 
> disabled.
> +The traffic rate from host is controlled and less drop happens in Rx queues.
> +
> +When disable available descriptor threshold and avail_thresh_triggered, we 
> can invoke below commands in testpmd:

Reword:
The threshold event and shaper can be disabled like this:

> +
> +.. code-block:: console
> +
> +   testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0
> +   testpmd> set port 1 rxq 0 avail_thresh 0
> +   testpmd> set port 1 rxq 1 avail_thresh 0
> +
> +It's recommended an application disables available descriptor threshold and 
> avail_thresh_triggered before exit,
> +if it enables them before.
> +
> +We can also configure the shaper with a value, the rate unit is 100Mbps, 
> below
> +command sets current shaper to 5Gbps and disables avail_thresh_triggered.
> +
> +.. code-block:: console
> +
> +   testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50





[PATCH] net/mlx5: block negative integrity flows

2022-06-15 Thread Raja Zidane
Negative integrity flows are supported, but malfunctioning due to a
bug, and an API change is necessary to fix.

Drop negative flow support for now.

Signed-off-by: Raja Zidane 
Acked-by: Matan Azrad 
Acked-by: Gregory Etelson 
---
 doc/guides/nics/mlx5.rst| 4 +++-
 drivers/net/mlx5/mlx5_flow_dv.c | 6 ++
 2 files changed, 9 insertions(+), 1 deletion(-)

diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst
index f41703277e..f28a874006 100644
--- a/doc/guides/nics/mlx5.rst
+++ b/doc/guides/nics/mlx5.rst
@@ -467,7 +467,9 @@ Limitations
 TCP or UDP, must be in the rule pattern as well::
 
   flow create 0 ingress pattern integrity level is 0 value mask l3_ok 
value spec l3_ok / eth / ipv6 / end …
-  or
+
+Negative integrity flows are not supported. i.e.
+
   flow create 0 ingress pattern integrity level is 0 value mask l4_ok 
value spec 0 / eth / ipv4 proto is udp / end …
 
 - Connection tracking:
diff --git a/drivers/net/mlx5/mlx5_flow_dv.c b/drivers/net/mlx5/mlx5_flow_dv.c
index 65b02b20ce..f82986ac31 100644
--- a/drivers/net/mlx5/mlx5_flow_dv.c
+++ b/drivers/net/mlx5/mlx5_flow_dv.c
@@ -6809,6 +6809,12 @@ flow_dv_validate_item_integrity(struct rte_eth_dev *dev,
  RTE_FLOW_ERROR_TYPE_ITEM,
  integrity_item,
  "unsupported integrity filter");
+   if ((mask->l3_ok & !spec->l3_ok) || (mask->l4_ok & !spec->l4_ok) ||
+   (mask->ipv4_csum_ok & !spec->ipv4_csum_ok) ||
+   (mask->l4_csum_ok & !spec->l4_csum_ok))
+   return rte_flow_error_set(error, EINVAL,
+ RTE_FLOW_ERROR_TYPE_ITEM,
+ NULL, "negative integrity flow is not 
supported");
if (spec->level > 1) {
if (pattern_flags & MLX5_FLOW_ITEM_INNER_INTEGRITY)
return rte_flow_error_set
-- 
2.21.0



[Bug 1036] vhost primary and secondary process symmetric test meet issue

2022-06-15 Thread bugzilla
https://bugs.dpdk.org/show_bug.cgi?id=1036

Bug ID: 1036
   Summary: vhost primary and secondary process symmetric test
meet issue
   Product: DPDK
   Version: 22.03
  Hardware: x86
OS: Linux
Status: UNCONFIRMED
  Severity: normal
  Priority: Normal
 Component: vhost/virtio
  Assignee: dev@dpdk.org
  Reporter: xingguang...@intel.com
  Target Milestone: ---

Prerequisites
-
1. Compile symmetric_mp example application::
meson configure -Dexamples=multi_process/symmetric_mp
x86_64-native-linuxapp-gcc
ninja -C x86_64-native-linuxapp-gcc

Steps
-
1. Launch vhost testpmd as primary process::

./x86_64-native-linuxapp-gcc/examples/dpdk-symmetric_mp -l 1 -n 4 \
--vdev 'net_vhost,iface=vhost-net,queues=2,client=1' \
--vdev 'net_vhost1,iface=vhost-net1,queues=2,client=1' \
--proc-type=auto -- -p 3 --num-procs=2 --proc-id=0

2. Launch vhost testpmd as secondary process::  

./x86_64-native-linuxapp-gcc/examples/dpdk-symmetric_mp -l 2 -n 4 \
--vdev 'net_vhost,iface=vhost-net,queues=2,client=1' \
--vdev 'net_vhost1,iface=vhost-net1,queues=2,client=1' \
--proc-type=auto -- -p 3 --num-procs=2 --proc-id=1

3. Launch virtio-user::
./x86_64-native-linuxapp-gcc/app/dpdk-testpmd -n 4 -l 7-8 --no-pci
--file-prefix=virtio1 \
--vdev=net_virtio_user1,mac=00:01:02:03:04:05,path=./vhost-net,queues=2,server=1,mrg_rxbuf=0,in_order=0
\
-- -i --nb-cores=1 --rxq=2 --txq=2 --txd=1024 --rxd=1024

when launch virtio-user:
VHOST_CONFIG: (0) device not found.

-- 
You are receiving this mail because:
You are the assignee for the bug.

RE: [PATCH v7] net/i40e: add outer VLAN processing

2022-06-15 Thread Zhang, Qi Z



> -Original Message-
> From: Zhang, Yuying 
> Sent: Tuesday, June 14, 2022 4:40 PM
> To: Liu, KevinX ; dev@dpdk.org
> Cc: Xing, Beilei ; Yang, SteveX
> ; Zhang, RobinX 
> Subject: RE: [PATCH v7] net/i40e: add outer VLAN processing
> 
> Hi Kevin,
> 
> Workaround should be replaced when root cause be found.
> 
> Best regards,
> Yuying
> 
> > -Original Message-
> > From: Liu, KevinX 
> > Sent: Tuesday, June 14, 2022 11:07 AM
> > To: Zhang, Yuying ; dev@dpdk.org
> > Cc: Xing, Beilei ; Yang, SteveX
> > ; Zhang, RobinX 
> > Subject: RE: [PATCH v7] net/i40e: add outer VLAN processing
> >
> > Hi, Yuying
> >
> > > -Original Message-
> > > From: Zhang, Yuying 
> > > Sent: 2022年6月14日 10:44
> > > To: Liu, KevinX ; dev@dpdk.org
> > > Cc: Xing, Beilei ; Yang, SteveX
> > > ; Zhang, RobinX 
> > > Subject: RE: [PATCH v7] net/i40e: add outer VLAN processing
> > >
> > > Hi Kevin,
> > >
> > > > -Original Message-
> > > > From: Liu, KevinX 
> > > > Sent: Saturday, June 11, 2022 12:30 AM
> > > > To: dev@dpdk.org
> > > > Cc: Zhang, Yuying ; Xing, Beilei
> > > > ; Yang, SteveX ;
> > > > Zhang, RobinX ; Liu, KevinX
> > > > 
> > > > Subject: [PATCH v7] net/i40e: add outer VLAN processing
> > > >
> > > > From: Robin Zhang 
> > > >
> > > > Outer VLAN processing is supported after firmware v8.4, kernel
> > > > driver also
> > >
> > > Since this patch can only be enabled with firmware v8.6, should you
> > > sync with dpdk here?
> > OK, I'll revise it here.
> > >
> > > > change the default behavior to support this feature. To align with
> > > > kernel driver, add support for outer VLAN processing in DPDK.
> > > >
> > > > But it is forbidden for firmware to change the Inner/Outer VLAN
> > > > configuration while there are MAC/VLAN filters in the switch table.
> > > > Therefore, we need to clear the MAC table before setting config,
> > > > and then restore the MAC table after setting.
> > > >
> > > > This will not impact on an old firmware.
> > > >
> > > > Signed-off-by: Robin Zhang 
> > > > Signed-off-by: Kevin Liu 
> 
> Acked-by: Yuying Zhang 

Applied to dpdk-next-net-intel.

Thanks
Qi


Re: [PATCH 1/2] common/cnxk: decrypt packet count register update

2022-06-15 Thread Jerin Jacob
On Fri, May 20, 2022 at 10:53 AM Rahul Bhansali  wrote:
>
> Corrects the CPT decrypt packet counter register.
>
> Signed-off-by: Rahul Bhansali 

Updated the git commit as follows and applied to
dpdk-next-net-mrvl/for-next-net. Thanks

common/cnxk: fix decrypt packet count register update

Corrects the CPT decrypt packet counter register.

Fixes: b1a22e5d4f ("common/cnxk: add CPT diagnostics")
Cc: sta...@dpdk.org

Signed-off-by: Rahul Bhansali 


> ---
>  drivers/common/cnxk/roc_cpt_debug.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
>
> diff --git a/drivers/common/cnxk/roc_cpt_debug.c 
> b/drivers/common/cnxk/roc_cpt_debug.c
> index 847d969268..be6ddb56aa 100644
> --- a/drivers/common/cnxk/roc_cpt_debug.c
> +++ b/drivers/common/cnxk/roc_cpt_debug.c
> @@ -200,7 +200,7 @@ cpt_lf_print(struct roc_cpt_lf *lf)
> reg_val = plt_read64(lf->rbase + CPT_LF_CTX_DEC_BYTE_CNT);
> plt_print("Decrypted byte count:\t%" PRIu64, reg_val);
>
> -   reg_val = plt_read64(lf->rbase + CPT_LF_CTX_ENC_PKT_CNT);
> +   reg_val = plt_read64(lf->rbase + CPT_LF_CTX_DEC_PKT_CNT);
> plt_print("Decrypted packet count:\t%" PRIu64, reg_val);
>  }
>
> --
> 2.25.1
>


Re: [PATCH 2/2] common/cnxk: print NIX inline outbound CPT LF registers

2022-06-15 Thread Jerin Jacob
On Tue, May 24, 2022 at 8:43 PM Ray Kinsella  wrote:
>
>
> Rahul Bhansali  writes:
>
> > This add the support to dump NIX inline outbound CPT LF
> > registers.
> >
> > Signed-off-by: Rahul Bhansali 
> > ---
> >  drivers/common/cnxk/roc_nix_debug.c | 14 ++
> >  drivers/common/cnxk/roc_nix_inl.h   |  1 +
> >  drivers/common/cnxk/version.map |  1 +
> >  3 files changed, 16 insertions(+)
> >
>
> Acked-by: Ray Kinsella 

Applied to dpdk-next-net-mrvl/for-next-net. Thanks


Re: [dpdk-dev] [PATCH] common/cnxk: add ROC API to free MCAM entry

2022-06-15 Thread Jerin Jacob
On Fri, Jun 3, 2022 at 9:06 AM  wrote:
>
> From: Satheesh Paul 
>
> Add ROC API to free the given MCAM entry. If the MCAM
> entry has flow counter associated, this API will clear
> and free the flow counter.
>
> Signed-off-by: Satheesh Paul 
> Reviewed-by: Jerin Jacob Kollanukkaran 
> ---
>  drivers/common/cnxk/roc_npc.c   | 63 +++--
>  drivers/common/cnxk/roc_npc.h   |  7 
>  drivers/common/cnxk/roc_npc_mcam.c  | 39 ++
>  drivers/common/cnxk/roc_npc_priv.h  |  2 +
>  drivers/common/cnxk/roc_npc_utils.c | 15 +++
>  drivers/common/cnxk/version.map |  3 ++
>  6 files changed, 100 insertions(+), 29 deletions(-)
>
> diff --git a/drivers/common/cnxk/roc_npc.c b/drivers/common/cnxk/roc_npc.c
> index da5b96242e..02624856e0 100644
> --- a/drivers/common/cnxk/roc_npc.c
> +++ b/drivers/common/cnxk/roc_npc.c
> @@ -55,6 +55,53 @@ roc_npc_mcam_free_entry(struct roc_npc *roc_npc, uint32_t 
> entry)
> return npc_mcam_free_entry(npc, entry);
>  }
>
> +int
> +roc_npc_mcam_free(struct roc_npc *roc_npc, struct roc_npc_flow *mcam)
> +{
> +   int rc = 0;
> +
> +   if (mcam->use_ctr) {
> +   rc = roc_npc_mcam_clear_counter(roc_npc, mcam->ctr_id);
> +   if (rc)
> +   return rc;
> +
> +   rc = roc_npc_mcam_free_counter(roc_npc, mcam->ctr_id);
> +   if (rc)
> +   return rc;
> +   }
> +
> +   return roc_npc_mcam_free_entry(roc_npc, mcam->mcam_id);
> +}
> +
> +int
> +roc_npc_mcam_init(struct roc_npc *roc_npc, struct roc_npc_flow *flow,
> + int mcam_id)
> +{
> +   struct npc *npc = roc_npc_to_npc_priv(roc_npc);
> +   int rc = 0;

No need to assign to zero as the next line is overwriting it.

> +
> +   rc = npc_mcam_init(npc, flow, mcam_id);
> +   if (rc != 0) {
> +   plt_err("npc: mcam initialisation write failed");
> +   return rc;
> +   }
> +   return 0;
> +}
> +
> +int
> +roc_npc_mcam_move(struct roc_npc *roc_npc, uint16_t old_ent, uint16_t 
> new_ent)
> +{
> +   struct npc *npc = roc_npc_to_npc_priv(roc_npc);
> +   struct mbox *mbox = npc->mbox;
> +   int rc = -ENOSPC;

No need to assign to value as the next line is overwriting it.

> +
> +   rc = npc_mcam_move(mbox, old_ent, new_ent);
> +   if (rc)
> +   return rc;
> +
> +   return 0;
> +}
> +
>  int
>  roc_npc_mcam_free_all_resources(struct roc_npc *roc_npc)
>  {
> @@ -383,7 +430,7 @@ npc_parse_actions(struct roc_npc *roc_npc, const struct 
> roc_npc_attr *attr,
>
> case ROC_NPC_ACTION_TYPE_COUNT:
> /* Indicates, need a counter */
> -   flow->ctr_id = 1;
> +   flow->use_ctr = 1;
> req_act |= ROC_NPC_ACTION_TYPE_COUNT;
> break;
>
> @@ -1268,7 +1315,7 @@ roc_npc_flow_create(struct roc_npc *roc_npc, const 
> struct roc_npc_attr *attr,
> return flow;
>
>  set_rss_failed:
> -   rc = npc_mcam_free_entry(npc, flow->mcam_id);
> +   rc = roc_npc_mcam_free_entry(roc_npc, flow->mcam_id);
> if (rc != 0) {
> *errcode = rc;
> plt_free(flow);
> @@ -1314,17 +1361,7 @@ roc_npc_flow_destroy(struct roc_npc *roc_npc, struct 
> roc_npc_flow *flow)
> return rc;
> }
>
> -   if (flow->ctr_id != NPC_COUNTER_NONE) {
> -   rc = roc_npc_mcam_clear_counter(roc_npc, flow->ctr_id);
> -   if (rc != 0)
> -   return rc;
> -
> -   rc = npc_mcam_free_counter(npc, flow->ctr_id);
> -   if (rc != 0)
> -   return rc;
> -   }
> -
> -   rc = npc_mcam_free_entry(npc, flow->mcam_id);
> +   rc = roc_npc_mcam_free(roc_npc, flow);
> if (rc != 0)
> return rc;
>
> diff --git a/drivers/common/cnxk/roc_npc.h b/drivers/common/cnxk/roc_npc.h
> index f92c2a633c..1b4e5521cb 100644
> --- a/drivers/common/cnxk/roc_npc.h
> +++ b/drivers/common/cnxk/roc_npc.h
> @@ -246,6 +246,7 @@ struct roc_npc_flow {
> uint8_t nix_intf;
> uint8_t enable;
> uint32_t mcam_id;
> +   uint8_t use_ctr;
> int32_t ctr_id;
> uint32_t priority;
> uint32_t mtr_id;
> @@ -329,6 +330,8 @@ roc_npc_flow_create(struct roc_npc *roc_npc, const struct 
> roc_npc_attr *attr,
> const struct roc_npc_action actions[], int *errcode);
>  int __roc_api roc_npc_flow_destroy(struct roc_npc *roc_npc,
>struct roc_npc_flow *flow);
> +int __roc_api roc_npc_mcam_free(struct roc_npc *roc_npc,
> +   struct roc_npc_flow *mcam);
>  int __roc_api roc_npc_mcam_free_entry(struct roc_npc *roc_npc, uint32_t 
> entry);
>  int __roc_api roc_npc_mcam_enable_all_entries(struct roc_npc *roc_npc,
>   bool enable);
> @

[PATCH v8 2/6] common/mlx5: share interrupt management

2022-06-15 Thread Spike Du
There are many duplicate code of creating and initializing rte_intr_handle.
Add a new mlx5_os API to do this, replace all PMD related code with this
API.

Signed-off-by: Spike Du 
---
 drivers/common/mlx5/linux/mlx5_common_os.c   | 131 ++
 drivers/common/mlx5/linux/mlx5_common_os.h   |  11 +++
 drivers/common/mlx5/version.map  |   2 +
 drivers/common/mlx5/windows/mlx5_common_os.h |  24 +
 drivers/net/mlx5/linux/mlx5_ethdev_os.c  |  71 --
 drivers/net/mlx5/linux/mlx5_os.c | 132 ++-
 drivers/net/mlx5/linux/mlx5_socket.c |  53 ++-
 drivers/net/mlx5/mlx5.h  |   2 -
 drivers/net/mlx5/mlx5_txpp.c |  28 ++
 drivers/net/mlx5/windows/mlx5_ethdev_os.c|  22 -
 drivers/vdpa/mlx5/mlx5_vdpa_virtq.c  |  48 ++
 11 files changed, 217 insertions(+), 307 deletions(-)

diff --git a/drivers/common/mlx5/linux/mlx5_common_os.c 
b/drivers/common/mlx5/linux/mlx5_common_os.c
index d40cfd5..f10a981 100644
--- a/drivers/common/mlx5/linux/mlx5_common_os.c
+++ b/drivers/common/mlx5/linux/mlx5_common_os.c
@@ -11,6 +11,7 @@
 #endif
 #include 
 #include 
+#include 
 
 #include 
 #include 
@@ -964,3 +965,133 @@
claim_zero(mlx5_glue->dereg_mr(pmd_mr->obj));
memset(pmd_mr, 0, sizeof(*pmd_mr));
 }
+
+/**
+ * Rte_intr_handle create and init helper.
+ *
+ * @param[in] mode
+ *   interrupt instance can be shared between primary and secondary
+ *   processes or not.
+ * @param[in] set_fd_nonblock
+ *   Whether to set fd to O_NONBLOCK.
+ * @param[in] fd
+ *   Fd to set in created intr_handle.
+ * @param[in] cb
+ *   Callback to register for intr_handle.
+ * @param[in] cb_arg
+ *   Callback argument for cb.
+ *
+ * @return
+ *  - Interrupt handle on success.
+ *  - NULL on failure, with rte_errno set.
+ */
+struct rte_intr_handle *
+mlx5_os_interrupt_handler_create(int mode, bool set_fd_nonblock, int fd,
+rte_intr_callback_fn cb, void *cb_arg)
+{
+   struct rte_intr_handle *tmp_intr_handle;
+   int ret, flags;
+
+   tmp_intr_handle = rte_intr_instance_alloc(mode);
+   if (!tmp_intr_handle) {
+   rte_errno = ENOMEM;
+   goto err;
+   }
+   if (set_fd_nonblock) {
+   flags = fcntl(fd, F_GETFL);
+   ret = fcntl(fd, F_SETFL, flags | O_NONBLOCK);
+   if (ret) {
+   rte_errno = errno;
+   goto err;
+   }
+   }
+   ret = rte_intr_fd_set(tmp_intr_handle, fd);
+   if (ret)
+   goto err;
+   ret = rte_intr_type_set(tmp_intr_handle, RTE_INTR_HANDLE_EXT);
+   if (ret)
+   goto err;
+   ret = rte_intr_callback_register(tmp_intr_handle, cb, cb_arg);
+   if (ret) {
+   rte_errno = -ret;
+   goto err;
+   }
+   return tmp_intr_handle;
+err:
+   if (tmp_intr_handle)
+   rte_intr_instance_free(tmp_intr_handle);
+   return NULL;
+}
+
+/* Safe unregistration for interrupt callback. */
+static void
+mlx5_intr_callback_unregister(const struct rte_intr_handle *handle,
+ rte_intr_callback_fn cb_fn, void *cb_arg)
+{
+   uint64_t twait = 0;
+   uint64_t start = 0;
+
+   do {
+   int ret;
+
+   ret = rte_intr_callback_unregister(handle, cb_fn, cb_arg);
+   if (ret >= 0)
+   return;
+   if (ret != -EAGAIN) {
+   DRV_LOG(INFO, "failed to unregister interrupt"
+ " handler (error: %d)", ret);
+   MLX5_ASSERT(false);
+   return;
+   }
+   if (twait) {
+   struct timespec onems;
+
+   /* Wait one millisecond and try again. */
+   onems.tv_sec = 0;
+   onems.tv_nsec = NS_PER_S / MS_PER_S;
+   nanosleep(&onems, 0);
+   /* Check whether one second elapsed. */
+   if ((rte_get_timer_cycles() - start) <= twait)
+   continue;
+   } else {
+   /*
+* We get the amount of timer ticks for one second.
+* If this amount elapsed it means we spent one
+* second in waiting. This branch is executed once
+* on first iteration.
+*/
+   twait = rte_get_timer_hz();
+   MLX5_ASSERT(twait);
+   }
+   /*
+* Timeout elapsed, show message (once a second) and retry.
+* We have no other acceptable option here, if we ignore
+* the unregistering return code the handler will not
+* be unr

[PATCH v8 0/6] introduce per-queue available descriptor threshold and host shaper

2022-06-15 Thread Spike Du
available descriptor threshold(ADT for short) is per RX queue attribute, when 
RX queue available descriptors for HW is below ADT, HW sends an event to 
application.
Host shaper can configure shaper rate and avail_thresh-triggered for a host 
port.
The shaper limits the rate of traffic from host port to embedded ARM rx port on 
Nvidia BlueField 2 NIC.
If avail_thresh-triggered is enabled, a 100Mbps shaper is enabled automatically 
when one of the host port's Rx queues receives available descriptor threshold 
event.

These two features can combine to control traffic from host port to wire port 
for BlueField 2 NIC.
The traffic flows from host to embedded ARM, then to the physical port.
The work flow is on the ARM system, configure available descriptor threshold to 
RX queue and enable avail_thresh-triggered flag in host shaper, after receiving 
available descriptor threshold event, delay a while until RX queue is empty , 
then disable the shaper. We recycle this work flow to reduce RX queue drops on 
ARM system.

Add new libethdev API to set available descriptor threshold, add rte event 
RTE_ETH_EVENT_RX_AVAIL_THRESH to handle available descriptor threshold event. 
For host shaper, because it doesn't align to existing DPDK framework and is 
specific to Nvidia NIC, use PMD private API.

For integration with testpmd, put the private cmdline function and available 
descriptor threshold event handler in mlx5 PMD directory by adding a new file 
mlx5_testpmd.c. Follow David Marchand's driver specific commands framework to 
add mlx5 specific commands.

Spike Du (6):
  net/mlx5: add LWM support for Rxq
  common/mlx5: share interrupt management
  net/mlx5: add LWM event handling support
  net/mlx5: support Rx queue based available descriptor threshold
  net/mlx5: add private API to config host port shaper
  app/testpmd: add Host Shaper command

 app/test-pmd/testpmd.c   |   7 +
 doc/guides/nics/mlx5.rst |  93 +
 doc/guides/rel_notes/release_22_07.rst   |   2 +
 drivers/common/mlx5/linux/meson.build|  13 ++
 drivers/common/mlx5/linux/mlx5_common_os.c   | 131 
 drivers/common/mlx5/linux/mlx5_common_os.h   |  11 +
 drivers/common/mlx5/mlx5_prm.h   |  26 +++
 drivers/common/mlx5/version.map  |   2 +
 drivers/common/mlx5/windows/mlx5_common_os.h |  24 +++
 drivers/net/mlx5/linux/mlx5_ethdev_os.c  |  71 ---
 drivers/net/mlx5/linux/mlx5_os.c | 132 +++-
 drivers/net/mlx5/linux/mlx5_socket.c |  53 +
 drivers/net/mlx5/meson.build |   4 +
 drivers/net/mlx5/mlx5.c  |  68 +++
 drivers/net/mlx5/mlx5.h  |  12 +-
 drivers/net/mlx5/mlx5_devx.c |  60 +-
 drivers/net/mlx5/mlx5_devx.h |   1 +
 drivers/net/mlx5/mlx5_rx.c   | 288 +++
 drivers/net/mlx5/mlx5_rx.h   |  13 ++
 drivers/net/mlx5/mlx5_testpmd.c  | 205 +++
 drivers/net/mlx5/mlx5_testpmd.h  |  26 +++
 drivers/net/mlx5/mlx5_txpp.c |  28 +--
 drivers/net/mlx5/rte_pmd_mlx5.h  |  30 +++
 drivers/net/mlx5/version.map |   2 +
 drivers/net/mlx5/windows/mlx5_ethdev_os.c|  22 --
 drivers/vdpa/mlx5/mlx5_vdpa_virtq.c  |  48 +
 26 files changed, 1064 insertions(+), 308 deletions(-)
 create mode 100644 drivers/net/mlx5/mlx5_testpmd.c
 create mode 100644 drivers/net/mlx5/mlx5_testpmd.h

-- 
1.8.3.1



[PATCH v8 5/6] net/mlx5: add private API to config host port shaper

2022-06-15 Thread Spike Du
Host port shaper can be configured with QSHR(QoS Shaper Host Register).
Add check in build files to enable this function or not.

The host shaper configuration affects all the ethdev ports belonging to the
same host port.

Host shaper can configure shaper rate and lwm-triggered for a host port.
The shaper limits the rate of traffic from host port to wire port.
If lwm-triggered is enabled, a 100Mbps shaper is enabled automatically
when one of the host port's Rx queues receives available descriptor
threshold event.

Signed-off-by: Spike Du 
---
 doc/guides/nics/mlx5.rst   |  35 +++
 doc/guides/rel_notes/release_22_07.rst |   1 +
 drivers/common/mlx5/linux/meson.build  |  13 +
 drivers/common/mlx5/mlx5_prm.h |  25 
 drivers/net/mlx5/mlx5.h|   2 +
 drivers/net/mlx5/mlx5_rx.c | 104 +
 drivers/net/mlx5/rte_pmd_mlx5.h|  30 ++
 drivers/net/mlx5/version.map   |   2 +
 8 files changed, 212 insertions(+)

diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst
index cceaddf..5f7b060 100644
--- a/doc/guides/nics/mlx5.rst
+++ b/doc/guides/nics/mlx5.rst
@@ -94,6 +94,7 @@ Features
 - Sub-Function representors.
 - Sub-Function.
 - Rx queue available descriptor threshold configuration.
+- Host shaper support.
 
 
 Limitations
@@ -525,6 +526,12 @@ Limitations
 
   - Doesn't support shared Rx queue and Hairpin Rx queue.
 
+- Host shaper:
+
+  - Support BlueField series NIC from BlueField 2.
+  - When configure host shaper with 
MLX5_HOST_SHAPER_FLAG_AVAIL_THRESH_TRIGGERED flag set,
+only rate 0 and 100Mbps are supported.
+
 Statistics
 --
 
@@ -1692,3 +1699,31 @@ Available descriptor threshold is a per Rx queue 
attribute, it should be configu
 a percentage of the Rx queue size.
 When Rx queue available descriptors for hardware are below the threshold, an 
event is sent to PMD.
 
+Host shaper introduction
+
+
+Host shaper register is per host port register which sets a shaper
+on the host port.
+All VF/hostPF representors belonging to one host port share one host shaper.
+For example, if representor 0 and representor 1 belong to same host port,
+and a host shaper rate of 1Gbps is configured, the shaper throttles both
+representors' traffic from host.
+Host shaper has two modes for setting the shaper, immediate and deferred to
+available descriptor threshold event trigger. In immediate mode, the rate 
limit is configured
+immediately to host shaper. When deferring to available descriptor threshold 
trigger, the shaper
+is not set until an available descriptor threshold event is received by any Rx 
queue in a VF
+representor belonging to the host port. The only rate supported for deferred
+mode is 100Mbps (there is no limit on the supported rates for immediate mode).
+In deferred mode, the shaper is set on the host port by the firmware upon
+receiving the available descriptor threshold event, which allows throttling 
host traffic on
+available descriptor threshold events at minimum latency, preventing excess 
drops in the
+Rx queue.
+
+Host shaper dependency for mstflint package
+---
+
+In order to configure host shaper register, ``librte_net_mlx5`` depends on 
``libmtcr_ul``
+which can be installed from OFED mstflint package.
+Meson detects ``libmtcr_ul`` existence at configure stage.
+If the library is detected, the application must link with ``-lmtcr_ul``,
+as done by the pkg-config file libdpdk.pc.
diff --git a/doc/guides/rel_notes/release_22_07.rst 
b/doc/guides/rel_notes/release_22_07.rst
index 46fd73a..3349cda 100644
--- a/doc/guides/rel_notes/release_22_07.rst
+++ b/doc/guides/rel_notes/release_22_07.rst
@@ -90,6 +90,7 @@ New Features
   * Added support for MTU on Windows.
   * Added matching and RSS on IPsec ESP.
   * Added Rx queue available descriptor threshold support.
+  * Added host shaper support.
 
 * **Updated Marvell cnxk crypto driver.**
 
diff --git a/drivers/common/mlx5/linux/meson.build 
b/drivers/common/mlx5/linux/meson.build
index 5335f5b..51c6e5d 100644
--- a/drivers/common/mlx5/linux/meson.build
+++ b/drivers/common/mlx5/linux/meson.build
@@ -45,6 +45,13 @@ if static_ibverbs
 ext_deps += declare_dependency(link_args:ibv_ldflags.split())
 endif
 
+libmtcr_ul_found = false
+lib = cc.find_library('mtcr_ul', required:false)
+if lib.found() and run_command('meson', 
'--version').stdout().version_compare('>= 0.49.2')
+libmtcr_ul_found = true
+ext_deps += lib
+endif
+
 sources += files('mlx5_nl.c')
 sources += files('mlx5_common_auxiliary.c')
 sources += files('mlx5_common_os.c')
@@ -207,6 +214,12 @@ has_sym_args = [
 [ 'HAVE_MLX5_IBV_IMPORT_CTX_PD_AND_MR', 'infiniband/verbs.h',
 'ibv_import_device' ],
 ]
+if  libmtcr_ul_found
+has_sym_args += [
+[  'HAVE_MLX5_MSTFLINT', 'mstflint/mtcr.h',
+'mopen'],
+]
+endif
 config = configuration_data()
 fo

[PATCH v8 1/6] net/mlx5: add LWM support for Rxq

2022-06-15 Thread Spike Du
Add lwm(Limit WaterMark) field to Rxq object which indicates the percentage
of RX queue size used by HW to raise LWM event to the user.
Allow LWM setting in modify_rq command.
Allow the LWM configuration dynamically by adding RDY2RDY state change.

Signed-off-by: Spike Du 
---
 drivers/net/mlx5/mlx5.h  |  1 +
 drivers/net/mlx5/mlx5_devx.c | 13 -
 drivers/net/mlx5/mlx5_devx.h |  1 +
 drivers/net/mlx5/mlx5_rx.h   |  1 +
 4 files changed, 15 insertions(+), 1 deletion(-)

diff --git a/drivers/net/mlx5/mlx5.h b/drivers/net/mlx5/mlx5.h
index ef755ee..305edff 100644
--- a/drivers/net/mlx5/mlx5.h
+++ b/drivers/net/mlx5/mlx5.h
@@ -1395,6 +1395,7 @@ enum mlx5_rxq_modify_type {
MLX5_RXQ_MOD_RST2RDY, /* modify state from reset to ready. */
MLX5_RXQ_MOD_RDY2ERR, /* modify state from ready to error. */
MLX5_RXQ_MOD_RDY2RST, /* modify state from ready to reset. */
+   MLX5_RXQ_MOD_RDY2RDY, /* modify state from ready to ready. */
 };
 
 enum mlx5_txq_modify_type {
diff --git a/drivers/net/mlx5/mlx5_devx.c b/drivers/net/mlx5/mlx5_devx.c
index 4b48f94..c918a50 100644
--- a/drivers/net/mlx5/mlx5_devx.c
+++ b/drivers/net/mlx5/mlx5_devx.c
@@ -62,7 +62,7 @@
  * @return
  *   0 on success, a negative errno value otherwise and rte_errno is set.
  */
-static int
+int
 mlx5_devx_modify_rq(struct mlx5_rxq_priv *rxq, uint8_t type)
 {
struct mlx5_devx_modify_rq_attr rq_attr;
@@ -76,6 +76,11 @@
case MLX5_RXQ_MOD_RST2RDY:
rq_attr.rq_state = MLX5_RQC_STATE_RST;
rq_attr.state = MLX5_RQC_STATE_RDY;
+   if (rxq->lwm) {
+   rq_attr.modify_bitmask |=
+   MLX5_MODIFY_RQ_IN_MODIFY_BITMASK_WQ_LWM;
+   rq_attr.lwm = rxq->lwm;
+   }
break;
case MLX5_RXQ_MOD_RDY2ERR:
rq_attr.rq_state = MLX5_RQC_STATE_RDY;
@@ -85,6 +90,12 @@
rq_attr.rq_state = MLX5_RQC_STATE_RDY;
rq_attr.state = MLX5_RQC_STATE_RST;
break;
+   case MLX5_RXQ_MOD_RDY2RDY:
+   rq_attr.rq_state = MLX5_RQC_STATE_RDY;
+   rq_attr.state = MLX5_RQC_STATE_RDY;
+   rq_attr.modify_bitmask |= 
MLX5_MODIFY_RQ_IN_MODIFY_BITMASK_WQ_LWM;
+   rq_attr.lwm = rxq->lwm;
+   break;
default:
break;
}
diff --git a/drivers/net/mlx5/mlx5_devx.h b/drivers/net/mlx5/mlx5_devx.h
index a95207a..ebd1da4 100644
--- a/drivers/net/mlx5/mlx5_devx.h
+++ b/drivers/net/mlx5/mlx5_devx.h
@@ -11,6 +11,7 @@
 int mlx5_txq_devx_modify(struct mlx5_txq_obj *obj,
 enum mlx5_txq_modify_type type, uint8_t dev_port);
 void mlx5_txq_devx_obj_release(struct mlx5_txq_obj *txq_obj);
+int mlx5_devx_modify_rq(struct mlx5_rxq_priv *rxq, uint8_t type);
 
 extern struct mlx5_obj_ops devx_obj_ops;
 
diff --git a/drivers/net/mlx5/mlx5_rx.h b/drivers/net/mlx5/mlx5_rx.h
index e715ed6..25a5f2c 100644
--- a/drivers/net/mlx5/mlx5_rx.h
+++ b/drivers/net/mlx5/mlx5_rx.h
@@ -175,6 +175,7 @@ struct mlx5_rxq_priv {
struct mlx5_devx_rq devx_rq;
struct rte_eth_hairpin_conf hairpin_conf; /* Hairpin configuration. */
uint32_t hairpin_status; /* Hairpin binding status. */
+   uint32_t lwm:16;
 };
 
 /* External RX queue descriptor. */
-- 
1.8.3.1



[PATCH v8 3/6] net/mlx5: add LWM event handling support

2022-06-15 Thread Spike Du
When LWM meets RQ WQE, the kernel driver raises an event to SW.
Use devx event_channel to catch this and to notify the user.
Allocate this channel per shared device.
The channel has a cookie that informs the specific event port and queue.

Signed-off-by: Spike Du 
---
 drivers/net/mlx5/mlx5.c  | 66 
 drivers/net/mlx5/mlx5.h  |  7 +
 drivers/net/mlx5/mlx5_devx.c | 47 +++
 drivers/net/mlx5/mlx5_rx.c   | 33 ++
 drivers/net/mlx5/mlx5_rx.h   |  7 +
 5 files changed, 160 insertions(+)

diff --git a/drivers/net/mlx5/mlx5.c b/drivers/net/mlx5/mlx5.c
index f098871..e04a666 100644
--- a/drivers/net/mlx5/mlx5.c
+++ b/drivers/net/mlx5/mlx5.c
@@ -9,6 +9,7 @@
 #include 
 #include 
 #include 
+#include 
 
 #include 
 #include 
@@ -22,6 +23,7 @@
 #include 
 #include 
 #include 
+#include 
 
 #include 
 #include 
@@ -1525,6 +1527,69 @@ struct mlx5_dev_ctx_shared *
 }
 
 /**
+ * Create LWM event_channel and interrupt handle for shared device
+ * context. All rxqs sharing the device context share the event_channel.
+ * A callback is registered in interrupt thread to receive the LWM event.
+ *
+ * @param[in] priv
+ *   Pointer to mlx5_priv instance.
+ *
+ * @return
+ *   0 on success, negative with rte_errno set.
+ */
+int
+mlx5_lwm_setup(struct mlx5_priv *priv)
+{
+   int fd_lwm;
+
+   pthread_mutex_init(&priv->sh->lwm_config_lock, NULL);
+   priv->sh->devx_channel_lwm = mlx5_os_devx_create_event_channel
+   (priv->sh->cdev->ctx,
+MLX5DV_DEVX_CREATE_EVENT_CHANNEL_FLAGS_OMIT_EV_DATA);
+   if (!priv->sh->devx_channel_lwm)
+   goto err;
+   fd_lwm = mlx5_os_get_devx_channel_fd(priv->sh->devx_channel_lwm);
+   priv->sh->intr_handle_lwm = mlx5_os_interrupt_handler_create
+   (RTE_INTR_INSTANCE_F_SHARED, true,
+fd_lwm, mlx5_dev_interrupt_handler_lwm, priv);
+   if (!priv->sh->intr_handle_lwm)
+   goto err;
+   return 0;
+err:
+   if (priv->sh->devx_channel_lwm) {
+   mlx5_os_devx_destroy_event_channel
+   (priv->sh->devx_channel_lwm);
+   priv->sh->devx_channel_lwm = NULL;
+   }
+   pthread_mutex_destroy(&priv->sh->lwm_config_lock);
+   return -rte_errno;
+}
+
+/**
+ * Destroy LWM event_channel and interrupt handle for shared device
+ * context before free this context. The interrupt handler is also
+ * unregistered.
+ *
+ * @param[in] sh
+ *   Pointer to shared device context.
+ */
+void
+mlx5_lwm_unset(struct mlx5_dev_ctx_shared *sh)
+{
+   if (sh->intr_handle_lwm) {
+   mlx5_os_interrupt_handler_destroy(sh->intr_handle_lwm,
+   mlx5_dev_interrupt_handler_lwm, (void *)-1);
+   sh->intr_handle_lwm = NULL;
+   }
+   if (sh->devx_channel_lwm) {
+   mlx5_os_devx_destroy_event_channel
+   (sh->devx_channel_lwm);
+   sh->devx_channel_lwm = NULL;
+   }
+   pthread_mutex_destroy(&sh->lwm_config_lock);
+}
+
+/**
  * Free shared IB device context. Decrement counter and if zero free
  * all allocated resources and close handles.
  *
@@ -1601,6 +1666,7 @@ struct mlx5_dev_ctx_shared *
claim_zero(mlx5_devx_cmd_destroy(sh->td));
MLX5_ASSERT(sh->geneve_tlv_option_resource == NULL);
pthread_mutex_destroy(&sh->txpp.mutex);
+   mlx5_lwm_unset(sh);
mlx5_free(sh);
return;
 exit:
diff --git a/drivers/net/mlx5/mlx5.h b/drivers/net/mlx5/mlx5.h
index 7ebb2cc..a76f2fe 100644
--- a/drivers/net/mlx5/mlx5.h
+++ b/drivers/net/mlx5/mlx5.h
@@ -1268,6 +1268,9 @@ struct mlx5_dev_ctx_shared {
struct mlx5_lb_ctx self_lb; /* QP to enable self loopback for Devx. */
unsigned int flow_max_priority;
enum modify_reg flow_mreg_c[MLX5_MREG_C_NUM];
+   void *devx_channel_lwm;
+   struct rte_intr_handle *intr_handle_lwm;
+   pthread_mutex_t lwm_config_lock;
/* Availability of mreg_c's. */
struct mlx5_dev_shared_port port[]; /* per device port data array. */
 };
@@ -1405,6 +1408,7 @@ enum mlx5_txq_modify_type {
 };
 
 struct mlx5_rxq_priv;
+struct mlx5_priv;
 
 /* HW objects operations structure. */
 struct mlx5_obj_ops {
@@ -1413,6 +1417,7 @@ struct mlx5_obj_ops {
int (*rxq_event_get)(struct mlx5_rxq_obj *rxq_obj);
int (*rxq_obj_modify)(struct mlx5_rxq_priv *rxq, uint8_t type);
void (*rxq_obj_release)(struct mlx5_rxq_priv *rxq);
+   int (*rxq_event_get_lwm)(struct mlx5_priv *priv, int *rxq_idx, int 
*port_id);
int (*ind_table_new)(struct rte_eth_dev *dev, const unsigned int log_n,
 struct mlx5_ind_table_obj *ind_tbl);
int (*ind_table_modify)(struct rte_eth_dev *dev,
@@ -1603,6 +1608,8 @@ int mlx5_udp_tunnel_port_add(struct rte_eth_dev *dev,
 bool mlx5_is_hpf(struct rte_eth_dev *dev);
 bool mlx5_is_sf_repr

[PATCH v8 4/6] net/mlx5: support Rx queue based available descriptor threshold

2022-06-15 Thread Spike Du
Add mlx5 specific available descriptor threshold configuration
and query handler.
In mlx5 PMD, available descriptor threshold is also called
LWM(limit watermark).
While the Rx queue fullness reaches the LWM limit, the driver catches
an HW event and invokes the user callback.
The query handler finds the next RX queue with pending LWM event
if any, starting from the given RX queue index.

Signed-off-by: Spike Du 
---
 doc/guides/nics/mlx5.rst   |  12 +++
 doc/guides/rel_notes/release_22_07.rst |   1 +
 drivers/common/mlx5/mlx5_prm.h |   1 +
 drivers/net/mlx5/mlx5.c|   2 +
 drivers/net/mlx5/mlx5_rx.c | 151 +
 drivers/net/mlx5/mlx5_rx.h |   5 ++
 6 files changed, 172 insertions(+)

diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst
index d83c56d..cceaddf 100644
--- a/doc/guides/nics/mlx5.rst
+++ b/doc/guides/nics/mlx5.rst
@@ -93,6 +93,7 @@ Features
 - Connection tracking.
 - Sub-Function representors.
 - Sub-Function.
+- Rx queue available descriptor threshold configuration.
 
 
 Limitations
@@ -520,6 +521,9 @@ Limitations
 
 - The NIC egress flow rules on representor port are not supported.
 
+- Available descriptor threshold:
+
+  - Doesn't support shared Rx queue and Hairpin Rx queue.
 
 Statistics
 --
@@ -1680,3 +1684,11 @@ The procedure below is an example of using a ConnectX-5 
adapter card (pf0) with
 #. For each VF PCIe, using the following command to bind the driver::
 
$ echo ":82:00.2" >> /sys/bus/pci/drivers/mlx5_core/bind
+
+Available descriptor threshold introduction
+---
+
+Available descriptor threshold is a per Rx queue attribute, it should be 
configured as
+a percentage of the Rx queue size.
+When Rx queue available descriptors for hardware are below the threshold, an 
event is sent to PMD.
+
diff --git a/doc/guides/rel_notes/release_22_07.rst 
b/doc/guides/rel_notes/release_22_07.rst
index 0ed4f92..46fd73a 100644
--- a/doc/guides/rel_notes/release_22_07.rst
+++ b/doc/guides/rel_notes/release_22_07.rst
@@ -89,6 +89,7 @@ New Features
   * Added support for promiscuous mode on Windows.
   * Added support for MTU on Windows.
   * Added matching and RSS on IPsec ESP.
+  * Added Rx queue available descriptor threshold support.
 
 * **Updated Marvell cnxk crypto driver.**
 
diff --git a/drivers/common/mlx5/mlx5_prm.h b/drivers/common/mlx5/mlx5_prm.h
index 630b2c5..3b5e605 100644
--- a/drivers/common/mlx5/mlx5_prm.h
+++ b/drivers/common/mlx5/mlx5_prm.h
@@ -3293,6 +3293,7 @@ struct mlx5_aso_wqe {
 
 enum {
MLX5_EVENT_TYPE_OBJECT_CHANGE = 0x27,
+   MLX5_EVENT_TYPE_SRQ_LIMIT_REACHED = 0x14,
 };
 
 enum {
diff --git a/drivers/net/mlx5/mlx5.c b/drivers/net/mlx5/mlx5.c
index e04a666..998846a 100644
--- a/drivers/net/mlx5/mlx5.c
+++ b/drivers/net/mlx5/mlx5.c
@@ -2071,6 +2071,8 @@ struct mlx5_dev_ctx_shared *
.dev_supported_ptypes_get = mlx5_dev_supported_ptypes_get,
.vlan_filter_set = mlx5_vlan_filter_set,
.rx_queue_setup = mlx5_rx_queue_setup,
+   .rx_queue_avail_thresh_set = mlx5_rx_queue_lwm_set,
+   .rx_queue_avail_thresh_query = mlx5_rx_queue_lwm_query,
.rx_hairpin_queue_setup = mlx5_rx_hairpin_queue_setup,
.tx_queue_setup = mlx5_tx_queue_setup,
.tx_hairpin_queue_setup = mlx5_tx_hairpin_queue_setup,
diff --git a/drivers/net/mlx5/mlx5_rx.c b/drivers/net/mlx5/mlx5_rx.c
index 197d708..2cb7006 100644
--- a/drivers/net/mlx5/mlx5_rx.c
+++ b/drivers/net/mlx5/mlx5_rx.c
@@ -25,6 +25,7 @@
 #include "mlx5.h"
 #include "mlx5_utils.h"
 #include "mlx5_rxtx.h"
+#include "mlx5_devx.h"
 #include "mlx5_rx.h"
 
 
@@ -128,6 +129,16 @@
return RTE_ETH_RX_DESC_AVAIL;
 }
 
+/* Get rxq lwm percentage according to lwm number. */
+static uint8_t
+mlx5_rxq_lwm_to_percentage(struct mlx5_rxq_priv *rxq)
+{
+   struct mlx5_rxq_data *rxq_data = &rxq->ctrl->rxq;
+   uint32_t wqe_cnt = 1 << (rxq_data->elts_n - rxq_data->sges_n);
+
+   return rxq->lwm * 100 / wqe_cnt;
+}
+
 /**
  * DPDK callback to get the RX queue information.
  *
@@ -150,6 +161,7 @@
 {
struct mlx5_rxq_ctrl *rxq_ctrl = mlx5_rxq_ctrl_get(dev, rx_queue_id);
struct mlx5_rxq_data *rxq = mlx5_rxq_data_get(dev, rx_queue_id);
+   struct mlx5_rxq_priv *rxq_priv = mlx5_rxq_get(dev, rx_queue_id);
 
if (!rxq)
return;
@@ -169,6 +181,8 @@
qinfo->nb_desc = mlx5_rxq_mprq_enabled(rxq) ?
RTE_BIT32(rxq->elts_n) * RTE_BIT32(rxq->log_strd_num) :
RTE_BIT32(rxq->elts_n);
+   qinfo->avail_thresh = rxq_priv ?
+   mlx5_rxq_lwm_to_percentage(rxq_priv) : 0;
 }
 
 /**
@@ -1188,6 +1202,34 @@ int mlx5_get_monitor_addr(void *rx_queue, struct 
rte_power_monitor_cond *pmc)
return -ENOTSUP;
 }
 
+int
+mlx5_rx_queue_lwm_query(struct rte_eth_dev *dev,
+   uint16_t *queue_id, uint8_t *lwm)
+{
+   struct mlx5_priv *priv = dev->dat

[PATCH v8 6/6] app/testpmd: add Host Shaper command

2022-06-15 Thread Spike Du
Add command line options to support host shaper configure.
- Command syntax:
  mlx5 set port  host_shaper avail_thresh_triggered <0|1> rate


- Example commands:
To enable avail_thresh_triggered on port 1 and disable current host
shaper:
testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0

To disable avail_thresh_triggered and current host shaper on port 1:
testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0

The rate unit is 100Mbps.
To disable avail_thresh_triggered and configure a shaper of 5Gbps on
port 1:
testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50

Add sample code to handle rxq available descriptor threshold event, it
delays a while so that rxq empties, then disables host shaper and
rearms available descriptor threshold event.

Signed-off-by: Spike Du 
---
 app/test-pmd/testpmd.c  |   7 ++
 doc/guides/nics/mlx5.rst|  46 +
 drivers/net/mlx5/meson.build|   4 +
 drivers/net/mlx5/mlx5_testpmd.c | 205 
 drivers/net/mlx5/mlx5_testpmd.h |  26 +
 5 files changed, 288 insertions(+)
 create mode 100644 drivers/net/mlx5/mlx5_testpmd.c
 create mode 100644 drivers/net/mlx5/mlx5_testpmd.h

diff --git a/app/test-pmd/testpmd.c b/app/test-pmd/testpmd.c
index 33d9b85..b491719 100644
--- a/app/test-pmd/testpmd.c
+++ b/app/test-pmd/testpmd.c
@@ -69,6 +69,9 @@
 #ifdef RTE_NET_BOND
 #include 
 #endif
+#ifdef RTE_NET_MLX5
+#include "mlx5_testpmd.h"
+#endif
 
 #include "testpmd.h"
 
@@ -3659,6 +3662,10 @@ struct pmd_test_command {
break;
printf("Received avail_thresh event, port:%d 
rxq_id:%d\n",
   port_id, rxq_id);
+
+#ifdef RTE_NET_MLX5
+   mlx5_test_avail_thresh_event_handler(port_id, rxq_id);
+#endif
}
break;
default:
diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst
index 5f7b060..64eaddf 100644
--- a/doc/guides/nics/mlx5.rst
+++ b/doc/guides/nics/mlx5.rst
@@ -1727,3 +1727,49 @@ which can be installed from OFED mstflint package.
 Meson detects ``libmtcr_ul`` existence at configure stage.
 If the library is detected, the application must link with ``-lmtcr_ul``,
 as done by the pkg-config file libdpdk.pc.
+
+How to use available descriptor threshold and Host Shaper
+-
+
+There is a command to configure available descriptor threshold in testpmd.
+Testpmd also contains sample logic to handle available descriptor threshold 
event.
+The typical workflow is: testpmd configure available descriptor threshold for 
Rx queues, enable
+avail_thresh_triggered in host shaper and register a callback, when traffic 
from host is
+too high and Rx queue emptiness is below available descriptor threshold, PMD 
receives an event and
+firmware configures a 100Mbps shaper on host port automatically, then PMD call
+the callback registered previously, which will delay a while to let Rx queue
+empty, then disable host shaper.
+
+Let's assume we have a simple BlueField 2 setup: port 0 is uplink, port 1
+is VF representor. Each port has 2 Rx queues.
+In order to control traffic from host to ARM, we can enable available 
descriptor threshold in testpmd by:
+
+.. code-block:: console
+
+   testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0
+   testpmd> set port 1 rxq 0 avail_thresh 70
+   testpmd> set port 1 rxq 1 avail_thresh 70
+
+The first command disables current host shaper, and enables available 
descriptor threshold triggered mode.
+The other commands configure available descriptor threshold to 70% of Rx queue 
size for both Rx queues,
+When traffic from host is too high, you can see testpmd console prints log
+about available descriptor threshold event receiving, then host shaper is 
disabled.
+The traffic rate from host is controlled and less drop happens in Rx queues.
+
+The threshold event and shaper can be disabled like this:
+
+.. code-block:: console
+
+   testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0
+   testpmd> set port 1 rxq 0 avail_thresh 0
+   testpmd> set port 1 rxq 1 avail_thresh 0
+
+It's recommended an application disables available descriptor threshold and 
avail_thresh_triggered before exit,
+if it enables them before.
+
+We can also configure the shaper with a value, the rate unit is 100Mbps, below
+command sets current shaper to 5Gbps and disables avail_thresh_triggered.
+
+.. code-block:: console
+
+   testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50
diff --git a/drivers/net/mlx5/meson.build b/drivers/net/mlx5/meson.build
index 99210fd..941642b 100644
--- a/drivers/net/mlx5/meson.build
+++ b/drivers/net/mlx5/meson.build
@@ -68,4 +68,8 @@ if get_option('buildtype').contains('debug')
 else
 cflags += [ '-UPEDANTIC' ]
 endif
+
+testpmd_sources += files('mlx5_testpmd.c')
+testpmd_drivers_deps += 'net_mlx5'
+
 subdir(ex

[PATCH] crypto/scheduler: fix qp_id in scheduler failover

2022-06-15 Thread Jakub Wysocki
This commit fixes wrong qp_id value in cryptodev scheduler in failover
mode.

Fixes: 37f075dad1e9 ("crypto/scheduler: add fail-over scheduling mode")
Cc: roy.fan.zh...@intel.com

Signed-off-by: Jakub Wysocki 
---
 drivers/crypto/scheduler/scheduler_failover.c | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/drivers/crypto/scheduler/scheduler_failover.c 
b/drivers/crypto/scheduler/scheduler_failover.c
index 5023577ef8..247aeb2f22 100644
--- a/drivers/crypto/scheduler/scheduler_failover.c
+++ b/drivers/crypto/scheduler/scheduler_failover.c
@@ -157,6 +157,9 @@ scheduler_start(struct rte_cryptodev *dev)
((struct scheduler_qp_ctx *)
dev->data->queue_pairs[i])->private_qp_ctx;
 
+sched_ctx->workers[PRIMARY_WORKER_IDX].qp_id = i;
+sched_ctx->workers[SECONDARY_WORKER_IDX].qp_id = i;
+
rte_memcpy(&qp_ctx->primary_worker,
&sched_ctx->workers[PRIMARY_WORKER_IDX],
sizeof(struct scheduler_worker));
-- 
2.17.1

--
Intel Research and Development Ireland Limited
Registered in Ireland
Registered Office: Collinstown Industrial Park, Leixlip, County Kildare
Registered Number: 308263


This e-mail and any attachments may contain confidential material for the sole
use of the intended recipient(s). Any review or distribution by others is
strictly prohibited. If you are not the intended recipient, please contact the
sender and delete all copies.



RE: [PATCH 1/5] usertools: use non-strict when json-loads in telemetry

2022-06-15 Thread Morten Brørup
> From: Chengwen Feng [mailto:fengcheng...@huawei.com]
> Sent: Wednesday, 15 June 2022 09.39
> 
> Use 'strict=False' in json-loads, it will ignore control characters
> (e.g. '\n\t'), this patch is prepared for the support of telemetry dump
> in the future.
> 
> Signed-off-by: Chengwen Feng 
> ---
>  usertools/dpdk-telemetry.py | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/usertools/dpdk-telemetry.py b/usertools/dpdk-telemetry.py
> index a81868a547..63f8004566 100755
> --- a/usertools/dpdk-telemetry.py
> +++ b/usertools/dpdk-telemetry.py
> @@ -27,7 +27,7 @@ def read_socket(sock, buf_len, echo=True):
>  """ Read data from socket and return it in JSON format """
>  reply = sock.recv(buf_len).decode()
>  try:
> -ret = json.loads(reply)
> +ret = json.loads(reply, strict=False)
>  except json.JSONDecodeError:
>  print("Error in reply: ", reply)
>  sock.close()
> --
> 2.33.0
> 

As I understand this patch, it accepts non-JSON data from the telemetry socket.

Isn't it is a major protocol violation if the telemetry socket produces output 
requiring this modification? Doing that would break other applications 
expecting strictly JSON formatted output from the telemetry socket.



[dpdk-dev] [PATCH v2] common/cnxk: add ROC API to free MCAM entry

2022-06-15 Thread psatheesh
From: Satheesh Paul 

Add ROC API to free the given MCAM entry. If the MCAM
entry has flow counter associated, this API will clear
and free the flow counter.

Signed-off-by: Satheesh Paul 
Reviewed-by: Jerin Jacob 
---
v2:
* Removed unnecessary variable initialization

 drivers/common/cnxk/roc_npc.c   | 63 +++--
 drivers/common/cnxk/roc_npc.h   |  7 
 drivers/common/cnxk/roc_npc_mcam.c  | 39 ++
 drivers/common/cnxk/roc_npc_priv.h  |  2 +
 drivers/common/cnxk/roc_npc_utils.c | 15 +++
 drivers/common/cnxk/version.map |  3 ++
 6 files changed, 100 insertions(+), 29 deletions(-)

diff --git a/drivers/common/cnxk/roc_npc.c b/drivers/common/cnxk/roc_npc.c
index da5b96242e..b38389b18a 100644
--- a/drivers/common/cnxk/roc_npc.c
+++ b/drivers/common/cnxk/roc_npc.c
@@ -55,6 +55,53 @@ roc_npc_mcam_free_entry(struct roc_npc *roc_npc, uint32_t 
entry)
return npc_mcam_free_entry(npc, entry);
 }
 
+int
+roc_npc_mcam_free(struct roc_npc *roc_npc, struct roc_npc_flow *mcam)
+{
+   int rc = 0;
+
+   if (mcam->use_ctr) {
+   rc = roc_npc_mcam_clear_counter(roc_npc, mcam->ctr_id);
+   if (rc)
+   return rc;
+
+   rc = roc_npc_mcam_free_counter(roc_npc, mcam->ctr_id);
+   if (rc)
+   return rc;
+   }
+
+   return roc_npc_mcam_free_entry(roc_npc, mcam->mcam_id);
+}
+
+int
+roc_npc_mcam_init(struct roc_npc *roc_npc, struct roc_npc_flow *flow,
+ int mcam_id)
+{
+   struct npc *npc = roc_npc_to_npc_priv(roc_npc);
+   int rc;
+
+   rc = npc_mcam_init(npc, flow, mcam_id);
+   if (rc != 0) {
+   plt_err("npc: mcam initialisation write failed");
+   return rc;
+   }
+   return 0;
+}
+
+int
+roc_npc_mcam_move(struct roc_npc *roc_npc, uint16_t old_ent, uint16_t new_ent)
+{
+   struct npc *npc = roc_npc_to_npc_priv(roc_npc);
+   struct mbox *mbox = npc->mbox;
+   int rc;
+
+   rc = npc_mcam_move(mbox, old_ent, new_ent);
+   if (rc)
+   return rc;
+
+   return 0;
+}
+
 int
 roc_npc_mcam_free_all_resources(struct roc_npc *roc_npc)
 {
@@ -383,7 +430,7 @@ npc_parse_actions(struct roc_npc *roc_npc, const struct 
roc_npc_attr *attr,
 
case ROC_NPC_ACTION_TYPE_COUNT:
/* Indicates, need a counter */
-   flow->ctr_id = 1;
+   flow->use_ctr = 1;
req_act |= ROC_NPC_ACTION_TYPE_COUNT;
break;
 
@@ -1268,7 +1315,7 @@ roc_npc_flow_create(struct roc_npc *roc_npc, const struct 
roc_npc_attr *attr,
return flow;
 
 set_rss_failed:
-   rc = npc_mcam_free_entry(npc, flow->mcam_id);
+   rc = roc_npc_mcam_free_entry(roc_npc, flow->mcam_id);
if (rc != 0) {
*errcode = rc;
plt_free(flow);
@@ -1314,17 +1361,7 @@ roc_npc_flow_destroy(struct roc_npc *roc_npc, struct 
roc_npc_flow *flow)
return rc;
}
 
-   if (flow->ctr_id != NPC_COUNTER_NONE) {
-   rc = roc_npc_mcam_clear_counter(roc_npc, flow->ctr_id);
-   if (rc != 0)
-   return rc;
-
-   rc = npc_mcam_free_counter(npc, flow->ctr_id);
-   if (rc != 0)
-   return rc;
-   }
-
-   rc = npc_mcam_free_entry(npc, flow->mcam_id);
+   rc = roc_npc_mcam_free(roc_npc, flow);
if (rc != 0)
return rc;
 
diff --git a/drivers/common/cnxk/roc_npc.h b/drivers/common/cnxk/roc_npc.h
index f92c2a633c..1b4e5521cb 100644
--- a/drivers/common/cnxk/roc_npc.h
+++ b/drivers/common/cnxk/roc_npc.h
@@ -246,6 +246,7 @@ struct roc_npc_flow {
uint8_t nix_intf;
uint8_t enable;
uint32_t mcam_id;
+   uint8_t use_ctr;
int32_t ctr_id;
uint32_t priority;
uint32_t mtr_id;
@@ -329,6 +330,8 @@ roc_npc_flow_create(struct roc_npc *roc_npc, const struct 
roc_npc_attr *attr,
const struct roc_npc_action actions[], int *errcode);
 int __roc_api roc_npc_flow_destroy(struct roc_npc *roc_npc,
   struct roc_npc_flow *flow);
+int __roc_api roc_npc_mcam_free(struct roc_npc *roc_npc,
+   struct roc_npc_flow *mcam);
 int __roc_api roc_npc_mcam_free_entry(struct roc_npc *roc_npc, uint32_t entry);
 int __roc_api roc_npc_mcam_enable_all_entries(struct roc_npc *roc_npc,
  bool enable);
@@ -367,4 +370,8 @@ int __roc_api roc_npc_mcam_merge_base_steering_rule(struct 
roc_npc *roc_npc,
struct roc_npc_flow *flow);
 int __roc_api roc_npc_validate_portid_action(struct roc_npc *roc_npc_src,
 struct roc_npc *roc_npc_dst);
+int __roc_api roc_npc_mcam_init(struct roc_npc *roc_npc,
+  

Re: [PATCH v2 3/3] test: support trace-autotest when enable trace

2022-06-15 Thread David Marchand
On Tue, Jun 14, 2022 at 8:06 AM Chengwen Feng  wrote:
>
> There are a bug[1] when exit application while enable tracing, this
> bug has not been discovered for a long time, to quickly detect such
> bugs, this patch was introduced.
>
> This patch adds a testcase with trace enabling, it also depends on
> patch[2] because it has a long file-prefix.
>
> [1] eal: fix segment fault when exit trace
> [2] eal: fix trace init fail with long file-prefix

This commitlog feels more like a cover letter.

Please describe what the impact of the patch is, like mention that the
trace_autotest unit test is being called twice, once with traces
disabled, and once with traces enabled.
And that the traces file is being written in a directory part of the
build directory.


>
> Signed-off-by: Chengwen Feng 
> ---
>  app/test/meson.build | 11 +++
>  1 file changed, 11 insertions(+)
>
> diff --git a/app/test/meson.build b/app/test/meson.build
> index 7fe261cae8..eb37aa632a 100644
> --- a/app/test/meson.build
> +++ b/app/test/meson.build
> @@ -509,6 +509,17 @@ foreach arg : fast_tests
>  is_parallel : false,
>  suite : 'fast-tests')
>  endif
> +
> +if run_test and arg[0] == 'trace_autotest' and (not is_windows)

Calling this test on Windows should not be an issue.
Is it not the case?


> +test_args += ['--trace=.*']
> +test_args += ['--trace-dir=@0@'.format(meson.current_build_dir())]
> +test(arg[0], dpdk_test,

58/68 DPDK:fast-tests / trace_autotest   OK  0.16s
59/68 DPDK:fast-tests / trace_autotest   OK  0.16s

By using the same test name, it's hard to tell what the difference is
between those two lines.
And practically, as a developer reproducing/troubleshooting a test
failure, you can't call only one of the test case.

I am not sure what could be done to enhance this.., how about using a
dedicated test name?
Like below snippet, replacing the whole proposed patch:

@@ -508,6 +508,16 @@ foreach arg : fast_tests
 timeout : timeout_seconds_fast,
 is_parallel : false,
 suite : 'fast-tests')
+if arg[0] == 'trace_autotest'
+test_args += ['--trace=.*']
+test_args += ['--trace-dir=@0@'.format(meson.current_build_dir())]
+test(arg[0] + '_with_traces', dpdk_test,
+env : ['DPDK_TEST=' + arg[0]],
+args : test_args,
+timeout : timeout_seconds_fast,
+is_parallel : false,
+suite : 'fast-tests')
+endif
 endif
 endforeach


-- 
David Marchand



Re: [PATCH v2 2/3] eal: fix trace init fail with long file-prefix

2022-06-15 Thread David Marchand
On Tue, Jun 14, 2022 at 8:06 AM Chengwen Feng  wrote:
>
> Bug scenario:
> 1. start testpmd:
> dpdk-testpmd -l 4-6 -a :7d:00.0 --trace=.* \
> -file-prefix=trace_autotest -- -i
> 2. then observed:
> EAL: eal_trace_init():94 failed to initialize trace [File exists]

This is not directly related to the issue being fixed, but this error
log is really obscure..
Can we enhance this, maybe in a followup patch?


> EAL: FATAL: Cannot init trace
> EAL: Cannot init trace
> EAL: Error - exiting with code: 1
>
> The root cause it that the offset set wrong with long file-prefix and
> then lead the strftime return failed.
>
> Fixes: 321dd5f8fa62 ("trace: add internal init and fini interface")
> Cc: sta...@dpdk.org
>
> Signed-off-by: Chengwen Feng 

Reviewed-by: David Marchand 


And another topic, I see no reason for the limitation/truncation on
the file prefix.
Testing with the last patch of the series, I can see odd directory, like
EAL: Trace dir:
/home/dmarchan/builds/main/build-gcc-shared/app/test/trace_autot-2022-06-15-PM-03-18-39
I'll send a cleanup series for this, later.


-- 
David Marchand



RE: [EXT] [PATCH v7 0/6] drivers/baseband: PMD to support ACC100/ACC101 devices

2022-06-15 Thread Akhil Goyal
> v7: updates suggested by Akhil and Maxime. Removed RTE prefix for internal
> #define, removed
> build-time single FEC engine configuration mode, documentation fix, coding
> guideline fix, renamed new file without rte prefix.
> 
> v6: use formally an unified driver for AC100 and AC101. Exposing single 
> unified
> function for the PF device configuration used by bbdev-test.
> 
> v5: minor fix to doc and intermediate commit caught in CI
> 
> v4: incremental updates based on reviews Changed order of patchset to include
> two ACC100 specific commits first: updating formally the companion configure
> function from PF and the protection adjustment.
> Then next 3 commits for the actual changes specific to ACC101:  the device
> specific function for FCW 5GUL generation used as a local function, combined
> documentations into a single .rst common to both devices, updated comments,
> removed unused #defines.
> 
> v3: Feedback from Tom Rix: missing copyright, refactor bbdev-test section
> calling the configure companion function for ACC100/101, taking the Pmon
> commit out which is not directly required.
> 
> v2: Based on good feedback from Thomas and David, now implementing the
> ACC101 PMD as a close derivative from existing ACC100 PMD with hooks to
> have different behaviour based on variant.
> This prevents code duplication and only rely on different functions and
> behaviour when hardware requires or support differences.
> Note that these are pending changes for ACC100 which would be specific to that
> device and not ACC101 but these can be managed based on the new
> implementation, ie. is_acc100() etc... (such incremental changes for ACC100
> trending 22.11 but confirming this is future proof).
> The serie also includes commits which were meant for ACC101 but are also
> valuable for ACC100.
> 
> v1: This serie introduces the PMD for the new bbdev device ACC101 (aka Mount
> Cirrus).
> This is a derivative from previous Mount Bryce ACC100 which includes silicon
> improvement, bug fixes, capacity improvement for 5GNR and feature
> improvement.
> 
> 
> Nicolas Chautru (6):
>   baseband/acc100: update companion PF configure function
>   baseband/acc100: add protection for some negative scenario
>   baseband/acc100: remove RTE prefix for internal macro
>   baseband/acc100: introduce PMD for ACC101
>   baseband/acc100: modify validation code for ACC101
>   baseband/acc100: configuration of ACC101 from PF
> 
>  MAINTAINERS  |   1 +
>  app/test-bbdev/test_bbdev_perf.c |   6 +-
>  doc/guides/bbdevs/acc100.rst |  37 +-
>  doc/guides/bbdevs/features/acc101.ini|  13 +
>  doc/guides/rel_notes/release_22_07.rst   |   3 +
>  drivers/baseband/acc100/acc100_pf_enum.h |  18 +
>  drivers/baseband/acc100/acc101_pmd.h |  50 +++
>  drivers/baseband/acc100/rte_acc100_cfg.h |   4 +-
>  drivers/baseband/acc100/rte_acc100_pmd.c | 687
> ---
>  drivers/baseband/acc100/rte_acc100_pmd.h |  37 +-
>  drivers/baseband/acc100/version.map  |   3 +-
>  11 files changed, 768 insertions(+), 91 deletions(-)
>  create mode 100644 doc/guides/bbdevs/features/acc101.ini
>  create mode 100644 drivers/baseband/acc100/acc101_pmd.h

Series applied to dpdk-next-crypto

Thanks.


Re: [PATCH 0/3] bugfix and feature for skeleton DMA

2022-06-15 Thread Thomas Monjalon
08/06/2022 10:50, Chengwen Feng:
> This patch set contains three patches for skeleton DMA.
> 
> Chengwen Feng (2):
>   dma/skeleton: fix return last-idx when no memcopy completed
>   test/dma: reset last-idx before invoke DMA completed ops
> 
> Sivaprasad Tummala (1):
>   dma/skeleton: support multiple instances

Applied, thanks.





Re: [PATCH] dma/hisilicon: fix header file self contained

2022-06-15 Thread Thomas Monjalon
14/06/2022 11:33, Chengwen Feng:
> From: "Min Hu (Connor)" 
> 
> Header files should be self contained. This patch fixed it.
> 
> Fixes: 9e16317a38f9 ("dma/hisilicon: add probing")
> Cc: sta...@dpdk.org
> 
> Signed-off-by: Min Hu (Connor) 
> Signed-off-by: Chengwen Feng 

Applied, thanks.




Re: [dpdk-dev] [RFC PATCH v2] raw/ptdma: introduce ptdma driver

2022-06-15 Thread Thomas Monjalon
28/10/2021 16:54, Sebastian, Selwin:
> I am working on making ptdma driver as a dmadev.  Will submit new patch for 
> review.

Any update?





RE: [Bug 1035] __rte_raw_cksum() crash with misaligned pointer

2022-06-15 Thread Morten Brørup
> From: bugzi...@dpdk.org [mailto:bugzi...@dpdk.org]
> Sent: Wednesday, 15 June 2022 09.16
> 
> https://bugs.dpdk.org/show_bug.cgi?id=1035
> 
> Bug ID: 1035
>Summary: __rte_raw_cksum() crash with misaligned pointer
>Product: DPDK
>Version: 21.11
>   Hardware: All
> OS: All
> Status: UNCONFIRMED
>   Severity: normal
>   Priority: Normal
>  Component: ethdev
>   Assignee: dev@dpdk.org
>   Reporter: emil.b...@ericsson.com
>   Target Milestone: ---
> 
> See rte_raw_cksum() in rte_ip.h, which is part of the public API. See
> also the
> subfunction __rte_raw_cksum().
> 
> _rte_raw_cksum assumes that the buffer over which the checksum is
> calculated is
> an even address (divisible by two). See for example this stack overflow
> post:
> https://stackoverflow.com/questions/46790550/c-undefined-behavior-
> strict-aliasing-rule-or-incorrect-alignment
> 
> The post explains that there is undefined behavior in C11 when
> "conversion
> between two pointer types produces a result that is incorrectly
> aligned". When
> the buf argument starts on an odd address we thus have undefined
> behavior,
> since a pointer is cast from void* to uint16_t*.
> 
> In most cases (at least on x86) that isn't a problem, but with higher
> optimization levels it may break due to vector instructions. This new
> function
> seems to be easier to optimize by the compiler, resulting in a crash
> when the
> buf argument is odd. Please note that the undefined behavior is present
> in
> earlier versions of dpdk as well.
> 
> Now you're probably thinking: "Just align your buffers". The problem is
> that we
> have a packet buffer which is aligned. The checksum is calculated on a
> subset
> of that aligned packet buffer, and that sometimes lies on odd
> addresses.
> 
> The question remains if this is an issue with dpdk or not.

I can imagine other systems doing what you describe too. So it needs to be 
addressed.

Off the top of my head, an easy fix would be updating __rte_raw_cksum() like 
this:

static inline uint32_t
__rte_raw_cksum(const void *buf, size_t len, uint32_t sum)
{
if (likely((buf & 1) == 0)) {
/* The buffer is 16 bit aligned. */
Keep the existing, optimized implementation here.
} else {
/* The buffer is not 16 bit aligned. */
Add a new odd-buf tolerant implementation here.
}
}

However, I'm not sure that it covers your scenario!

The checksum is 16 bit wide, so if you calculate the checksum of e.g. 4 bytes 
of memory starting at offset 1 in a 6 byte packet buffer, the memory block can 
be treated as either 4 or 6 bytes relative to the data covered by the checksum, 
i.e.:

A: XX [01 02] [03 04] XX --> cksum = [04 06]

B: [XX 01] [02 03] [04 XX] --> cksum = [06 04]

Which one do you need?

Perhaps an additional function is required to support your use case, and the 
documentation for rte_raw_cksum() and __rte_raw_cksum() needs to reflect that 
the buffer must be 16 bit aligned.

Or the rte_raw_cksum() function can be modified to support an odd buffer 
pointer as outlined above, with documentation added about alignment of the 
running checksum.



[PATCH v9 1/6] net/mlx5: add LWM support for Rxq

2022-06-15 Thread Spike Du
Add lwm(Limit WaterMark) field to Rxq object which indicates the percentage
of RX queue size used by HW to raise LWM event to the user.
Allow LWM setting in modify_rq command.
Allow the LWM configuration dynamically by adding RDY2RDY state change.

Signed-off-by: Spike Du 
---
 drivers/net/mlx5/mlx5.h  |  1 +
 drivers/net/mlx5/mlx5_devx.c | 13 -
 drivers/net/mlx5/mlx5_devx.h |  1 +
 drivers/net/mlx5/mlx5_rx.h   |  1 +
 4 files changed, 15 insertions(+), 1 deletion(-)

diff --git a/drivers/net/mlx5/mlx5.h b/drivers/net/mlx5/mlx5.h
index ef755ee..305edff 100644
--- a/drivers/net/mlx5/mlx5.h
+++ b/drivers/net/mlx5/mlx5.h
@@ -1395,6 +1395,7 @@ enum mlx5_rxq_modify_type {
MLX5_RXQ_MOD_RST2RDY, /* modify state from reset to ready. */
MLX5_RXQ_MOD_RDY2ERR, /* modify state from ready to error. */
MLX5_RXQ_MOD_RDY2RST, /* modify state from ready to reset. */
+   MLX5_RXQ_MOD_RDY2RDY, /* modify state from ready to ready. */
 };
 
 enum mlx5_txq_modify_type {
diff --git a/drivers/net/mlx5/mlx5_devx.c b/drivers/net/mlx5/mlx5_devx.c
index 4b48f94..c918a50 100644
--- a/drivers/net/mlx5/mlx5_devx.c
+++ b/drivers/net/mlx5/mlx5_devx.c
@@ -62,7 +62,7 @@
  * @return
  *   0 on success, a negative errno value otherwise and rte_errno is set.
  */
-static int
+int
 mlx5_devx_modify_rq(struct mlx5_rxq_priv *rxq, uint8_t type)
 {
struct mlx5_devx_modify_rq_attr rq_attr;
@@ -76,6 +76,11 @@
case MLX5_RXQ_MOD_RST2RDY:
rq_attr.rq_state = MLX5_RQC_STATE_RST;
rq_attr.state = MLX5_RQC_STATE_RDY;
+   if (rxq->lwm) {
+   rq_attr.modify_bitmask |=
+   MLX5_MODIFY_RQ_IN_MODIFY_BITMASK_WQ_LWM;
+   rq_attr.lwm = rxq->lwm;
+   }
break;
case MLX5_RXQ_MOD_RDY2ERR:
rq_attr.rq_state = MLX5_RQC_STATE_RDY;
@@ -85,6 +90,12 @@
rq_attr.rq_state = MLX5_RQC_STATE_RDY;
rq_attr.state = MLX5_RQC_STATE_RST;
break;
+   case MLX5_RXQ_MOD_RDY2RDY:
+   rq_attr.rq_state = MLX5_RQC_STATE_RDY;
+   rq_attr.state = MLX5_RQC_STATE_RDY;
+   rq_attr.modify_bitmask |= 
MLX5_MODIFY_RQ_IN_MODIFY_BITMASK_WQ_LWM;
+   rq_attr.lwm = rxq->lwm;
+   break;
default:
break;
}
diff --git a/drivers/net/mlx5/mlx5_devx.h b/drivers/net/mlx5/mlx5_devx.h
index a95207a..ebd1da4 100644
--- a/drivers/net/mlx5/mlx5_devx.h
+++ b/drivers/net/mlx5/mlx5_devx.h
@@ -11,6 +11,7 @@
 int mlx5_txq_devx_modify(struct mlx5_txq_obj *obj,
 enum mlx5_txq_modify_type type, uint8_t dev_port);
 void mlx5_txq_devx_obj_release(struct mlx5_txq_obj *txq_obj);
+int mlx5_devx_modify_rq(struct mlx5_rxq_priv *rxq, uint8_t type);
 
 extern struct mlx5_obj_ops devx_obj_ops;
 
diff --git a/drivers/net/mlx5/mlx5_rx.h b/drivers/net/mlx5/mlx5_rx.h
index e715ed6..25a5f2c 100644
--- a/drivers/net/mlx5/mlx5_rx.h
+++ b/drivers/net/mlx5/mlx5_rx.h
@@ -175,6 +175,7 @@ struct mlx5_rxq_priv {
struct mlx5_devx_rq devx_rq;
struct rte_eth_hairpin_conf hairpin_conf; /* Hairpin configuration. */
uint32_t hairpin_status; /* Hairpin binding status. */
+   uint32_t lwm:16;
 };
 
 /* External RX queue descriptor. */
-- 
1.8.3.1



[PATCH v9 0/6] introduce per-queue available descriptor threshold and host shaper

2022-06-15 Thread Spike Du
available descriptor threshold(ADT for short) is per RX queue attribute, when 
RX queue available descriptors for HW is below ADT, HW sends an event to 
application.
Host shaper can configure shaper rate and avail_thresh-triggered for a host 
port.
The shaper limits the rate of traffic from host port to embedded ARM rx port on 
Nvidia BlueField 2 NIC.
If avail_thresh-triggered is enabled, a 100Mbps shaper is enabled automatically 
when one of the host port's Rx queues receives available descriptor threshold 
event.

These two features can combine to control traffic from host port to wire port 
for BlueField 2 NIC.
The traffic flows from host to embedded ARM, then to the physical port.
The work flow is on the ARM system, configure available descriptor threshold to 
RX queue and enable avail_thresh-triggered flag in host shaper, after receiving 
available descriptor threshold event, delay a while until RX queue is empty , 
then disable the shaper. We recycle this work flow to reduce RX queue drops on 
ARM system.

Add new libethdev API to set available descriptor threshold, add rte event 
RTE_ETH_EVENT_RX_AVAIL_THRESH to handle available descriptor threshold event. 
For host shaper, because it doesn't align to existing DPDK framework and is 
specific to Nvidia NIC, use PMD private API.

For integration with testpmd, put the private cmdline function and available 
descriptor threshold event handler in mlx5 PMD directory by adding a new file 
mlx5_testpmd.c. Follow David Marchand's driver specific commands framework to 
add mlx5 specific commands.

Spike Du (6):
  net/mlx5: add LWM support for Rxq
  common/mlx5: share interrupt management
  net/mlx5: add LWM event handling support
  net/mlx5: support Rx queue based available descriptor threshold
  net/mlx5: add private API to config host port shaper
  app/testpmd: add Host Shaper command

 app/test-pmd/testpmd.c   |   7 +
 doc/guides/nics/mlx5.rst |  93 +
 doc/guides/rel_notes/release_22_07.rst   |   2 +
 drivers/common/mlx5/linux/meson.build|  13 ++
 drivers/common/mlx5/linux/mlx5_common_os.c   | 131 
 drivers/common/mlx5/linux/mlx5_common_os.h   |  11 +
 drivers/common/mlx5/mlx5_prm.h   |  26 +++
 drivers/common/mlx5/version.map  |   2 +
 drivers/common/mlx5/windows/mlx5_common_os.h |  24 +++
 drivers/net/mlx5/linux/mlx5_ethdev_os.c  |  71 ---
 drivers/net/mlx5/linux/mlx5_os.c | 132 +++-
 drivers/net/mlx5/linux/mlx5_socket.c |  53 +
 drivers/net/mlx5/meson.build |   4 +
 drivers/net/mlx5/mlx5.c  |  68 +++
 drivers/net/mlx5/mlx5.h  |  12 +-
 drivers/net/mlx5/mlx5_devx.c |  60 +-
 drivers/net/mlx5/mlx5_devx.h |   1 +
 drivers/net/mlx5/mlx5_rx.c   | 288 +++
 drivers/net/mlx5/mlx5_rx.h   |  13 ++
 drivers/net/mlx5/mlx5_testpmd.c  | 205 +++
 drivers/net/mlx5/mlx5_testpmd.h  |  26 +++
 drivers/net/mlx5/mlx5_txpp.c |  28 +--
 drivers/net/mlx5/rte_pmd_mlx5.h  |  30 +++
 drivers/net/mlx5/version.map |   2 +
 drivers/net/mlx5/windows/mlx5_ethdev_os.c|  22 --
 drivers/vdpa/mlx5/mlx5_vdpa_virtq.c  |  48 +
 26 files changed, 1064 insertions(+), 308 deletions(-)
 create mode 100644 drivers/net/mlx5/mlx5_testpmd.c
 create mode 100644 drivers/net/mlx5/mlx5_testpmd.h

-- 
1.8.3.1



[PATCH v9 2/6] common/mlx5: share interrupt management

2022-06-15 Thread Spike Du
There are many duplicate code of creating and initializing rte_intr_handle.
Add a new mlx5_os API to do this, replace all PMD related code with this
API.

Signed-off-by: Spike Du 
---
 drivers/common/mlx5/linux/mlx5_common_os.c   | 131 ++
 drivers/common/mlx5/linux/mlx5_common_os.h   |  11 +++
 drivers/common/mlx5/version.map  |   2 +
 drivers/common/mlx5/windows/mlx5_common_os.h |  24 +
 drivers/net/mlx5/linux/mlx5_ethdev_os.c  |  71 --
 drivers/net/mlx5/linux/mlx5_os.c | 132 ++-
 drivers/net/mlx5/linux/mlx5_socket.c |  53 ++-
 drivers/net/mlx5/mlx5.h  |   2 -
 drivers/net/mlx5/mlx5_txpp.c |  28 ++
 drivers/net/mlx5/windows/mlx5_ethdev_os.c|  22 -
 drivers/vdpa/mlx5/mlx5_vdpa_virtq.c  |  48 ++
 11 files changed, 217 insertions(+), 307 deletions(-)

diff --git a/drivers/common/mlx5/linux/mlx5_common_os.c 
b/drivers/common/mlx5/linux/mlx5_common_os.c
index d40cfd5..f10a981 100644
--- a/drivers/common/mlx5/linux/mlx5_common_os.c
+++ b/drivers/common/mlx5/linux/mlx5_common_os.c
@@ -11,6 +11,7 @@
 #endif
 #include 
 #include 
+#include 
 
 #include 
 #include 
@@ -964,3 +965,133 @@
claim_zero(mlx5_glue->dereg_mr(pmd_mr->obj));
memset(pmd_mr, 0, sizeof(*pmd_mr));
 }
+
+/**
+ * Rte_intr_handle create and init helper.
+ *
+ * @param[in] mode
+ *   interrupt instance can be shared between primary and secondary
+ *   processes or not.
+ * @param[in] set_fd_nonblock
+ *   Whether to set fd to O_NONBLOCK.
+ * @param[in] fd
+ *   Fd to set in created intr_handle.
+ * @param[in] cb
+ *   Callback to register for intr_handle.
+ * @param[in] cb_arg
+ *   Callback argument for cb.
+ *
+ * @return
+ *  - Interrupt handle on success.
+ *  - NULL on failure, with rte_errno set.
+ */
+struct rte_intr_handle *
+mlx5_os_interrupt_handler_create(int mode, bool set_fd_nonblock, int fd,
+rte_intr_callback_fn cb, void *cb_arg)
+{
+   struct rte_intr_handle *tmp_intr_handle;
+   int ret, flags;
+
+   tmp_intr_handle = rte_intr_instance_alloc(mode);
+   if (!tmp_intr_handle) {
+   rte_errno = ENOMEM;
+   goto err;
+   }
+   if (set_fd_nonblock) {
+   flags = fcntl(fd, F_GETFL);
+   ret = fcntl(fd, F_SETFL, flags | O_NONBLOCK);
+   if (ret) {
+   rte_errno = errno;
+   goto err;
+   }
+   }
+   ret = rte_intr_fd_set(tmp_intr_handle, fd);
+   if (ret)
+   goto err;
+   ret = rte_intr_type_set(tmp_intr_handle, RTE_INTR_HANDLE_EXT);
+   if (ret)
+   goto err;
+   ret = rte_intr_callback_register(tmp_intr_handle, cb, cb_arg);
+   if (ret) {
+   rte_errno = -ret;
+   goto err;
+   }
+   return tmp_intr_handle;
+err:
+   if (tmp_intr_handle)
+   rte_intr_instance_free(tmp_intr_handle);
+   return NULL;
+}
+
+/* Safe unregistration for interrupt callback. */
+static void
+mlx5_intr_callback_unregister(const struct rte_intr_handle *handle,
+ rte_intr_callback_fn cb_fn, void *cb_arg)
+{
+   uint64_t twait = 0;
+   uint64_t start = 0;
+
+   do {
+   int ret;
+
+   ret = rte_intr_callback_unregister(handle, cb_fn, cb_arg);
+   if (ret >= 0)
+   return;
+   if (ret != -EAGAIN) {
+   DRV_LOG(INFO, "failed to unregister interrupt"
+ " handler (error: %d)", ret);
+   MLX5_ASSERT(false);
+   return;
+   }
+   if (twait) {
+   struct timespec onems;
+
+   /* Wait one millisecond and try again. */
+   onems.tv_sec = 0;
+   onems.tv_nsec = NS_PER_S / MS_PER_S;
+   nanosleep(&onems, 0);
+   /* Check whether one second elapsed. */
+   if ((rte_get_timer_cycles() - start) <= twait)
+   continue;
+   } else {
+   /*
+* We get the amount of timer ticks for one second.
+* If this amount elapsed it means we spent one
+* second in waiting. This branch is executed once
+* on first iteration.
+*/
+   twait = rte_get_timer_hz();
+   MLX5_ASSERT(twait);
+   }
+   /*
+* Timeout elapsed, show message (once a second) and retry.
+* We have no other acceptable option here, if we ignore
+* the unregistering return code the handler will not
+* be unr

[PATCH v9 3/6] net/mlx5: add LWM event handling support

2022-06-15 Thread Spike Du
When LWM meets RQ WQE, the kernel driver raises an event to SW.
Use devx event_channel to catch this and to notify the user.
Allocate this channel per shared device.
The channel has a cookie that informs the specific event port and queue.

Signed-off-by: Spike Du 
---
 drivers/net/mlx5/mlx5.c  | 66 
 drivers/net/mlx5/mlx5.h  |  7 +
 drivers/net/mlx5/mlx5_devx.c | 47 +++
 drivers/net/mlx5/mlx5_rx.c   | 33 ++
 drivers/net/mlx5/mlx5_rx.h   |  7 +
 5 files changed, 160 insertions(+)

diff --git a/drivers/net/mlx5/mlx5.c b/drivers/net/mlx5/mlx5.c
index f098871..e04a666 100644
--- a/drivers/net/mlx5/mlx5.c
+++ b/drivers/net/mlx5/mlx5.c
@@ -9,6 +9,7 @@
 #include 
 #include 
 #include 
+#include 
 
 #include 
 #include 
@@ -22,6 +23,7 @@
 #include 
 #include 
 #include 
+#include 
 
 #include 
 #include 
@@ -1525,6 +1527,69 @@ struct mlx5_dev_ctx_shared *
 }
 
 /**
+ * Create LWM event_channel and interrupt handle for shared device
+ * context. All rxqs sharing the device context share the event_channel.
+ * A callback is registered in interrupt thread to receive the LWM event.
+ *
+ * @param[in] priv
+ *   Pointer to mlx5_priv instance.
+ *
+ * @return
+ *   0 on success, negative with rte_errno set.
+ */
+int
+mlx5_lwm_setup(struct mlx5_priv *priv)
+{
+   int fd_lwm;
+
+   pthread_mutex_init(&priv->sh->lwm_config_lock, NULL);
+   priv->sh->devx_channel_lwm = mlx5_os_devx_create_event_channel
+   (priv->sh->cdev->ctx,
+MLX5DV_DEVX_CREATE_EVENT_CHANNEL_FLAGS_OMIT_EV_DATA);
+   if (!priv->sh->devx_channel_lwm)
+   goto err;
+   fd_lwm = mlx5_os_get_devx_channel_fd(priv->sh->devx_channel_lwm);
+   priv->sh->intr_handle_lwm = mlx5_os_interrupt_handler_create
+   (RTE_INTR_INSTANCE_F_SHARED, true,
+fd_lwm, mlx5_dev_interrupt_handler_lwm, priv);
+   if (!priv->sh->intr_handle_lwm)
+   goto err;
+   return 0;
+err:
+   if (priv->sh->devx_channel_lwm) {
+   mlx5_os_devx_destroy_event_channel
+   (priv->sh->devx_channel_lwm);
+   priv->sh->devx_channel_lwm = NULL;
+   }
+   pthread_mutex_destroy(&priv->sh->lwm_config_lock);
+   return -rte_errno;
+}
+
+/**
+ * Destroy LWM event_channel and interrupt handle for shared device
+ * context before free this context. The interrupt handler is also
+ * unregistered.
+ *
+ * @param[in] sh
+ *   Pointer to shared device context.
+ */
+void
+mlx5_lwm_unset(struct mlx5_dev_ctx_shared *sh)
+{
+   if (sh->intr_handle_lwm) {
+   mlx5_os_interrupt_handler_destroy(sh->intr_handle_lwm,
+   mlx5_dev_interrupt_handler_lwm, (void *)-1);
+   sh->intr_handle_lwm = NULL;
+   }
+   if (sh->devx_channel_lwm) {
+   mlx5_os_devx_destroy_event_channel
+   (sh->devx_channel_lwm);
+   sh->devx_channel_lwm = NULL;
+   }
+   pthread_mutex_destroy(&sh->lwm_config_lock);
+}
+
+/**
  * Free shared IB device context. Decrement counter and if zero free
  * all allocated resources and close handles.
  *
@@ -1601,6 +1666,7 @@ struct mlx5_dev_ctx_shared *
claim_zero(mlx5_devx_cmd_destroy(sh->td));
MLX5_ASSERT(sh->geneve_tlv_option_resource == NULL);
pthread_mutex_destroy(&sh->txpp.mutex);
+   mlx5_lwm_unset(sh);
mlx5_free(sh);
return;
 exit:
diff --git a/drivers/net/mlx5/mlx5.h b/drivers/net/mlx5/mlx5.h
index 7ebb2cc..a76f2fe 100644
--- a/drivers/net/mlx5/mlx5.h
+++ b/drivers/net/mlx5/mlx5.h
@@ -1268,6 +1268,9 @@ struct mlx5_dev_ctx_shared {
struct mlx5_lb_ctx self_lb; /* QP to enable self loopback for Devx. */
unsigned int flow_max_priority;
enum modify_reg flow_mreg_c[MLX5_MREG_C_NUM];
+   void *devx_channel_lwm;
+   struct rte_intr_handle *intr_handle_lwm;
+   pthread_mutex_t lwm_config_lock;
/* Availability of mreg_c's. */
struct mlx5_dev_shared_port port[]; /* per device port data array. */
 };
@@ -1405,6 +1408,7 @@ enum mlx5_txq_modify_type {
 };
 
 struct mlx5_rxq_priv;
+struct mlx5_priv;
 
 /* HW objects operations structure. */
 struct mlx5_obj_ops {
@@ -1413,6 +1417,7 @@ struct mlx5_obj_ops {
int (*rxq_event_get)(struct mlx5_rxq_obj *rxq_obj);
int (*rxq_obj_modify)(struct mlx5_rxq_priv *rxq, uint8_t type);
void (*rxq_obj_release)(struct mlx5_rxq_priv *rxq);
+   int (*rxq_event_get_lwm)(struct mlx5_priv *priv, int *rxq_idx, int 
*port_id);
int (*ind_table_new)(struct rte_eth_dev *dev, const unsigned int log_n,
 struct mlx5_ind_table_obj *ind_tbl);
int (*ind_table_modify)(struct rte_eth_dev *dev,
@@ -1603,6 +1608,8 @@ int mlx5_udp_tunnel_port_add(struct rte_eth_dev *dev,
 bool mlx5_is_hpf(struct rte_eth_dev *dev);
 bool mlx5_is_sf_repr

[PATCH v9 4/6] net/mlx5: support Rx queue based available descriptor threshold

2022-06-15 Thread Spike Du
Add mlx5 specific available descriptor threshold configuration
and query handler.
In mlx5 PMD, available descriptor threshold is also called
LWM(limit watermark).
While the Rx queue fullness reaches the LWM limit, the driver catches
an HW event and invokes the user callback.
The query handler finds the next RX queue with pending LWM event
if any, starting from the given RX queue index.

Signed-off-by: Spike Du 
---
 doc/guides/nics/mlx5.rst   |  12 +++
 doc/guides/rel_notes/release_22_07.rst |   1 +
 drivers/common/mlx5/mlx5_prm.h |   1 +
 drivers/net/mlx5/mlx5.c|   2 +
 drivers/net/mlx5/mlx5_rx.c | 151 +
 drivers/net/mlx5/mlx5_rx.h |   5 ++
 6 files changed, 172 insertions(+)

diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst
index d83c56d..cceaddf 100644
--- a/doc/guides/nics/mlx5.rst
+++ b/doc/guides/nics/mlx5.rst
@@ -93,6 +93,7 @@ Features
 - Connection tracking.
 - Sub-Function representors.
 - Sub-Function.
+- Rx queue available descriptor threshold configuration.
 
 
 Limitations
@@ -520,6 +521,9 @@ Limitations
 
 - The NIC egress flow rules on representor port are not supported.
 
+- Available descriptor threshold:
+
+  - Doesn't support shared Rx queue and Hairpin Rx queue.
 
 Statistics
 --
@@ -1680,3 +1684,11 @@ The procedure below is an example of using a ConnectX-5 
adapter card (pf0) with
 #. For each VF PCIe, using the following command to bind the driver::
 
$ echo ":82:00.2" >> /sys/bus/pci/drivers/mlx5_core/bind
+
+Available descriptor threshold introduction
+---
+
+Available descriptor threshold is a per Rx queue attribute, it should be 
configured as
+a percentage of the Rx queue size.
+When Rx queue available descriptors for hardware are below the threshold, an 
event is sent to PMD.
+
diff --git a/doc/guides/rel_notes/release_22_07.rst 
b/doc/guides/rel_notes/release_22_07.rst
index 6fc044e..7fb98cd 100644
--- a/doc/guides/rel_notes/release_22_07.rst
+++ b/doc/guides/rel_notes/release_22_07.rst
@@ -151,6 +151,7 @@ New Features
   * Added support for promiscuous mode on Windows.
   * Added support for MTU on Windows.
   * Added matching and RSS on IPsec ESP.
+  * Added Rx queue available descriptor threshold support.
 
 * **Updated VMware vmxnet3 networking driver.**
 
diff --git a/drivers/common/mlx5/mlx5_prm.h b/drivers/common/mlx5/mlx5_prm.h
index 654e5f4..7c4030a 100644
--- a/drivers/common/mlx5/mlx5_prm.h
+++ b/drivers/common/mlx5/mlx5_prm.h
@@ -3294,6 +3294,7 @@ struct mlx5_aso_wqe {
 
 enum {
MLX5_EVENT_TYPE_OBJECT_CHANGE = 0x27,
+   MLX5_EVENT_TYPE_SRQ_LIMIT_REACHED = 0x14,
 };
 
 enum {
diff --git a/drivers/net/mlx5/mlx5.c b/drivers/net/mlx5/mlx5.c
index e04a666..998846a 100644
--- a/drivers/net/mlx5/mlx5.c
+++ b/drivers/net/mlx5/mlx5.c
@@ -2071,6 +2071,8 @@ struct mlx5_dev_ctx_shared *
.dev_supported_ptypes_get = mlx5_dev_supported_ptypes_get,
.vlan_filter_set = mlx5_vlan_filter_set,
.rx_queue_setup = mlx5_rx_queue_setup,
+   .rx_queue_avail_thresh_set = mlx5_rx_queue_lwm_set,
+   .rx_queue_avail_thresh_query = mlx5_rx_queue_lwm_query,
.rx_hairpin_queue_setup = mlx5_rx_hairpin_queue_setup,
.tx_queue_setup = mlx5_tx_queue_setup,
.tx_hairpin_queue_setup = mlx5_tx_hairpin_queue_setup,
diff --git a/drivers/net/mlx5/mlx5_rx.c b/drivers/net/mlx5/mlx5_rx.c
index 197d708..2cb7006 100644
--- a/drivers/net/mlx5/mlx5_rx.c
+++ b/drivers/net/mlx5/mlx5_rx.c
@@ -25,6 +25,7 @@
 #include "mlx5.h"
 #include "mlx5_utils.h"
 #include "mlx5_rxtx.h"
+#include "mlx5_devx.h"
 #include "mlx5_rx.h"
 
 
@@ -128,6 +129,16 @@
return RTE_ETH_RX_DESC_AVAIL;
 }
 
+/* Get rxq lwm percentage according to lwm number. */
+static uint8_t
+mlx5_rxq_lwm_to_percentage(struct mlx5_rxq_priv *rxq)
+{
+   struct mlx5_rxq_data *rxq_data = &rxq->ctrl->rxq;
+   uint32_t wqe_cnt = 1 << (rxq_data->elts_n - rxq_data->sges_n);
+
+   return rxq->lwm * 100 / wqe_cnt;
+}
+
 /**
  * DPDK callback to get the RX queue information.
  *
@@ -150,6 +161,7 @@
 {
struct mlx5_rxq_ctrl *rxq_ctrl = mlx5_rxq_ctrl_get(dev, rx_queue_id);
struct mlx5_rxq_data *rxq = mlx5_rxq_data_get(dev, rx_queue_id);
+   struct mlx5_rxq_priv *rxq_priv = mlx5_rxq_get(dev, rx_queue_id);
 
if (!rxq)
return;
@@ -169,6 +181,8 @@
qinfo->nb_desc = mlx5_rxq_mprq_enabled(rxq) ?
RTE_BIT32(rxq->elts_n) * RTE_BIT32(rxq->log_strd_num) :
RTE_BIT32(rxq->elts_n);
+   qinfo->avail_thresh = rxq_priv ?
+   mlx5_rxq_lwm_to_percentage(rxq_priv) : 0;
 }
 
 /**
@@ -1188,6 +1202,34 @@ int mlx5_get_monitor_addr(void *rx_queue, struct 
rte_power_monitor_cond *pmc)
return -ENOTSUP;
 }
 
+int
+mlx5_rx_queue_lwm_query(struct rte_eth_dev *dev,
+   uint16_t *queue_id, uint8_t *lwm)
+{
+   struct mlx5_priv *priv = 

[PATCH v9 5/6] net/mlx5: add private API to config host port shaper

2022-06-15 Thread Spike Du
Host port shaper can be configured with QSHR(QoS Shaper Host Register).
Add check in build files to enable this function or not.

The host shaper configuration affects all the ethdev ports belonging to the
same host port.

Host shaper can configure shaper rate and lwm-triggered for a host port.
The shaper limits the rate of traffic from host port to wire port.
If lwm-triggered is enabled, a 100Mbps shaper is enabled automatically
when one of the host port's Rx queues receives available descriptor
threshold event.

Signed-off-by: Spike Du 
---
 doc/guides/nics/mlx5.rst   |  35 +++
 doc/guides/rel_notes/release_22_07.rst |   1 +
 drivers/common/mlx5/linux/meson.build  |  13 +
 drivers/common/mlx5/mlx5_prm.h |  25 
 drivers/net/mlx5/mlx5.h|   2 +
 drivers/net/mlx5/mlx5_rx.c | 104 +
 drivers/net/mlx5/rte_pmd_mlx5.h|  30 ++
 drivers/net/mlx5/version.map   |   2 +
 8 files changed, 212 insertions(+)

diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst
index cceaddf..5f7b060 100644
--- a/doc/guides/nics/mlx5.rst
+++ b/doc/guides/nics/mlx5.rst
@@ -94,6 +94,7 @@ Features
 - Sub-Function representors.
 - Sub-Function.
 - Rx queue available descriptor threshold configuration.
+- Host shaper support.
 
 
 Limitations
@@ -525,6 +526,12 @@ Limitations
 
   - Doesn't support shared Rx queue and Hairpin Rx queue.
 
+- Host shaper:
+
+  - Support BlueField series NIC from BlueField 2.
+  - When configure host shaper with 
MLX5_HOST_SHAPER_FLAG_AVAIL_THRESH_TRIGGERED flag set,
+only rate 0 and 100Mbps are supported.
+
 Statistics
 --
 
@@ -1692,3 +1699,31 @@ Available descriptor threshold is a per Rx queue 
attribute, it should be configu
 a percentage of the Rx queue size.
 When Rx queue available descriptors for hardware are below the threshold, an 
event is sent to PMD.
 
+Host shaper introduction
+
+
+Host shaper register is per host port register which sets a shaper
+on the host port.
+All VF/hostPF representors belonging to one host port share one host shaper.
+For example, if representor 0 and representor 1 belong to same host port,
+and a host shaper rate of 1Gbps is configured, the shaper throttles both
+representors' traffic from host.
+Host shaper has two modes for setting the shaper, immediate and deferred to
+available descriptor threshold event trigger. In immediate mode, the rate 
limit is configured
+immediately to host shaper. When deferring to available descriptor threshold 
trigger, the shaper
+is not set until an available descriptor threshold event is received by any Rx 
queue in a VF
+representor belonging to the host port. The only rate supported for deferred
+mode is 100Mbps (there is no limit on the supported rates for immediate mode).
+In deferred mode, the shaper is set on the host port by the firmware upon
+receiving the available descriptor threshold event, which allows throttling 
host traffic on
+available descriptor threshold events at minimum latency, preventing excess 
drops in the
+Rx queue.
+
+Host shaper dependency for mstflint package
+---
+
+In order to configure host shaper register, ``librte_net_mlx5`` depends on 
``libmtcr_ul``
+which can be installed from OFED mstflint package.
+Meson detects ``libmtcr_ul`` existence at configure stage.
+If the library is detected, the application must link with ``-lmtcr_ul``,
+as done by the pkg-config file libdpdk.pc.
diff --git a/doc/guides/rel_notes/release_22_07.rst 
b/doc/guides/rel_notes/release_22_07.rst
index 7fb98cd..199a775 100644
--- a/doc/guides/rel_notes/release_22_07.rst
+++ b/doc/guides/rel_notes/release_22_07.rst
@@ -152,6 +152,7 @@ New Features
   * Added support for MTU on Windows.
   * Added matching and RSS on IPsec ESP.
   * Added Rx queue available descriptor threshold support.
+  * Added host shaper support.
 
 * **Updated VMware vmxnet3 networking driver.**
 
diff --git a/drivers/common/mlx5/linux/meson.build 
b/drivers/common/mlx5/linux/meson.build
index 5335f5b..51c6e5d 100644
--- a/drivers/common/mlx5/linux/meson.build
+++ b/drivers/common/mlx5/linux/meson.build
@@ -45,6 +45,13 @@ if static_ibverbs
 ext_deps += declare_dependency(link_args:ibv_ldflags.split())
 endif
 
+libmtcr_ul_found = false
+lib = cc.find_library('mtcr_ul', required:false)
+if lib.found() and run_command('meson', 
'--version').stdout().version_compare('>= 0.49.2')
+libmtcr_ul_found = true
+ext_deps += lib
+endif
+
 sources += files('mlx5_nl.c')
 sources += files('mlx5_common_auxiliary.c')
 sources += files('mlx5_common_os.c')
@@ -207,6 +214,12 @@ has_sym_args = [
 [ 'HAVE_MLX5_IBV_IMPORT_CTX_PD_AND_MR', 'infiniband/verbs.h',
 'ibv_import_device' ],
 ]
+if  libmtcr_ul_found
+has_sym_args += [
+[  'HAVE_MLX5_MSTFLINT', 'mstflint/mtcr.h',
+'mopen'],
+]
+endif
 config = configuration_da

[PATCH v9 6/6] app/testpmd: add Host Shaper command

2022-06-15 Thread Spike Du
Add command line options to support host shaper configure.
- Command syntax:
  mlx5 set port  host_shaper avail_thresh_triggered <0|1> rate


- Example commands:
To enable avail_thresh_triggered on port 1 and disable current host
shaper:
testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0

To disable avail_thresh_triggered and current host shaper on port 1:
testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0

The rate unit is 100Mbps.
To disable avail_thresh_triggered and configure a shaper of 5Gbps on
port 1:
testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50

Add sample code to handle rxq available descriptor threshold event, it
delays a while so that rxq empties, then disables host shaper and
rearms available descriptor threshold event.

Signed-off-by: Spike Du 
---
 app/test-pmd/testpmd.c  |   7 ++
 doc/guides/nics/mlx5.rst|  46 +
 drivers/net/mlx5/meson.build|   4 +
 drivers/net/mlx5/mlx5_testpmd.c | 205 
 drivers/net/mlx5/mlx5_testpmd.h |  26 +
 5 files changed, 288 insertions(+)
 create mode 100644 drivers/net/mlx5/mlx5_testpmd.c
 create mode 100644 drivers/net/mlx5/mlx5_testpmd.h

diff --git a/app/test-pmd/testpmd.c b/app/test-pmd/testpmd.c
index 205d98e..e6321bd 100644
--- a/app/test-pmd/testpmd.c
+++ b/app/test-pmd/testpmd.c
@@ -69,6 +69,9 @@
 #ifdef RTE_NET_BOND
 #include 
 #endif
+#ifdef RTE_NET_MLX5
+#include "mlx5_testpmd.h"
+#endif
 
 #include "testpmd.h"
 
@@ -3726,6 +3729,10 @@ struct pmd_test_command {
break;
printf("Received avail_thresh event, port: %u, rxq_id: 
%u\n",
   port_id, rxq_id);
+
+#ifdef RTE_NET_MLX5
+   mlx5_test_avail_thresh_event_handler(port_id, rxq_id);
+#endif
}
break;
}
diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst
index 5f7b060..64eaddf 100644
--- a/doc/guides/nics/mlx5.rst
+++ b/doc/guides/nics/mlx5.rst
@@ -1727,3 +1727,49 @@ which can be installed from OFED mstflint package.
 Meson detects ``libmtcr_ul`` existence at configure stage.
 If the library is detected, the application must link with ``-lmtcr_ul``,
 as done by the pkg-config file libdpdk.pc.
+
+How to use available descriptor threshold and Host Shaper
+-
+
+There is a command to configure available descriptor threshold in testpmd.
+Testpmd also contains sample logic to handle available descriptor threshold 
event.
+The typical workflow is: testpmd configure available descriptor threshold for 
Rx queues, enable
+avail_thresh_triggered in host shaper and register a callback, when traffic 
from host is
+too high and Rx queue emptiness is below available descriptor threshold, PMD 
receives an event and
+firmware configures a 100Mbps shaper on host port automatically, then PMD call
+the callback registered previously, which will delay a while to let Rx queue
+empty, then disable host shaper.
+
+Let's assume we have a simple BlueField 2 setup: port 0 is uplink, port 1
+is VF representor. Each port has 2 Rx queues.
+In order to control traffic from host to ARM, we can enable available 
descriptor threshold in testpmd by:
+
+.. code-block:: console
+
+   testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0
+   testpmd> set port 1 rxq 0 avail_thresh 70
+   testpmd> set port 1 rxq 1 avail_thresh 70
+
+The first command disables current host shaper, and enables available 
descriptor threshold triggered mode.
+The other commands configure available descriptor threshold to 70% of Rx queue 
size for both Rx queues,
+When traffic from host is too high, you can see testpmd console prints log
+about available descriptor threshold event receiving, then host shaper is 
disabled.
+The traffic rate from host is controlled and less drop happens in Rx queues.
+
+The threshold event and shaper can be disabled like this:
+
+.. code-block:: console
+
+   testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0
+   testpmd> set port 1 rxq 0 avail_thresh 0
+   testpmd> set port 1 rxq 1 avail_thresh 0
+
+It's recommended an application disables available descriptor threshold and 
avail_thresh_triggered before exit,
+if it enables them before.
+
+We can also configure the shaper with a value, the rate unit is 100Mbps, below
+command sets current shaper to 5Gbps and disables avail_thresh_triggered.
+
+.. code-block:: console
+
+   testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50
diff --git a/drivers/net/mlx5/meson.build b/drivers/net/mlx5/meson.build
index 99210fd..941642b 100644
--- a/drivers/net/mlx5/meson.build
+++ b/drivers/net/mlx5/meson.build
@@ -68,4 +68,8 @@ if get_option('buildtype').contains('debug')
 else
 cflags += [ '-UPEDANTIC' ]
 endif
+
+testpmd_sources += files('mlx5_testpmd.c')
+testpmd_drivers_deps += 'net_mlx5'
+
 subdir(exec_e

Re: [PATCH 00/12] Fix compilation with gcc 12

2022-06-15 Thread Stephen Hemminger
On Wed, 15 Jun 2022 10:49:17 +0200
David Marchand  wrote:

> On Wed, May 18, 2022 at 12:17 PM David Marchand
>  wrote:
> >
> > Fedora 36 is out since early may and comes with gcc 12.
> > This series fixes compilation or waives some checks.
> >
> > There might be something fishy with rte_memcpy on x86 but, for now,
> > the rte_memcpy related fixes are on the caller side.
> >
> > Some "base" drivers have issues, I chose the simple solution of waiving
> > the checks for them.
> >
> > Compilation is the only thing checked.
> > Please driver maintainers, check nothing got broken.  
> 
> I applied the patches that got acked and that had no objection or
> comment from maintainers (i.e. patch 3, 4, 5, 6, 9, 11).
> I also cleaned the mess in bugzilla where we had multiple reports of
> the same issues, or stale bugs that I can't reproduce with released
> gcc 12.
> 
> I'll respin separately the patches for which I have clear comments,
> and drop my patches waiving the compiler checks.
> 
> We still need to agree on the best approach to handle the new checks.
> We have two rfc series from Stephen, how do we move forward?

Lets fix all the bugs and remove any workarounds using pragma's.

Some of them may mean removing rte_memcpy where it is not needed.


[dpdk-dev] [PATCH 0/8] bnxt PMD fixes

2022-06-15 Thread Kalesh A P
From: Kalesh AP 

This patchset contains bug fixes in bnxt PMD.

Ajit Khaparde (1):
  net/bnxt: fix switch domain allocation

Damodharam Ammepalli (3):
  net/bnxt: allow Tx only or Rx only configs in PMD
  net/bnxt: disallow MTU change when device is started
  net/bnxt: cleanups in MTU set callback

Kalesh AP (2):
  net/bnxt: reduce the verbosity of a log
  net/bnxt: fix setting forced speed

Somnath Kotur (2):
  net/bnxt: remove assert for zero data len in Tx path
  net/bnxt: fix the check for autoneg enablement in the PHY FW

 drivers/net/bnxt/bnxt_ethdev.c | 95 ++
 drivers/net/bnxt/bnxt_hwrm.c   | 45 +---
 drivers/net/bnxt/bnxt_hwrm.h   |  3 ++
 drivers/net/bnxt/bnxt_rxq.c|  9 +---
 drivers/net/bnxt/bnxt_txr.c| 29 +++--
 5 files changed, 101 insertions(+), 80 deletions(-)

-- 
2.10.1



[dpdk-dev] [PATCH 1/8] net/bnxt: remove assert for zero data len in Tx path

2022-06-15 Thread Kalesh A P
From: Somnath Kotur 

Currently the PMD tries to detect a potential 0 byte DMA by
using RTE_VERIFY.
But since RTE_VERIFY internally calls rte_panic() it is fatal to
the application and some applications want to avoid that.
So return an error from the bnxt xmit handler if such a bad pkt is
encountered by logging an error message, dumping the pkt header and
dump the current stack as well

Signed-off-by: Somnath Kotur 
Reviewed-by: Ajit Khaparde 
---
 drivers/net/bnxt/bnxt_txr.c | 29 ++---
 1 file changed, 26 insertions(+), 3 deletions(-)

diff --git a/drivers/net/bnxt/bnxt_txr.c b/drivers/net/bnxt/bnxt_txr.c
index 7a7196a..67e0167 100644
--- a/drivers/net/bnxt/bnxt_txr.c
+++ b/drivers/net/bnxt/bnxt_txr.c
@@ -123,6 +123,26 @@ bnxt_xmit_need_long_bd(struct rte_mbuf *tx_pkt, struct 
bnxt_tx_queue *txq)
return false;
 }
 
+static bool
+bnxt_zero_data_len_tso_segsz(struct rte_mbuf *tx_pkt, uint8_t data_len_chk)
+{
+   const char *type_str = "Data len";
+   uint16_t len_to_check = tx_pkt->data_len;
+
+   if (data_len_chk == 0) {
+   type_str = "TSO Seg size";
+   len_to_check = tx_pkt->tso_segsz;
+   }
+
+   if (len_to_check == 0) {
+   PMD_DRV_LOG(ERR, "Error! Tx pkt %s == 0\n", type_str);
+   rte_pktmbuf_dump(stdout, tx_pkt, 64);
+   rte_dump_stack();
+   return true;
+   }
+   return false;
+}
+
 static uint16_t bnxt_start_xmit(struct rte_mbuf *tx_pkt,
struct bnxt_tx_queue *txq,
uint16_t *coal_pkts,
@@ -179,7 +199,8 @@ static uint16_t bnxt_start_xmit(struct rte_mbuf *tx_pkt,
}
 
/* Check non zero data_len */
-   RTE_VERIFY(tx_pkt->data_len);
+   if (unlikely(bnxt_zero_data_len_tso_segsz(tx_pkt, 1)))
+   return -EIO;
 
prod = RING_IDX(ring, txr->tx_raw_prod);
tx_buf = &txr->tx_buf_ring[prod];
@@ -256,7 +277,8 @@ static uint16_t bnxt_start_xmit(struct rte_mbuf *tx_pkt,
 */
txbd1->kid_or_ts_low_hdr_size = hdr_size >> 1;
txbd1->kid_or_ts_high_mss = tx_pkt->tso_segsz;
-   RTE_VERIFY(txbd1->kid_or_ts_high_mss);
+   if (unlikely(bnxt_zero_data_len_tso_segsz(tx_pkt, 0)))
+   return -EIO;
 
} else if ((tx_pkt->ol_flags & PKT_TX_OIP_IIP_TCP_UDP_CKSUM) ==
   PKT_TX_OIP_IIP_TCP_UDP_CKSUM) {
@@ -330,7 +352,8 @@ static uint16_t bnxt_start_xmit(struct rte_mbuf *tx_pkt,
m_seg = tx_pkt->next;
while (m_seg) {
/* Check non zero data_len */
-   RTE_VERIFY(m_seg->data_len);
+   if (unlikely(bnxt_zero_data_len_tso_segsz(m_seg, 1)))
+   return -EIO;
txr->tx_raw_prod = RING_NEXT(txr->tx_raw_prod);
 
prod = RING_IDX(ring, txr->tx_raw_prod);
-- 
2.10.1



[dpdk-dev] [PATCH 2/8] net/bnxt: fix switch domain allocation

2022-06-15 Thread Kalesh A P
From: Ajit Khaparde 

Allocate switch domain after the trusted VF capability is queried
from the FW. Currently we are calling the function earlier.
Since the switch domain is allocated only for PFs or trusted VF,
the current location of code fails to allocate the domain during init.
But during cleanup we try to free the domain incorrectly.
Fix the behavior by changing the sequence of function calls.

Fixes: 3127f99274b67 ("net/bnxt: refactor init/uninit")
Cc: sta...@dpdk.org

Signed-off-by: Ajit Khaparde 
Reviewed-by: Somnath Kotur 
Reviewed-by: Kalesh AP 
Reviewed-by: Andy Gospodarek 
---
 drivers/net/bnxt/bnxt_ethdev.c | 45 +-
 1 file changed, 23 insertions(+), 22 deletions(-)

diff --git a/drivers/net/bnxt/bnxt_ethdev.c b/drivers/net/bnxt/bnxt_ethdev.c
index 0f0f40b..34f2149 100644
--- a/drivers/net/bnxt/bnxt_ethdev.c
+++ b/drivers/net/bnxt/bnxt_ethdev.c
@@ -5287,6 +5287,25 @@ bnxt_init_locks(struct bnxt *bp)
return err;
 }
 
+/* This should be called after we have queried trusted VF cap */
+static int bnxt_alloc_switch_domain(struct bnxt *bp)
+{
+   int rc = 0;
+
+   if (BNXT_PF(bp) || BNXT_VF_IS_TRUSTED(bp)) {
+   rc = rte_eth_switch_domain_alloc(&bp->switch_domain_id);
+   if (rc)
+   PMD_DRV_LOG(ERR,
+   "Failed to alloc switch domain: %d\n", rc);
+   else
+   PMD_DRV_LOG(INFO,
+   "Switch domain allocated %d\n",
+   bp->switch_domain_id);
+   }
+
+   return rc;
+}
+
 static int bnxt_init_resources(struct bnxt *bp, bool reconfig_dev)
 {
int rc = 0;
@@ -5295,6 +5314,10 @@ static int bnxt_init_resources(struct bnxt *bp, bool 
reconfig_dev)
if (rc)
return rc;
 
+   rc = bnxt_alloc_switch_domain(bp);
+   if (rc)
+   return rc;
+
if (!reconfig_dev) {
rc = bnxt_setup_mac_addr(bp->eth_dev);
if (rc)
@@ -5734,24 +5757,6 @@ bnxt_parse_dev_args(struct bnxt *bp, struct rte_devargs 
*devargs)
return ret;
 }
 
-static int bnxt_alloc_switch_domain(struct bnxt *bp)
-{
-   int rc = 0;
-
-   if (BNXT_PF(bp) || BNXT_VF_IS_TRUSTED(bp)) {
-   rc = rte_eth_switch_domain_alloc(&bp->switch_domain_id);
-   if (rc)
-   PMD_DRV_LOG(ERR,
-   "Failed to alloc switch domain: %d\n", rc);
-   else
-   PMD_DRV_LOG(INFO,
-   "Switch domain allocated %d\n",
-   bp->switch_domain_id);
-   }
-
-   return rc;
-}
-
 /* Allocate and initialize various fields in bnxt struct that
  * need to be allocated/destroyed only once in the lifetime of the driver
  */
@@ -5828,10 +5833,6 @@ static int bnxt_drv_init(struct rte_eth_dev *eth_dev)
if (rc)
return rc;
 
-   rc = bnxt_alloc_switch_domain(bp);
-   if (rc)
-   return rc;
-
return rc;
 }
 
-- 
2.10.1



[dpdk-dev] [PATCH 3/8] net/bnxt: reduce the verbosity of a log

2022-06-15 Thread Kalesh A P
From: Kalesh AP 

Broadcom HW drops packets when there are no descriptors available.
It does not matter what flag the application specifies in "rx_drop_en"
when configuring the Rx ring.

Reduce the verbosity of the log to print the status of the "rx_drop_en"
when configuring the Rx ring.

Signed-off-by: Kalesh AP 
Reviewed-by: Somnath Kotur 
Reviewed-by: Ajit Khaparde 
---
 drivers/net/bnxt/bnxt_rxq.c | 5 ++---
 1 file changed, 2 insertions(+), 3 deletions(-)

diff --git a/drivers/net/bnxt/bnxt_rxq.c b/drivers/net/bnxt/bnxt_rxq.c
index ef2ef83..f4b6417 100644
--- a/drivers/net/bnxt/bnxt_rxq.c
+++ b/drivers/net/bnxt/bnxt_rxq.c
@@ -360,9 +360,8 @@ int bnxt_rx_queue_setup_op(struct rte_eth_dev *eth_dev,
rxq->rx_free_thresh =
RTE_MIN(rte_align32pow2(nb_desc) / 4, RTE_BNXT_MAX_RX_BURST);
 
-   if (rx_conf->rx_drop_en != BNXT_DEFAULT_RX_DROP_EN)
-   PMD_DRV_LOG(NOTICE,
-   "Per-queue config of drop-en is not supported.\n");
+   PMD_DRV_LOG(DEBUG,
+   "App supplied RXQ drop_en status : %d\n", 
rx_conf->rx_drop_en);
rxq->drop_en = BNXT_DEFAULT_RX_DROP_EN;
 
PMD_DRV_LOG(DEBUG, "RX Buf MTU %d\n", eth_dev->data->mtu);
-- 
2.10.1



[dpdk-dev] [PATCH 4/8] net/bnxt: allow Tx only or Rx only configs in PMD

2022-06-15 Thread Kalesh A P
From: Damodharam Ammepalli 

Currently, we fail the init/probe of pmd if eth_dev->data->nb_tx_queues
or eth_dev->data->nb_rx_queues is 0. We are removing this check.

Fixes: daef48efe5e5 ("net/bnxt: support set MTU")
Cc: sta...@dpdk.org

Signed-off-by: Damodharam Ammepalli 
Reviewed-by: Ajit Khaparde 
Reviewed-by: Somnath Kotur 
---
 drivers/net/bnxt/bnxt_ethdev.c | 7 +--
 1 file changed, 1 insertion(+), 6 deletions(-)

diff --git a/drivers/net/bnxt/bnxt_ethdev.c b/drivers/net/bnxt/bnxt_ethdev.c
index 34f2149..8181e1f 100644
--- a/drivers/net/bnxt/bnxt_ethdev.c
+++ b/drivers/net/bnxt/bnxt_ethdev.c
@@ -723,7 +723,7 @@ static int bnxt_alloc_prev_ring_stats(struct bnxt *bp)
 sizeof(struct bnxt_ring_stats) *
 bp->tx_cp_nr_rings,
 0);
-   if (bp->prev_tx_ring_stats == NULL)
+   if (bp->tx_cp_nr_rings > 0 && bp->prev_tx_ring_stats == NULL)
goto error;
 
return 0;
@@ -1567,11 +1567,6 @@ int bnxt_dev_start_op(struct rte_eth_dev *eth_dev)
int vlan_mask = 0;
int rc, retry_cnt = BNXT_IF_CHANGE_RETRY_COUNT;
 
-   if (!eth_dev->data->nb_tx_queues || !eth_dev->data->nb_rx_queues) {
-   PMD_DRV_LOG(ERR, "Queues are not configured yet!\n");
-   return -EINVAL;
-   }
-
if (bp->rx_cp_nr_rings > RTE_ETHDEV_QUEUE_STAT_CNTRS)
PMD_DRV_LOG(ERR,
"RxQ cnt %d > RTE_ETHDEV_QUEUE_STAT_CNTRS %d\n",
-- 
2.10.1



[dpdk-dev] [PATCH 5/8] net/bnxt: fix setting forced speed

2022-06-15 Thread Kalesh A P
From: Kalesh AP 

The "active_fec_signal_mode" in HWRM_PORT_PHY_QCFG response
does not return correct value till the link is up. Driver cannot
rely on active_fec_signal_mode while setting forced speed.

While setting forced speed of 50G/100G/200G, check if PAM4 speeds
are supported for the port first and then populate the HWRM request
accordingly.

Also, If PAM4 speed is supported, use PAM4 supported speed while
reporting speed capabilities.

Fixes: c23f9ded0391 ("net/bnxt: support 200G PAM4 link")
Cc: sta...@dpdk.org

Signed-off-by: Kalesh AP 
Reviewed-by: Ajit Khaparde 
Reviewed-by: Somnath Kotur 
---
 drivers/net/bnxt/bnxt_ethdev.c | 11 ++-
 drivers/net/bnxt/bnxt_hwrm.c   | 27 +++
 drivers/net/bnxt/bnxt_hwrm.h   |  3 +++
 3 files changed, 28 insertions(+), 13 deletions(-)

diff --git a/drivers/net/bnxt/bnxt_ethdev.c b/drivers/net/bnxt/bnxt_ethdev.c
index 8181e1f..4e4791c 100644
--- a/drivers/net/bnxt/bnxt_ethdev.c
+++ b/drivers/net/bnxt/bnxt_ethdev.c
@@ -903,6 +903,7 @@ static int bnxt_shutdown_nic(struct bnxt *bp)
 
 uint32_t bnxt_get_speed_capabilities(struct bnxt *bp)
 {
+   uint32_t pam4_link_speed = 0;
uint32_t link_speed = 0;
uint32_t speed_capa = 0;
 
@@ -912,8 +913,8 @@ uint32_t bnxt_get_speed_capabilities(struct bnxt *bp)
link_speed = bp->link_info->support_speeds;
 
/* If PAM4 is configured, use PAM4 supported speed */
-   if (link_speed == 0 && bp->link_info->support_pam4_speeds > 0)
-   link_speed = bp->link_info->support_pam4_speeds;
+   if (bp->link_info->support_pam4_speeds > 0)
+   pam4_link_speed = bp->link_info->support_pam4_speeds;
 
if (link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_LINK_SPEED_100MB)
speed_capa |= RTE_ETH_LINK_SPEED_100M;
@@ -935,11 +936,11 @@ uint32_t bnxt_get_speed_capabilities(struct bnxt *bp)
speed_capa |= RTE_ETH_LINK_SPEED_50G;
if (link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_SPEEDS_100GB)
speed_capa |= RTE_ETH_LINK_SPEED_100G;
-   if (link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_50G)
+   if (pam4_link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_50G)
speed_capa |= RTE_ETH_LINK_SPEED_50G;
-   if (link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_100G)
+   if (pam4_link_speed & 
HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_100G)
speed_capa |= RTE_ETH_LINK_SPEED_100G;
-   if (link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_200G)
+   if (pam4_link_speed & 
HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_200G)
speed_capa |= RTE_ETH_LINK_SPEED_200G;
 
if (bp->link_info->auto_mode ==
diff --git a/drivers/net/bnxt/bnxt_hwrm.c b/drivers/net/bnxt/bnxt_hwrm.c
index 9eb8b8d..206ac20 100644
--- a/drivers/net/bnxt/bnxt_hwrm.c
+++ b/drivers/net/bnxt/bnxt_hwrm.c
@@ -2970,7 +2970,7 @@ static uint16_t bnxt_check_eth_link_autoneg(uint32_t 
conf_link)
 }
 
 static uint16_t bnxt_parse_eth_link_speed(uint32_t conf_link_speed,
- uint16_t pam4_link)
+ struct bnxt_link_info *link_info)
 {
uint16_t eth_link_speed = 0;
 
@@ -3009,18 +3009,29 @@ static uint16_t bnxt_parse_eth_link_speed(uint32_t 
conf_link_speed,
HWRM_PORT_PHY_CFG_INPUT_FORCE_LINK_SPEED_40GB;
break;
case RTE_ETH_LINK_SPEED_50G:
-   eth_link_speed = pam4_link ?
-   HWRM_PORT_PHY_CFG_INPUT_FORCE_PAM4_LINK_SPEED_50GB :
-   HWRM_PORT_PHY_CFG_INPUT_FORCE_LINK_SPEED_50GB;
+   if (link_info->support_pam4_speeds &
+   HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_50G) {
+   eth_link_speed = 
HWRM_PORT_PHY_CFG_INPUT_FORCE_PAM4_LINK_SPEED_50GB;
+   link_info->link_signal_mode = BNXT_SIG_MODE_PAM4;
+   } else {
+   eth_link_speed = 
HWRM_PORT_PHY_CFG_INPUT_FORCE_LINK_SPEED_50GB;
+   link_info->link_signal_mode = BNXT_SIG_MODE_NRZ;
+   }
break;
case RTE_ETH_LINK_SPEED_100G:
-   eth_link_speed = pam4_link ?
-   HWRM_PORT_PHY_CFG_INPUT_FORCE_PAM4_LINK_SPEED_100GB :
-   HWRM_PORT_PHY_CFG_INPUT_FORCE_LINK_SPEED_100GB;
+   if (link_info->support_pam4_speeds &
+   HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_100G) {
+   eth_link_speed = 
HWRM_PORT_PHY_CFG_INPUT_FORCE_PAM4_LINK_SPEED_100GB;
+   link_info->link_signal_mode = BNXT_SIG_MODE_PAM4;
+   } else {
+   eth_link_speed = 
HWRM_PORT_PHY_CFG_INPUT_FORCE_LINK_SPEED_100GB;
+   link_info->link_signal_mode = BNXT_SIG_MODE_NRZ;
+   }
break;
case RTE_ETH_LINK_SPEED_200G:
   

[dpdk-dev] [PATCH 6/8] net/bnxt: disallow MTU change when device is started

2022-06-15 Thread Kalesh A P
From: Damodharam Ammepalli 

With this patch, bnxt_mtu_set_op() will return an error code if the
device has already started. The user application will have to take
care to bring down device before invoking the mtu_set()

Fixes: daef48efe5e5 ("net/bnxt: support set MTU")
Cc: sta...@dpdk.org

Signed-off-by: Damodharam Ammepalli 
Reviewed-by: Andy Gospodarek 
Reviewed-by: Ajit Khaparde 
---
 drivers/net/bnxt/bnxt_ethdev.c | 28 
 1 file changed, 8 insertions(+), 20 deletions(-)

diff --git a/drivers/net/bnxt/bnxt_ethdev.c b/drivers/net/bnxt/bnxt_ethdev.c
index 4e4791c..f040cdc 100644
--- a/drivers/net/bnxt/bnxt_ethdev.c
+++ b/drivers/net/bnxt/bnxt_ethdev.c
@@ -3019,9 +3019,7 @@ bnxt_tx_burst_mode_get(struct rte_eth_dev *dev, 
__rte_unused uint16_t queue_id,
 
 int bnxt_mtu_set_op(struct rte_eth_dev *eth_dev, uint16_t new_mtu)
 {
-   uint32_t overhead = BNXT_MAX_PKT_LEN - BNXT_MAX_MTU;
struct bnxt *bp = eth_dev->data->dev_private;
-   uint32_t new_pkt_size;
uint32_t rc;
uint32_t i;
 
@@ -3029,25 +3027,15 @@ int bnxt_mtu_set_op(struct rte_eth_dev *eth_dev, 
uint16_t new_mtu)
if (rc)
return rc;
 
+   /* Return if port is active */
+   if (eth_dev->data->dev_started) {
+   PMD_DRV_LOG(ERR, "Stop port before changing MTU\n");
+   return -EPERM;
+   }
+
/* Exit if receive queues are not configured yet */
if (!eth_dev->data->nb_rx_queues)
-   return rc;
-
-   new_pkt_size = new_mtu + overhead;
-
-   /*
-* Disallow any MTU change that would require scattered receive support
-* if it is not already enabled.
-*/
-   if (eth_dev->data->dev_started &&
-   !eth_dev->data->scattered_rx &&
-   (new_pkt_size >
-eth_dev->data->min_rx_buf_size - RTE_PKTMBUF_HEADROOM)) {
-   PMD_DRV_LOG(ERR,
-   "MTU change would require scattered rx support. ");
-   PMD_DRV_LOG(ERR, "Stop port before changing MTU.\n");
-   return -EINVAL;
-   }
+   return -ENOTSUP;
 
if (new_mtu > RTE_ETHER_MTU)
bp->flags |= BNXT_FLAG_JUMBO;
@@ -3056,7 +3044,7 @@ int bnxt_mtu_set_op(struct rte_eth_dev *eth_dev, uint16_t 
new_mtu)
 
/* Is there a change in mtu setting? */
if (eth_dev->data->mtu == new_mtu)
-   return rc;
+   return 0;
 
for (i = 0; i < bp->nr_vnics; i++) {
struct bnxt_vnic_info *vnic = &bp->vnic_info[i];
-- 
2.10.1



[dpdk-dev] [PATCH 7/8] net/bnxt: cleanups in MTU set callback

2022-06-15 Thread Kalesh A P
From: Damodharam Ammepalli 

Minor cleanup in bnxt_mtu_set_op() to move pre-mature
setting of jumbo flag post mtu check and remove
a redundant mtu set operation from rxq vnic configs.

Fixes: daef48efe5e5 ("net/bnxt: support set MTU")
Cc: sta...@dpdk.org

Signed-off-by: Damodharam Ammepalli 
Reviewed-by: Ajit Khaparde 
Reviewed-by: Somnath Kotur 
---
 drivers/net/bnxt/bnxt_ethdev.c | 8 
 drivers/net/bnxt/bnxt_rxq.c| 4 
 2 files changed, 4 insertions(+), 8 deletions(-)

diff --git a/drivers/net/bnxt/bnxt_ethdev.c b/drivers/net/bnxt/bnxt_ethdev.c
index f040cdc..e275d3a 100644
--- a/drivers/net/bnxt/bnxt_ethdev.c
+++ b/drivers/net/bnxt/bnxt_ethdev.c
@@ -3037,15 +3037,15 @@ int bnxt_mtu_set_op(struct rte_eth_dev *eth_dev, 
uint16_t new_mtu)
if (!eth_dev->data->nb_rx_queues)
return -ENOTSUP;
 
+   /* Is there a change in mtu setting? */
+   if (eth_dev->data->mtu == new_mtu)
+   return 0;
+
if (new_mtu > RTE_ETHER_MTU)
bp->flags |= BNXT_FLAG_JUMBO;
else
bp->flags &= ~BNXT_FLAG_JUMBO;
 
-   /* Is there a change in mtu setting? */
-   if (eth_dev->data->mtu == new_mtu)
-   return 0;
-
for (i = 0; i < bp->nr_vnics; i++) {
struct bnxt_vnic_info *vnic = &bp->vnic_info[i];
uint16_t size = 0;
diff --git a/drivers/net/bnxt/bnxt_rxq.c b/drivers/net/bnxt/bnxt_rxq.c
index f4b6417..fabbbd4 100644
--- a/drivers/net/bnxt/bnxt_rxq.c
+++ b/drivers/net/bnxt/bnxt_rxq.c
@@ -402,10 +402,6 @@ int bnxt_rx_queue_setup_op(struct rte_eth_dev *eth_dev,
rxq->rx_started = rxq->rx_deferred_start ? false : true;
rxq->vnic = BNXT_GET_DEFAULT_VNIC(bp);
 
-   /* Configure mtu if it is different from what was configured before */
-   if (!queue_idx)
-   bnxt_mtu_set_op(eth_dev, eth_dev->data->mtu);
-
return 0;
 err:
bnxt_rx_queue_release_op(eth_dev, queue_idx);
-- 
2.10.1



[dpdk-dev] [PATCH 8/8] net/bnxt: fix the check for autoneg enablement in the PHY FW

2022-06-15 Thread Kalesh A P
From: Somnath Kotur 

The current combination of checks to determine whether autoneg is
enabled in the card is a bit convoluted and may become incorrect
as well in the future as one of the fields being used - auto_link_speed
might become deprecated.
Switch to using the 'auto_mode' field obtained from the response of
HWRM_PHY_QCFG cmd as that is always deterministically set by the PHY
FW.
Fixed a bug in the 40G check to only look for the bit setting and
not the actual value.
Also, check the forced speeds first before trying to enforce the
auto speeds

Allow the user to set autoneg speed in all cases except for PAM4 200G
as PAM4 200G will come up only in forced mode.

Fixes: c23f9ded0391 ("net/bnxt: support 200G PAM4 link")

Signed-off-by: Somnath Kotur 
Reviewed-by: Kalesh AP 
Reviewed-by: Ajit Khaparde 
---
 drivers/net/bnxt/bnxt_hwrm.c | 18 +++---
 1 file changed, 11 insertions(+), 7 deletions(-)

diff --git a/drivers/net/bnxt/bnxt_hwrm.c b/drivers/net/bnxt/bnxt_hwrm.c
index 206ac20..9c52573 100644
--- a/drivers/net/bnxt/bnxt_hwrm.c
+++ b/drivers/net/bnxt/bnxt_hwrm.c
@@ -3235,9 +3235,11 @@ int bnxt_set_hwrm_link_config(struct bnxt *bp, bool 
link_up)
if (!link_up)
goto port_phy_cfg;
 
+   /* Get user requested autoneg setting */
autoneg = bnxt_check_eth_link_autoneg(dev_conf->link_speeds);
+
if (BNXT_CHIP_P5(bp) &&
-   dev_conf->link_speeds == RTE_ETH_LINK_SPEED_40G) {
+   dev_conf->link_speeds & RTE_ETH_LINK_SPEED_40G) {
/* 40G is not supported as part of media auto detect.
 * The speed should be forced and autoneg disabled
 * to configure 40G speed.
@@ -3246,11 +3248,13 @@ int bnxt_set_hwrm_link_config(struct bnxt *bp, bool 
link_up)
autoneg = 0;
}
 
-   /* No auto speeds and no auto_pam4_link. Disable autoneg */
-   if (bp->link_info->auto_link_speed == 0 &&
-   bp->link_info->link_signal_mode &&
-   bp->link_info->auto_pam4_link_speed_mask == 0)
+   /* Override based on current Autoneg setting in PHY for 200G */
+   if (autoneg == 1 && BNXT_CHIP_P5(bp) && bp->link_info->auto_mode == 0 &&
+   bp->link_info->force_pam4_link_speed ==
+   HWRM_PORT_PHY_CFG_INPUT_FORCE_PAM4_LINK_SPEED_200GB) {
autoneg = 0;
+   PMD_DRV_LOG(DEBUG, "Disabling autoneg for 200G\n");
+   }
 
speed = bnxt_parse_eth_link_speed(dev_conf->link_speeds,
  bp->link_info);
@@ -3283,14 +3287,14 @@ int bnxt_set_hwrm_link_config(struct bnxt *bp, bool 
link_up)
else if (bp->link_info->force_pam4_link_speed)
link_req.link_speed =
bp->link_info->force_pam4_link_speed;
+   else if (bp->link_info->force_link_speed)
+   link_req.link_speed = bp->link_info->force_link_speed;
else if (bp->link_info->auto_pam4_link_speed_mask)
link_req.link_speed =
bp->link_info->auto_pam4_link_speed_mask;
else if (bp->link_info->support_pam4_speeds)
link_req.link_speed =
bp->link_info->support_pam4_speeds;
-   else if (bp->link_info->force_link_speed)
-   link_req.link_speed = bp->link_info->force_link_speed;
else
link_req.link_speed = bp->link_info->auto_link_speed;
/* Auto PAM4 link speed is zero, but auto_link_speed is not
-- 
2.10.1



Re: [PATCH 00/12] Fix compilation with gcc 12

2022-06-15 Thread Thomas Monjalon
15/06/2022 16:45, Stephen Hemminger:
> On Wed, 15 Jun 2022 10:49:17 +0200
> David Marchand  wrote:
> 
> > On Wed, May 18, 2022 at 12:17 PM David Marchand
> >  wrote:
> > >
> > > Fedora 36 is out since early may and comes with gcc 12.
> > > This series fixes compilation or waives some checks.
> > >
> > > There might be something fishy with rte_memcpy on x86 but, for now,
> > > the rte_memcpy related fixes are on the caller side.
> > >
> > > Some "base" drivers have issues, I chose the simple solution of waiving
> > > the checks for them.
> > >
> > > Compilation is the only thing checked.
> > > Please driver maintainers, check nothing got broken.  
> > 
> > I applied the patches that got acked and that had no objection or
> > comment from maintainers (i.e. patch 3, 4, 5, 6, 9, 11).
> > I also cleaned the mess in bugzilla where we had multiple reports of
> > the same issues, or stale bugs that I can't reproduce with released
> > gcc 12.
> > 
> > I'll respin separately the patches for which I have clear comments,
> > and drop my patches waiving the compiler checks.
> > 
> > We still need to agree on the best approach to handle the new checks.
> > We have two rfc series from Stephen, how do we move forward?
> 
> Lets fix all the bugs and remove any workarounds using pragma's.
> 
> Some of them may mean removing rte_memcpy where it is not needed.

What about your series Stephen?
Please would you like to respin?




RE: [EXT] [PATCH 1/3] ipsec: fix NAT-T ports and length

2022-06-15 Thread Akhil Goyal
> --
> Fix the UDP header fields, wrong byte order used for src and dst port
> and wrong offset used when updating UDP datagram length.
> 
> Fixes: 01eef5907fc3 ("ipsec: support NAT-T")
> Cc: sta...@dpdk.org
> 
> Signed-off-by: Radu Nicolau 
Series applied to dpdk-next-crypto

Thanks.


RE: [EXT] [PATCH v2] test/crypto: fix warnings for optimization=1 build

2022-06-15 Thread Akhil Goyal
> Fail IPSec ESN and antireplay cases, if there are no packets. Fixes
> following warning when using optimization=1 build flag with GCC 11.
> 
> ../app/test/test_cryptodev.c: In function ‘test_ipsec_pkt_replay’:
> ../app/test/test_cryptodev.c:10074:15: warning: ‘td_outb’ may be used
> uninitialized [-Wmaybe-uninitialized]
>  ret = test_ipsec_proto_process(td_outb, td_inb, nb_pkts, true,
>^~~~
> &flags);
> ~~~
> ../app/test/test_cryptodev.c:9150:1: note: by argument 1 of type ‘const struct
> ipsec_test_data[]’ to ‘test_ipsec_proto_process’ declared here
>  test_ipsec_proto_process(const struct ipsec_test_data td[],
>  ^~~~
> ../app/test/test_cryptodev.c:10056:32: note: ‘td_outb’ declared here
>  struct ipsec_test_data td_outb[IPSEC_TEST_PACKETS_MAX];
> ^~~
> 
> Bugzilla ID: 1032
> Fixes: d02c6bfcb99a ("test/crypto: add ESN and antireplay cases")
> Cc: sta...@dpdk.org
> 
> Reported-by: Daxue Gao 
> Signed-off-by: Rahul Lakkireddy 
Applied to dpdk-next-crypto

Thanks.


RE: [PATCH] examples/ipsec-secgw: fix ESN setting

2022-06-15 Thread Akhil Goyal
> >
> > Fix ESN option flag and initial value for the rte_ipsec library path.
> >
> > Fixes: 560029d5cfc9 ("examples/ipsec-secgw: define initial ESN value")
> > Cc: sta...@dpdk.org
> >
> > Signed-off-by: Radu Nicolau 
> Acked-by: Fan Zhang 
Applied to dpdk-next-crypto

Thanks.


RE: [PATCH 1/1] examples/ipsec-secgw: add option for descriptors per QP

2022-06-15 Thread Akhil Goyal
> > Added option to configure number of queue pair descriptors via command
> > line (--desc-nb NUMBER_OF_DESC).
> >
> > When the crypto processing takes longer durations, small queue pair size
> > would result in cryptodev enqueue failures. Larger queue pair size would
> > allow more packets to stay in flight simultaneously and reduce enqueue
> > failures.
> >
> > Signed-off-by: Volodymyr Fialko 
> 
> Acked-by: Anoob Joseph 

Acked-by: Akhil Goyal 

Applied to dpdk-next-crypto



Re: [PATCH 00/12] Fix compilation with gcc 12

2022-06-15 Thread Stephen Hemminger
On Wed, 15 Jun 2022 16:59:51 +0200
Thomas Monjalon  wrote:

> 15/06/2022 16:45, Stephen Hemminger:
> > On Wed, 15 Jun 2022 10:49:17 +0200
> > David Marchand  wrote:
> >   
> > > On Wed, May 18, 2022 at 12:17 PM David Marchand
> > >  wrote:  
> > > >
> > > > Fedora 36 is out since early may and comes with gcc 12.
> > > > This series fixes compilation or waives some checks.
> > > >
> > > > There might be something fishy with rte_memcpy on x86 but, for now,
> > > > the rte_memcpy related fixes are on the caller side.
> > > >
> > > > Some "base" drivers have issues, I chose the simple solution of waiving
> > > > the checks for them.
> > > >
> > > > Compilation is the only thing checked.
> > > > Please driver maintainers, check nothing got broken.
> > > 
> > > I applied the patches that got acked and that had no objection or
> > > comment from maintainers (i.e. patch 3, 4, 5, 6, 9, 11).
> > > I also cleaned the mess in bugzilla where we had multiple reports of
> > > the same issues, or stale bugs that I can't reproduce with released
> > > gcc 12.
> > > 
> > > I'll respin separately the patches for which I have clear comments,
> > > and drop my patches waiving the compiler checks.
> > > 
> > > We still need to agree on the best approach to handle the new checks.
> > > We have two rfc series from Stephen, how do we move forward?  
> > 
> > Lets fix all the bugs and remove any workarounds using pragma's.
> > 
> > Some of them may mean removing rte_memcpy where it is not needed.  
> 
> What about your series Stephen?
> Please would you like to respin?
> 
> 

Yes will recollate based on current main branch.


RE: [PATCH] examples/ipsec-secgw: fix packet type parsing

2022-06-15 Thread Akhil Goyal
> > Add new packet type flags instead of overwriting.
> >
> > Fixes: d04bb1c52647 ("examples/ipsec-secgw: use HW parsed packet type in
> > poll mode")
> >
> > Signed-off-by: Radu Nicolau 
> Acked-by: Fan Zhang 

Acked-by: Akhil Goyal 

Applied to dpdk-next-crypto

Thanks.


RE: [PATCH 0/2] add zuc-256 support

2022-06-15 Thread Akhil Goyal
> Adds support for zuc-256 cipher in cnxk crypto PMD.
> 
> Ankur Dwivedi (2):
>   common/cnxk: swap zuc-256 key
>   crypto/cnxk: swap zuc-256 iv
> 
>  drivers/common/cnxk/roc_se.c  |  7 +--
>  drivers/common/cnxk/roc_se.h  | 22 ++
>  drivers/crypto/cnxk/cnxk_se.h | 24 +---
>  3 files changed, 40 insertions(+), 13 deletions(-)
> 
Applied to dpdk-next-crypto

Thanks.


RE: [PATCH] crypto/cnxk: fix condition check

2022-06-15 Thread Akhil Goyal


> 
> > Subject: [PATCH] crypto/cnxk: fix condition check
> >
> > The value of ec_mdata cannot be NULL, instead check that value of
> > private_data_offset was set.
> >
> >
> > Fixes: c3a498c01121 ("crypto/cnxk: add event metadata set operation")
> > Coverity issue: 378861 Coverity issue: 378865
> >
> > Signed-off-by: Volodymyr Fialko 
> 
> Acked-by: Anoob Joseph 

Applied to dpdk-next-crypto

Thanks.


RE: [PATCH 1/2] crypto/octeontx: pass sub event type in event

2022-06-15 Thread Akhil Goyal
> Subject: [PATCH 1/2] crypto/octeontx: pass sub event type in event
> 
> Response info may have valid sub_event_type. Pass this to the event
> generated by CPT.
> 
> Signed-off-by: Anoob Joseph 
> Reviewed-by: Shijith Thotton 
> Reviewed-by: Jerin Jacob Kollanukkaran 
Series applied to dpdk-next-crypto

Thanks.


RE: [EXT] [PATCH v4] app/test-crypto-perf: support SDAP for PDCP operations

2022-06-15 Thread Akhil Goyal
> Add a command line option "--enable-sdap" to enable
> Service Data Adaptation Protocol.
> 
> example command:
> ./dpdk-test-crypto-perf -c 0xc  --log-level=8 --
> --devtype crypto_dpaa2_sec --optype pdcp --cipher-algo aes-ctr
> --cipher-op encrypt --auth-algo zuc-eia3 --auth-op generate
> --auth-key-sz 16 --ptest throughput --total-ops 10 --burst-sz 64
> --buffer-sz 64,390,1512  --pool-sz 4096 --silent --pdcp-sn-sz 12
> --pdcp-domain control --enable-sdap
> 
> Signed-off-by: Gagandeep Singh 
Acked-by: Akhil Goyal 

Applied to dpdk-next-crypto
Thanks.


Re: [PATCH 1/5] usertools: use non-strict when json-loads in telemetry

2022-06-15 Thread Bruce Richardson
On Wed, Jun 15, 2022 at 03:54:57PM +0200, Morten Brørup wrote:
> > From: Chengwen Feng [mailto:fengcheng...@huawei.com]
> > Sent: Wednesday, 15 June 2022 09.39
> > 
> > Use 'strict=False' in json-loads, it will ignore control characters
> > (e.g. '\n\t'), this patch is prepared for the support of telemetry dump
> > in the future.
> > 
> > Signed-off-by: Chengwen Feng 
> > ---
> >  usertools/dpdk-telemetry.py | 2 +-
> >  1 file changed, 1 insertion(+), 1 deletion(-)
> > 
> > diff --git a/usertools/dpdk-telemetry.py b/usertools/dpdk-telemetry.py
> > index a81868a547..63f8004566 100755
> > --- a/usertools/dpdk-telemetry.py
> > +++ b/usertools/dpdk-telemetry.py
> > @@ -27,7 +27,7 @@ def read_socket(sock, buf_len, echo=True):
> >  """ Read data from socket and return it in JSON format """
> >  reply = sock.recv(buf_len).decode()
> >  try:
> > -ret = json.loads(reply)
> > +ret = json.loads(reply, strict=False)
> >  except json.JSONDecodeError:
> >  print("Error in reply: ", reply)
> >  sock.close()
> > --
> > 2.33.0
> > 
> 
> As I understand this patch, it accepts non-JSON data from the telemetry 
> socket.
> 
> Isn't it is a major protocol violation if the telemetry socket produces 
> output requiring this modification? Doing that would break other applications 
> expecting strictly JSON formatted output from the telemetry socket.
> 

I would tend to agree.

As an alternative, I think you should add to the telemetry library an
"escape string" function which can then be used by the telemetry functions
to properly json encode the strings back from the dump functions before
sending them out.

/Bruce


Re: [PATCH] net/octeontx_ep: updated ethdev ops

2022-06-15 Thread Jerin Jacob
On Thu, Jun 9, 2022 at 8:50 PM Sathesh B Edara  wrote:
>
> Updated ethdev ops support for link_update(),
> stats_get() and stats_reset()

a) Split as two patches (Suggested heading)
1) net/octeontx_ep: support basic stats

2) net/octeontx_ep: support link status

b) Update doc/guides/nics/features/octeontx_ep.ini
See doc/guides/nics/features.rst

>
> Signed-off-by: Sathesh B Edara 

Since this is your first path, you may consider changing to  "Sathesh
Edara "

> +static int
> +otx_ep_link_update(struct rte_eth_dev *eth_dev, int wait_to_complete)
> +{
> +   struct rte_eth_link link;
> +
> +   RTE_SET_USED(wait_to_complete);
> +
> +   if (!eth_dev->data->dev_started)
> +   return 0;
> +
> +   link.link_speed = RTE_ETH_SPEED_NUM_10G;

Isn't this RTE_ETH_SPEED_NUM_NONE? as its speed is "Not defined"

>


[PATCH v2 0/2] add includes to help when editing in Eclipse

2022-06-15 Thread Bruce Richardson
These two small patches add in some additional header includes to 3 DPDK
files, in order to improve the experience of anyone who, like me,
sometimes uses eclipse-cdt as an editor for working with DPDK code.
The C indexer in eclipse seems easily confused when missing some of
our macro definitions, so we can reduce that confusion by ensuring
that we include all needed headers to get those definitions.

V2: fix copy-paste error

Bruce Richardson (2):
  examples/l3fwd: add include for macro definition
  common/cnxk: add include for macro definition

 drivers/common/cnxk/roc_io.h | 2 ++
 examples/l3fwd/l3fwd_em.h| 2 ++
 examples/l3fwd/l3fwd_em_sequential.h | 2 ++
 3 files changed, 6 insertions(+)

--
2.34.1



[PATCH v2 2/2] common/cnxk: add include for macro definition

2022-06-15 Thread Bruce Richardson
The header file "roc_io.h" uses the "__plt_always_inline" macro but
don't include "roc_platform.h" to get the definition of it. This
inclusion is not necessary for compilation, but the lack of it can
confuse some indexers - such as those in eclipse, which reports the
lines:

"static __plt_always_inline uint64_t"

as possible definitions of a variable called "uint64_t". This confusion
leads to uint64_t being flagged as an unknown type in all other parts of
the project being indexed, e.g. across all of DPDK code.

Adding in the include of roc_platform.h makes it clear to the indexer
that those lines are  part of a function definition, and that allows
eclipse to correctly recognise uint64_t as a type from stdint.h

Signed-off-by: Bruce Richardson 
---
V2: fix copy-paste error, where header name replacement was missed
---
 drivers/common/cnxk/roc_io.h | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/drivers/common/cnxk/roc_io.h b/drivers/common/cnxk/roc_io.h
index 62e98d9d00..9d73e263f7 100644
--- a/drivers/common/cnxk/roc_io.h
+++ b/drivers/common/cnxk/roc_io.h
@@ -5,6 +5,8 @@
 #ifndef _ROC_IO_H_
 #define _ROC_IO_H_

+#include "roc_platform.h" /* for __plt_always_inline macro */
+
 #define ROC_LMT_BASE_ID_GET(lmt_addr, lmt_id)  
\
do {   \
/* 32 Lines per core */\
--
2.34.1



[PATCH v2 1/2] examples/l3fwd: add include for macro definition

2022-06-15 Thread Bruce Richardson
The header files "l3fwd_em.h" and "l3fwd_em_sequential.h" use the
"__rte_always_inline" macro but don't directly include "rte_common.h" to
get the definition of it. This inclusion is not necessary for
compilation, but the lack of it can confuse some indexers - such as
those in eclipse, which reports the lines:

"static __rte_always_inline uint16_t"

as possible definitions of a variable called "uint16_t". This confusion
leads to uint16_t being flagged as an unknown type in all other parts of
the project being indexed, e.g. across all of DPDK code.

Adding in the include of rte_common.h makes it clear to the indexer that
those lines are  part of a function definition, and that allows eclipse
to correctly recognise uint16_t as a type from stdint.h

Signed-off-by: Bruce Richardson 
---
 examples/l3fwd/l3fwd_em.h| 2 ++
 examples/l3fwd/l3fwd_em_sequential.h | 2 ++
 2 files changed, 4 insertions(+)

diff --git a/examples/l3fwd/l3fwd_em.h b/examples/l3fwd/l3fwd_em.h
index 11bd951858..fe2ee59f6a 100644
--- a/examples/l3fwd/l3fwd_em.h
+++ b/examples/l3fwd/l3fwd_em.h
@@ -5,6 +5,8 @@
 #ifndef __L3FWD_EM_H__
 #define __L3FWD_EM_H__
 
+#include 
+
 static __rte_always_inline uint16_t
 l3fwd_em_handle_ipv4(struct rte_mbuf *m, uint16_t portid,
 struct rte_ether_hdr *eth_hdr, struct lcore_conf *qconf)
diff --git a/examples/l3fwd/l3fwd_em_sequential.h 
b/examples/l3fwd/l3fwd_em_sequential.h
index f426c508ef..d2f75edb8a 100644
--- a/examples/l3fwd/l3fwd_em_sequential.h
+++ b/examples/l3fwd/l3fwd_em_sequential.h
@@ -5,6 +5,8 @@
 #ifndef __L3FWD_EM_SEQUENTIAL_H__
 #define __L3FWD_EM_SEQUENTIAL_H__
 
+#include 
+
 /**
  * @file
  * This is an optional implementation of packet classification in Exact-Match
-- 
2.34.1



[PATCH] common/iavf: replace zero-length arrays with flexible ones

2022-06-15 Thread 835703180
From: Shiqi Liu <835703...@qq.com>

This patch replaces instances of zero-sized arrays i.e. those at the
end of structures with "[0]" with the more standard syntax of "[]".
Replacement was done using coccinelle script, with some revert and
cleanup of whitespace afterwards.

Signed-off-by: Shiqi Liu <835703...@qq.com>
---
 drivers/common/iavf/virtchnl_inline_ipsec.h | 24 ++---
 drivers/common/mlx5/mlx5_prm.h  |  4 ++--
 2 files changed, 14 insertions(+), 14 deletions(-)

diff --git a/drivers/common/iavf/virtchnl_inline_ipsec.h 
b/drivers/common/iavf/virtchnl_inline_ipsec.h
index 2f4bf15725..9d5c433d32 100644
--- a/drivers/common/iavf/virtchnl_inline_ipsec.h
+++ b/drivers/common/iavf/virtchnl_inline_ipsec.h
@@ -500,25 +500,25 @@ struct inline_ipsec_msg {
 
union {
/* IPsec request */
-   struct virtchnl_ipsec_sa_cfg sa_cfg[0];
-   struct virtchnl_ipsec_sp_cfg sp_cfg[0];
-   struct virtchnl_ipsec_sa_update sa_update[0];
-   struct virtchnl_ipsec_sa_destroy sa_destroy[0];
-   struct virtchnl_ipsec_sp_destroy sp_destroy[0];
+   struct virtchnl_ipsec_sa_cfg sa_cfg[];
+   struct virtchnl_ipsec_sp_cfg sp_cfg[];
+   struct virtchnl_ipsec_sa_update sa_update [];
+   struct virtchnl_ipsec_sa_destroy sa_destroy[];
+   struct virtchnl_ipsec_sp_destroy sp_destroy[];
 
/* IPsec response */
-   struct virtchnl_ipsec_sa_cfg_resp sa_cfg_resp[0];
-   struct virtchnl_ipsec_sp_cfg_resp sp_cfg_resp[0];
-   struct virtchnl_ipsec_cap ipsec_cap[0];
-   struct virtchnl_ipsec_status ipsec_status[0];
+   struct virtchnl_ipsec_sa_cfg_resp sa_cfg_resp[];
+   struct virtchnl_ipsec_sp_cfg_resp sp_cfg_resp[];
+   struct virtchnl_ipsec_cap ipsec_cap[];
+   struct virtchnl_ipsec_status ipsec_status[];
/* response to del_sa, del_sp, update_sa */
-   struct virtchnl_ipsec_resp ipsec_resp[0];
+   struct virtchnl_ipsec_resp ipsec_resp[];
 
/* IPsec event (no req_id is required) */
-   struct virtchnl_ipsec_event event[0];
+   struct virtchnl_ipsec_event event[];
 
/* Reserved */
-   struct virtchnl_ipsec_sa_read sa_read[0];
+   struct virtchnl_ipsec_sa_read sa_read[];
} ipsec_data;
 } __rte_packed;
 
diff --git a/drivers/common/mlx5/mlx5_prm.h b/drivers/common/mlx5/mlx5_prm.h
index 654e5f44ee..c841d34216 100644
--- a/drivers/common/mlx5/mlx5_prm.h
+++ b/drivers/common/mlx5/mlx5_prm.h
@@ -552,8 +552,8 @@ struct mlx5_umr_wqe {
struct mlx5_wqe_umr_cseg ucseg;
struct mlx5_wqe_mkey_cseg mkc;
union {
-   struct mlx5_wqe_dseg kseg[0];
-   struct mlx5_wqe_umr_bsf_seg bsf[0];
+   struct mlx5_wqe_dseg kseg[];
+   struct mlx5_wqe_umr_bsf_seg bsf[];
};
 } __rte_packed;
 
-- 
2.35.1.windows.2




[V2] common/iavf:replace zero-length arrays with flexible ones

2022-06-15 Thread 835703180
From: Shiqi Liu <835703...@qq.com>

This patch replaces instances of zero-sized arrays i.e. those at
the end of structures with "[0]" with the more standard syntax of "[]".
Replacement was done using coccinelle script, with some revert and
cleanup of whitespace afterwards.
v2:
* revise a error with BRACKET_SPACE

Signed-off-by: Shiqi Liu <835703...@qq.com>
---
 ...ace-zero-length-arrays-with-flexible.patch | 76 +++
 drivers/common/iavf/virtchnl_inline_ipsec.h   |  2 +-
 2 files changed, 77 insertions(+), 1 deletion(-)
 create mode 100644 
0001-common-iavf-replace-zero-length-arrays-with-flexible.patch

diff --git a/0001-common-iavf-replace-zero-length-arrays-with-flexible.patch 
b/0001-common-iavf-replace-zero-length-arrays-with-flexible.patch
new file mode 100644
index 00..25c5658791
--- /dev/null
+++ b/0001-common-iavf-replace-zero-length-arrays-with-flexible.patch
@@ -0,0 +1,76 @@
+From e680b5d91aad9e22d45652efdd85cd2c40105247 Mon Sep 17 00:00:00 2001
+From: Shiqi Liu <835703...@qq.com>
+Date: Thu, 16 Jun 2022 01:12:47 +0800
+Subject: [PATCH] common/iavf: replace zero-length arrays with flexible ones
+
+This patch replaces instances of zero-sized arrays i.e. those at the
+end of structures with "[0]" with the more standard syntax of "[]".
+Replacement was done using coccinelle script, with some revert and
+cleanup of whitespace afterwards.
+
+Signed-off-by: Shiqi Liu <835703...@qq.com>
+---
+ drivers/common/iavf/virtchnl_inline_ipsec.h | 24 ++---
+ drivers/common/mlx5/mlx5_prm.h  |  4 ++--
+ 2 files changed, 14 insertions(+), 14 deletions(-)
+
+diff --git a/drivers/common/iavf/virtchnl_inline_ipsec.h 
b/drivers/common/iavf/virtchnl_inline_ipsec.h
+index 2f4bf15725..9d5c433d32 100644
+--- a/drivers/common/iavf/virtchnl_inline_ipsec.h
 b/drivers/common/iavf/virtchnl_inline_ipsec.h
+@@ -500,25 +500,25 @@ struct inline_ipsec_msg {
+ 
+   union {
+   /* IPsec request */
+-  struct virtchnl_ipsec_sa_cfg sa_cfg[0];
+-  struct virtchnl_ipsec_sp_cfg sp_cfg[0];
+-  struct virtchnl_ipsec_sa_update sa_update[0];
+-  struct virtchnl_ipsec_sa_destroy sa_destroy[0];
+-  struct virtchnl_ipsec_sp_destroy sp_destroy[0];
++  struct virtchnl_ipsec_sa_cfg sa_cfg[];
++  struct virtchnl_ipsec_sp_cfg sp_cfg[];
++  struct virtchnl_ipsec_sa_update sa_update [];
++  struct virtchnl_ipsec_sa_destroy sa_destroy[];
++  struct virtchnl_ipsec_sp_destroy sp_destroy[];
+ 
+   /* IPsec response */
+-  struct virtchnl_ipsec_sa_cfg_resp sa_cfg_resp[0];
+-  struct virtchnl_ipsec_sp_cfg_resp sp_cfg_resp[0];
+-  struct virtchnl_ipsec_cap ipsec_cap[0];
+-  struct virtchnl_ipsec_status ipsec_status[0];
++  struct virtchnl_ipsec_sa_cfg_resp sa_cfg_resp[];
++  struct virtchnl_ipsec_sp_cfg_resp sp_cfg_resp[];
++  struct virtchnl_ipsec_cap ipsec_cap[];
++  struct virtchnl_ipsec_status ipsec_status[];
+   /* response to del_sa, del_sp, update_sa */
+-  struct virtchnl_ipsec_resp ipsec_resp[0];
++  struct virtchnl_ipsec_resp ipsec_resp[];
+ 
+   /* IPsec event (no req_id is required) */
+-  struct virtchnl_ipsec_event event[0];
++  struct virtchnl_ipsec_event event[];
+ 
+   /* Reserved */
+-  struct virtchnl_ipsec_sa_read sa_read[0];
++  struct virtchnl_ipsec_sa_read sa_read[];
+   } ipsec_data;
+ } __rte_packed;
+ 
+diff --git a/drivers/common/mlx5/mlx5_prm.h b/drivers/common/mlx5/mlx5_prm.h
+index 654e5f44ee..c841d34216 100644
+--- a/drivers/common/mlx5/mlx5_prm.h
 b/drivers/common/mlx5/mlx5_prm.h
+@@ -552,8 +552,8 @@ struct mlx5_umr_wqe {
+   struct mlx5_wqe_umr_cseg ucseg;
+   struct mlx5_wqe_mkey_cseg mkc;
+   union {
+-  struct mlx5_wqe_dseg kseg[0];
+-  struct mlx5_wqe_umr_bsf_seg bsf[0];
++  struct mlx5_wqe_dseg kseg[];
++  struct mlx5_wqe_umr_bsf_seg bsf[];
+   };
+ } __rte_packed;
+ 
+-- 
+2.35.1.windows.2
+
diff --git a/drivers/common/iavf/virtchnl_inline_ipsec.h 
b/drivers/common/iavf/virtchnl_inline_ipsec.h
index 9d5c433d32..0781de1fe1 100644
--- a/drivers/common/iavf/virtchnl_inline_ipsec.h
+++ b/drivers/common/iavf/virtchnl_inline_ipsec.h
@@ -502,7 +502,7 @@ struct inline_ipsec_msg {
/* IPsec request */
struct virtchnl_ipsec_sa_cfg sa_cfg[];
struct virtchnl_ipsec_sp_cfg sp_cfg[];
-   struct virtchnl_ipsec_sa_update sa_update [];
+   struct virtchnl_ipsec_sa_update sa_update[];
struct virtchnl_ipsec_sa_destroy sa_destroy[];
struct virtchnl_ipsec_sp_destroy sp_destroy[];
 
-- 
2.35.1.windows.2



Re: [PATCH] common/cnxk: return on fail to init ROC Model

2022-06-15 Thread Jerin Jacob
On Fri, Jun 10, 2022 at 1:44 PM Hanumanth Pothula  wrote:
>
> Return with error on fail to initialize RoC Model.
>
> Signed-off-by: Hanumanth Pothula 


Updated the git commit as follows and applied to
dpdk-next-net-mrvl/for-next-net. Thanks

common/cnxk: handle ROC model init failure

Return with error on fail to initialize ROC model.

Fixes: 014a9e222bac ("common/cnxk: add model init and IO handling API")
Cc: sta...@dpdk.org

Signed-off-by: Hanumanth Pothula 


> ---
>  drivers/common/cnxk/roc_platform.c | 7 ++-
>  1 file changed, 6 insertions(+), 1 deletion(-)
>
> diff --git a/drivers/common/cnxk/roc_platform.c 
> b/drivers/common/cnxk/roc_platform.c
> index ebb6225f4d..443aa8d396 100644
> --- a/drivers/common/cnxk/roc_platform.c
> +++ b/drivers/common/cnxk/roc_platform.c
> @@ -37,7 +37,12 @@ roc_plt_init(void)
> plt_err("Failed to reserve mem for 
> roc_model");
> return -ENOMEM;
> }
> -   roc_model_init(mz->addr);
> +   if (roc_model_init(mz->addr)) {
> +   plt_err("Failed to init roc_model");
> +
> +   rte_memzone_free(mz);
> +   return -EINVAL;
> +   }
> }
> } else {
> if (mz == NULL) {
> --
> 2.25.1
>


RE: [PATCH 1/5] usertools: use non-strict when json-loads in telemetry

2022-06-15 Thread Morten Brørup
+CC Ciara Power, Telemetry lib maintainer

> From: Bruce Richardson [mailto:bruce.richard...@intel.com]
> Sent: Wednesday, 15 June 2022 18.55
> 
> On Wed, Jun 15, 2022 at 03:54:57PM +0200, Morten Brørup wrote:
> > > From: Chengwen Feng [mailto:fengcheng...@huawei.com]
> > > Sent: Wednesday, 15 June 2022 09.39
> > >
> > > Use 'strict=False' in json-loads, it will ignore control characters
> > > (e.g. '\n\t'), this patch is prepared for the support of telemetry
> dump
> > > in the future.
> > >
> > > Signed-off-by: Chengwen Feng 
> > > ---
> > >  usertools/dpdk-telemetry.py | 2 +-
> > >  1 file changed, 1 insertion(+), 1 deletion(-)
> > >
> > > diff --git a/usertools/dpdk-telemetry.py b/usertools/dpdk-
> telemetry.py
> > > index a81868a547..63f8004566 100755
> > > --- a/usertools/dpdk-telemetry.py
> > > +++ b/usertools/dpdk-telemetry.py
> > > @@ -27,7 +27,7 @@ def read_socket(sock, buf_len, echo=True):
> > >  """ Read data from socket and return it in JSON format """
> > >  reply = sock.recv(buf_len).decode()
> > >  try:
> > > -ret = json.loads(reply)
> > > +ret = json.loads(reply, strict=False)
> > >  except json.JSONDecodeError:
> > >  print("Error in reply: ", reply)
> > >  sock.close()
> > > --
> > > 2.33.0
> > >
> >
> > As I understand this patch, it accepts non-JSON data from the
> telemetry socket.
> >
> > Isn't it is a major protocol violation if the telemetry socket
> produces output requiring this modification? Doing that would break
> other applications expecting strictly JSON formatted output from the
> telemetry socket.
> >
> 
> I would tend to agree.
> 
> As an alternative, I think you should add to the telemetry library an
> "escape string" function which can then be used by the telemetry
> functions
> to properly json encode the strings back from the dump functions before
> sending them out.

Instead of adding a separate JSON encode function, the rte_tel_data_string() 
and rte_tel_data_add_array_string() functions should simply JSON encode the 
provided strings if required. Their descriptions do not mention any 
requirements to the strings provided, and being control plane functions, I 
would certainly expect them to JSON encode.

Warning: Although I consider such a change a bugfix, others might consider it 
an ABI breakage. :-(

@Ciara, what is your opinion about my suggestion here?

For minimal changes, RTE_TEL_MAX_STRING_LEN and RTE_TEL_MAX_SINGLE_STRING_LEN 
should keep their meaning, i.e. define the maximum length of the string after 
any JSON encoding.

And optionally, new rte_tel_data_[array_]string_raw() performance optimized 
functions could be provided for strings known not to require any encoding.



Re: [PATCH v2 2/3] eal: fix trace init fail with long file-prefix

2022-06-15 Thread Jerin Jacob
On Wed, Jun 15, 2022 at 7:32 PM David Marchand
 wrote:
>
> On Tue, Jun 14, 2022 at 8:06 AM Chengwen Feng  wrote:
> >
> > Bug scenario:
> > 1. start testpmd:
> > dpdk-testpmd -l 4-6 -a :7d:00.0 --trace=.* \
> > -file-prefix=trace_autotest -- -i
> > 2. then observed:
> > EAL: eal_trace_init():94 failed to initialize trace [File exists]
>
> This is not directly related to the issue being fixed, but this error
> log is really obscure..
> Can we enhance this, maybe in a followup patch?
>
>
> > EAL: FATAL: Cannot init trace
> > EAL: Cannot init trace
> > EAL: Error - exiting with code: 1
> >
> > The root cause it that the offset set wrong with long file-prefix and
> > then lead the strftime return failed.
> >
> > Fixes: 321dd5f8fa62 ("trace: add internal init and fini interface")
> > Cc: sta...@dpdk.org
> >
> > Signed-off-by: Chengwen Feng 
>
> Reviewed-by: David Marchand 

Acked-by: Jerin Jacob 


Re: [EXT] Re: [PATCH v2] devtools: ensure proper tag sequence

2022-06-15 Thread Thomas Monjalon
15/06/2022 10:46, Jakub Palider:
> > -Original Message-
> > From: Thomas Monjalon 
> > Sent: Wednesday, June 15, 2022 8:21 AM
> > To: Jakub Palider 
> > Cc: dev@dpdk.org; david.march...@redhat.com
> > Subject: [EXT] Re: [PATCH v2] devtools: ensure proper tag sequence
> > 
> > External Email
> > 
> > --
> > 14/06/2022 00:21, jpali...@marvell.com:
> > > + SEQ[0] = "Coverity issue";
> > > + SEQ[1] = "Bugzilla ID";
> > > + SEQ[2] = "Fixes";
> > > + SEQ[3] = "Cc";
> > > + SEQ[4] = "^$";
> > > + SEQ[5] = "Suggested-by";
> > > + SEQ[6] = "Reported-by";
> > > + SEQ[7] = "Signed-off-by";
> > > + latest = 0;
> > 
> > Do you think you could check that Review, Ack and Test are added after the
> > first Signed-off?
> 
> Ok, thank you for clarification. So any further Signed-offs need not to be in 
> a particular 
> sequence as long as the first one precedes Review/Ack/Test (these 3 in any 
> order).

Yes

> > > +Tag order
> > > +~
> > > +
> > > +There is a pattern indicating how certain tags should relate to each 
> > > other.
> > > +
> > > +Example of proper tag sequence::
> > > +
> > > + Coverity issue:
> > > + Bugzilla ID:
> > > + Fixes:
> > > + Cc:
> > > +
> > > + Suggested-by:
> > > + Reported-by:
> > > + Signed-off-by:
> > 
> > Given it is an example, you could add Reviewed-by, Acked-by and Tested-by.
> 
> Sure, I will restore it.
> 
> > > +
> > > +Between first and second tag section there is and empty line.
> > > +
> > > +While ``Signed-off-by:`` is an obligatory tag and must exists in each
> > > +commit, all other tags are optional. Any tag, as long as it is in
> > > +proper location to other adjacent tags (if present), may occur multiple
> > times.
> > > +
> > > +Other tags shall be laid out in a chronological order.
> > 
> > Yes, after the first Signed-off-by.
> 
> Side question: what about Sponsored-by, I noticed it appeared just recently?

It is a way to add the name of a company wishing to appear.
There is no constraint on it, it can be anywhere I think.
If it is in chronological order, it looks better before the Signed-off.
I think we should not advertise it here, because real names are more useful.




  1   2   >