[PATCH] crypto/qat: fix cleanup function default behaviour
Full cookie data should be cleared by default if not algorithm specified. This is protection in case algorithm specific cleanup is not implemented. Fixes: 3b78aa7b2317 ("crypto/qat: refactor asymmetric crypto functions") Signed-off-by: Arek Kusztal --- drivers/crypto/qat/qat_asym.c | 4 1 file changed, 4 insertions(+) diff --git a/drivers/crypto/qat/qat_asym.c b/drivers/crypto/qat/qat_asym.c index e4ab4bf33d..6e43a6cac8 100644 --- a/drivers/crypto/qat/qat_asym.c +++ b/drivers/crypto/qat/qat_asym.c @@ -144,6 +144,10 @@ cleanup(struct qat_asym_op_cookie *cookie, cleanup_arrays(cookie, QAT_ASYM_RSA_NUM_IN_PARAMS, QAT_ASYM_RSA_NUM_OUT_PARAMS, alg_size); } + } else { + cleanup_arrays(cookie, QAT_ASYM_MAX_PARAMS, + QAT_ASYM_MAX_PARAMS, + QAT_PKE_MAX_LN_SIZE); } } -- 2.13.6
Re: [PATCH 2/2] common/cnxk: add include for macro definition
14/06/2022 14:29, Bruce Richardson: > The header file "roc_io.h" uses the "__plt_always_inline" macro but > don't include "roc_platform.h" to get the definition of it. This > inclusion is not necessary for compilation, but the lack of it can > confuse some indexers - such as those in eclipse, which reports the > lines: > > "static __plt_always_inline uint64_t" > > as possible definitions of a variable called "uint64_t". This confusion > leads to uint64_t being flagged as an unknown type in all other parts of > the project being indexed, e.g. across all of DPDK code. > > Adding in the include of rte_common.h makes it clear to the indexer that > those lines are part of a function definition, and that allows eclipse > to correctly recognise uint64_t as a type from stdint.h It is not rte_common.h Looks like a copy past of the first patch.
[PATCH v2] vdpa/ifc: fix null pointer dereference
Fix null pointer dereference reported in coverity scan. Coverity issue: 378882 Fixes: 5d75517beffe ("vdpa/ifc/base: access correct register for blk device") Signed-off-by: Andy Pei --- drivers/vdpa/ifc/base/ifcvf.c | 9 + 1 file changed, 9 insertions(+) diff --git a/drivers/vdpa/ifc/base/ifcvf.c b/drivers/vdpa/ifc/base/ifcvf.c index dd475a7..0a9f71a 100644 --- a/drivers/vdpa/ifc/base/ifcvf.c +++ b/drivers/vdpa/ifc/base/ifcvf.c @@ -255,6 +255,10 @@ u32 ring_state; cfg = hw->common_cfg; + if (!cfg) { + DEBUGOUT("common_cfg in HW is NULL.\n"); + return; + } IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg->msix_config); for (i = 0; i < hw->nr_vring; i++) { @@ -262,6 +266,11 @@ IFCVF_WRITE_REG16(0, &cfg->queue_enable); IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg->queue_msix_vector); + if (!hw->lm_cfg) { + DEBUGOUT("live migration cfg in HW is NULL.\n"); + continue; + } + if (hw->device_type == IFCVF_BLK) ring_state = *(u32 *)(hw->lm_cfg + IFCVF_LM_RING_STATE_OFFSET + -- 1.8.3.1
RE: [PATCH v7 2/5] raw/ifpga: add N3000 AFU driver
Hi Wei, Some comments. Thanks, Rosen > -Original Message- > From: Huang, Wei > Sent: Thursday, June 09, 2022 15:37 > To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com; > hemant.agra...@nxp.com > Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei > ; Zhang, Qi Z ; Huang, Wei > > Subject: [PATCH v7 2/5] raw/ifpga: add N3000 AFU driver > > N3000 AFU includes NLB0 and DMA modules, NLB0 is used to test PCI bus > and DMA is used to test local memory. > This driver initialize the modules and report test result. > > Signed-off-by: Wei Huang > --- > v2: move source files to ifpga and rename, refine code > --- > v3: fix Ubuntu 20.04 ARM build > --- > drivers/raw/ifpga/afu_pmd_core.h | 19 + > drivers/raw/ifpga/afu_pmd_n3000.c | 2019 > + > drivers/raw/ifpga/afu_pmd_n3000.h | 339 +++ > drivers/raw/ifpga/meson.build |3 +- > drivers/raw/ifpga/rte_pmd_afu.h | 97 ++ > 5 files changed, 2476 insertions(+), 1 deletion(-) > create mode 100644 drivers/raw/ifpga/afu_pmd_n3000.c > create mode 100644 drivers/raw/ifpga/afu_pmd_n3000.h > create mode 100644 drivers/raw/ifpga/rte_pmd_afu.h > > diff --git a/drivers/raw/ifpga/afu_pmd_core.h > b/drivers/raw/ifpga/afu_pmd_core.h > index 4fad2c7..a938172 100644 > --- a/drivers/raw/ifpga/afu_pmd_core.h > +++ b/drivers/raw/ifpga/afu_pmd_core.h > @@ -14,6 +14,7 @@ > #include > > #include > +#include > #include > #include > > @@ -60,6 +61,24 @@ struct afu_rawdev { > return rawdev ? (struct afu_rawdev *)rawdev->dev_private : NULL; > } > > +#define CLS_TO_SIZE(n) ((n) << 6) /* get size of n cache lines */ > +#define SIZE_TO_CLS(s) ((s) >> 6) /* convert size to number of cache lines > */ > +#define MHZ(f) ((f) * 100) > + > +#define dsm_poll_timeout(addr, val, cond, invl, timeout) \ > +({ \ > + uint64_t __wait = 0; \ > + uint64_t __invl = (invl);\ > + uint64_t __timeout = (timeout); \ > + for (; __wait <= __timeout; __wait += __invl) { \ > + (val) = *(addr); \ > + if (cond)\ > + break; \ > + rte_delay_ms(__invl);\ > + }\ > + (cond) ? 0 : 1; \ > +}) > + > void afu_pmd_register(struct afu_rawdev_drv *driver); > void afu_pmd_unregister(struct afu_rawdev_drv *driver); > > diff --git a/drivers/raw/ifpga/afu_pmd_n3000.c > b/drivers/raw/ifpga/afu_pmd_n3000.c > new file mode 100644 > index 000..14f7fe0 > --- /dev/null > +++ b/drivers/raw/ifpga/afu_pmd_n3000.c > @@ -0,0 +1,2019 @@ > +/* SPDX-License-Identifier: BSD-3-Clause > + * Copyright(c) 2022 Intel Corporation > + */ > + > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > + > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > + > +#include "afu_pmd_core.h" > +#include "afu_pmd_n3000.h" > + > +static int nlb_afu_config(struct afu_rawdev *dev) > +{ > + struct n3000_afu_priv *priv = NULL; > + struct rte_pmd_afu_nlb_cfg *cfg = NULL; > + struct nlb_csr_cfg v; > + > + if (!dev) > + return -EINVAL; > + > + if (!dev->priv) > + return -ENOENT; > + > + priv = (struct n3000_afu_priv *)dev->priv; > + cfg = &priv->nlb_cfg; > + > + v.csr = 0; > + > + if (cfg->cont) > + v.cont = 1; > + > + if (cfg->cache_policy == NLB_WRPUSH_I) > + v.wrpush_i = 1; > + else > + v.wrthru_en = cfg->cache_policy; > + > + if (cfg->cache_hint == NLB_RDLINE_MIXED) > + v.rdsel = 3; > + else > + v.rdsel = cfg->cache_hint; > + > + v.mode = cfg->mode; > + v.chsel = cfg->read_vc; > + v.wr_chsel = cfg->write_vc; > + v.wrfence_chsel = cfg->wrfence_vc; > + v.wrthru_en = cfg->cache_policy; > + v.multicl_len = cfg->multi_cl - 1; > + > + IFPGA_RAWDEV_PMD_DEBUG("cfg: 0x%08x", v.csr); > + rte_write32(v.csr, priv->nlb_ctx.addr + CSR_CFG); > + > + return 0; > +} > + > +static void nlb_afu_report(struct afu_rawdev *dev, uint32_t cl) > +{ > + struct n3000_afu_priv *priv = NULL; > + struct rte_pmd_afu_nlb_cfg *cfg = NULL; > + struct nlb_dsm_status *stat = NULL; > + uint64_t ticks = 0; > + double num, rd_bw, wr_bw; > + > + if (!dev || !dev->priv) > + return; > + > + priv = (struct n3000_afu_priv *)dev->priv; > + > + cfg = &priv->nlb_cfg; > + stat = priv->nlb_ctx.status_ptr; > + > + if (cfg->cont) > + ticks = stat->num_clocks - stat->start_overhead; > + else > + ticks = st
RE: [PATCH] vdpa/ifc: fix null pointer dereference
Hi Chenbo, Thanks for your reply. I will fix this in next version. > -Original Message- > From: Xia, Chenbo > Sent: Wednesday, June 15, 2022 10:04 AM > To: Pei, Andy ; dev@dpdk.org > Cc: maxime.coque...@redhat.com; Wang, Xiao W ; > Xu, Rosen ; Xiao, QimaiX > Subject: RE: [PATCH] vdpa/ifc: fix null pointer dereference > > Hi Andy, > > > -Original Message- > > From: Pei, Andy > > Sent: Wednesday, June 8, 2022 3:34 PM > > To: dev@dpdk.org > > Cc: Xia, Chenbo ; maxime.coque...@redhat.com; > > Wang, Xiao W ; Xu, Rosen ; > > Xiao, QimaiX > > Subject: [PATCH] vdpa/ifc: fix null pointer dereference > > > > Fix null pointer dereference reported in coverity scan. > > > > Coverity issue: 378882 > > Fixes: 8162a4a9 ("vdpa/ifc/base: access correct register for blk > > device") > > Commit id need to be 12 digits. Also pls leave a blank line between Fixes tag > and signed-off > > Thanks, > Chenbo > > > Signed-off-by: Andy Pei > > --- > > drivers/vdpa/ifc/base/ifcvf.c | 9 + > > 1 file changed, 9 insertions(+) > > > > diff --git a/drivers/vdpa/ifc/base/ifcvf.c > > b/drivers/vdpa/ifc/base/ifcvf.c index dd475a7..0a9f71a 100644 > > --- a/drivers/vdpa/ifc/base/ifcvf.c > > +++ b/drivers/vdpa/ifc/base/ifcvf.c > > @@ -255,6 +255,10 @@ > > u32 ring_state; > > > > cfg = hw->common_cfg; > > + if (!cfg) { > > + DEBUGOUT("common_cfg in HW is NULL.\n"); > > + return; > > + } > > > > IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg->msix_config); > > for (i = 0; i < hw->nr_vring; i++) { @@ -262,6 +266,11 @@ > > IFCVF_WRITE_REG16(0, &cfg->queue_enable); > > IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg- > > >queue_msix_vector); > > > > + if (!hw->lm_cfg) { > > + DEBUGOUT("live migration cfg in HW is NULL.\n"); > > + continue; > > + } > > + > > if (hw->device_type == IFCVF_BLK) > > ring_state = *(u32 *)(hw->lm_cfg + > > IFCVF_LM_RING_STATE_OFFSET + > > -- > > 1.8.3.1
RE: [PATCH v7 3/5] raw/ifpga: add HE-LPBK AFU driver
Hi Wei, > -Original Message- > From: Huang, Wei > Sent: Thursday, June 09, 2022 15:37 > To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com; > hemant.agra...@nxp.com > Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei > ; Zhang, Qi Z ; Huang, Wei > > Subject: [PATCH v7 3/5] raw/ifpga: add HE-LPBK AFU driver > > HE-LPBK and HE-MEM-LPBK are host exerciser modules in OFS FPGA, HE- > LPBK is used to test PCI bus and HE-MEM-LPBK is used to test local memory. > This driver initialize the modules and report test result. > > Signed-off-by: Wei Huang > --- > v2: move source files to ifpga and rename, refine code > --- > drivers/raw/ifpga/afu_pmd_he_lpbk.c | 436 > > drivers/raw/ifpga/afu_pmd_he_lpbk.h | 126 +++ > drivers/raw/ifpga/meson.build | 2 +- > drivers/raw/ifpga/rte_pmd_afu.h | 14 ++ > 4 files changed, 577 insertions(+), 1 deletion(-) create mode 100644 > drivers/raw/ifpga/afu_pmd_he_lpbk.c > create mode 100644 drivers/raw/ifpga/afu_pmd_he_lpbk.h > > diff --git a/drivers/raw/ifpga/afu_pmd_he_lpbk.c > b/drivers/raw/ifpga/afu_pmd_he_lpbk.c > new file mode 100644 > index 000..8b2c85b > --- /dev/null > +++ b/drivers/raw/ifpga/afu_pmd_he_lpbk.c > @@ -0,0 +1,436 @@ > +/* SPDX-License-Identifier: BSD-3-Clause > + * Copyright(c) 2022 Intel Corporation > + */ > + > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > + > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > + > +#include "afu_pmd_core.h" > +#include "afu_pmd_he_lpbk.h" > + > +static int he_lpbk_afu_config(struct afu_rawdev *dev) { > + struct he_lpbk_priv *priv = NULL; > + struct rte_pmd_afu_he_lpbk_cfg *cfg = NULL; > + struct he_lpbk_csr_cfg v; > + > + if (!dev) > + return -EINVAL; > + > + priv = (struct he_lpbk_priv *)dev->priv; > + if (!priv) > + return -ENOENT; > + > + cfg = &priv->he_lpbk_cfg; > + > + v.csr = 0; > + > + if (cfg->cont) > + v.cont = 1; > + > + v.mode = cfg->mode; > + v.trput_interleave = cfg->trput_interleave; > + if (cfg->multi_cl == 4) > + v.multicl_len = 2; > + else > + v.multicl_len = cfg->multi_cl - 1; > + > + IFPGA_RAWDEV_PMD_DEBUG("cfg: 0x%08x", v.csr); > + rte_write32(v.csr, priv->he_lpbk_ctx.addr + CSR_CFG); > + > + return 0; > +} > + > +static void he_lpbk_report(struct afu_rawdev *dev, uint32_t cl) { > + struct he_lpbk_priv *priv = NULL; > + struct rte_pmd_afu_he_lpbk_cfg *cfg = NULL; > + struct he_lpbk_ctx *ctx = NULL; > + struct he_lpbk_dsm_status *stat = NULL; > + struct he_lpbk_status0 stat0; > + struct he_lpbk_status1 stat1; > + uint64_t swtest_msg = 0; > + uint64_t ticks = 0; > + uint64_t info = 0; > + double num, rd_bw, wr_bw; > + > + if (!dev || !dev->priv) > + return; > + > + priv = (struct he_lpbk_priv *)dev->priv; > + cfg = &priv->he_lpbk_cfg; > + ctx = &priv->he_lpbk_ctx; > + > + stat = ctx->status_ptr; > + > + swtest_msg = rte_read64(ctx->addr + CSR_SWTEST_MSG); > + stat0.csr = rte_read64(ctx->addr + CSR_STATUS0); > + stat1.csr = rte_read64(ctx->addr + CSR_STATUS1); > + > + if (cfg->cont) > + ticks = stat->num_clocks - stat->start_overhead; > + else > + ticks = stat->num_clocks - > + (stat->start_overhead + stat->end_overhead); > + > + if (cfg->freq_mhz == 0) { > + info = rte_read64(ctx->addr + CSR_HE_INFO0); > + IFPGA_RAWDEV_PMD_INFO("API version: %"PRIx64, info >> > 16); > + cfg->freq_mhz = info & 0x; > + if (cfg->freq_mhz == 0) { > + IFPGA_RAWDEV_PMD_INFO("Frequency of AFU > clock is unknown." > + " Assuming 350 MHz."); > + cfg->freq_mhz = 350; > + } > + } > + > + num = (double)stat0.num_reads; > + rd_bw = (num * CLS_TO_SIZE(1) * MHZ(cfg->freq_mhz)) / ticks; > + num = (double)stat0.num_writes; > + wr_bw = (num * CLS_TO_SIZE(1) * MHZ(cfg->freq_mhz)) / ticks; > + > + printf("Cachelines Read_Count Write_Count Pend_Read > Pend_Write " > + "Clocks@%uMHz Rd_Bandwidth Wr_Bandwidth\n", > + cfg->freq_mhz); > + printf("%10u %10u %10u %10u %10u %12"PRIu64 > + " %7.3f GB/s %7.3f GB/s\n", > + cl, stat0.num_reads, stat0.num_writes, > + stat1.num_pend_reads, stat1.num_pend_writes, > + ticks, rd_bw / 1e9, wr_bw / 1e9); > + printf("Test Message: 0x%"PRIx64"\n", swtest_msg); } > + > +static int he_lpbk_test(struct afu_rawdev *dev) { > + struct he_lpbk_priv *priv = NULL; > + struct rte_pmd_afu_he_lpbk_cfg *cfg = NULL; > + struct he_lpbk_ctx *ctx = NULL; > + struct he_lpbk_csr_ctl ctl; > + uin
RE: [PATCH v7 4/5] raw/ifpga: add HE-MEM AFU driver
Hi Wei, > -Original Message- > From: Huang, Wei > Sent: Thursday, June 09, 2022 15:37 > To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com; > hemant.agra...@nxp.com > Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei > ; Zhang, Qi Z ; Huang, Wei > > Subject: [PATCH v7 4/5] raw/ifpga: add HE-MEM AFU driver > > HE-MEM is one of the host exerciser modules in OFS FPGA, which is used to > test local memory with built-in traffic generator. > This driver initialize the module and report test result. > > Signed-off-by: Wei Huang > Acked-by: Tianfei Zhang > --- > v2: move source files to ifpga and rename > --- > drivers/raw/ifpga/afu_pmd_he_mem.c | 183 > + > drivers/raw/ifpga/afu_pmd_he_mem.h | 46 ++ > drivers/raw/ifpga/meson.build | 2 +- > drivers/raw/ifpga/rte_pmd_afu.h| 7 ++ > 4 files changed, 237 insertions(+), 1 deletion(-) create mode 100644 > drivers/raw/ifpga/afu_pmd_he_mem.c > create mode 100644 drivers/raw/ifpga/afu_pmd_he_mem.h > > diff --git a/drivers/raw/ifpga/afu_pmd_he_mem.c > b/drivers/raw/ifpga/afu_pmd_he_mem.c > new file mode 100644 > index 000..0f57a03 > --- /dev/null > +++ b/drivers/raw/ifpga/afu_pmd_he_mem.c > @@ -0,0 +1,183 @@ > +/* SPDX-License-Identifier: BSD-3-Clause > + * Copyright(c) 2022 Intel Corporation > + */ > + > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > + > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > + > +#include "afu_pmd_core.h" > +#include "afu_pmd_he_mem.h" > + > +static int he_mem_tg_test(struct afu_rawdev *dev) { > + struct he_mem_tg_priv *priv = NULL; > + struct rte_pmd_afu_he_mem_tg_cfg *cfg = NULL; > + struct he_mem_tg_ctx *ctx = NULL; > + uint64_t value = 0x12345678; > + uint64_t cap = 0; > + uint64_t channel_mask = 0; > + int i, t = 0; > + > + if (!dev) > + return -EINVAL; > + > + priv = (struct he_mem_tg_priv *)dev->priv; > + if (!priv) > + return -ENOENT; > + > + cfg = &priv->he_mem_tg_cfg; > + ctx = &priv->he_mem_tg_ctx; > + > + IFPGA_RAWDEV_PMD_DEBUG("Channel mask: 0x%x", cfg- > >channel_mask); > + > + rte_write64(value, ctx->addr + MEM_TG_SCRATCHPAD); > + cap = rte_read64(ctx->addr + MEM_TG_SCRATCHPAD); > + IFPGA_RAWDEV_PMD_DEBUG("Scratchpad value: 0x%"PRIx64, cap); > + if (cap != value) { > + IFPGA_RAWDEV_PMD_ERR("Test scratchpad register failed"); > + return -EIO; > + } > + > + cap = rte_read64(ctx->addr + MEM_TG_CTRL); > + IFPGA_RAWDEV_PMD_DEBUG("Capability: 0x%"PRIx64, cap); > + > + channel_mask = cfg->channel_mask & cap; > + /* start traffic generators */ > + rte_write64(channel_mask, ctx->addr + MEM_TG_CTRL); > + > + /* check test status */ > + while (t < MEM_TG_TIMEOUT_MS) { > + value = rte_read64(ctx->addr + MEM_TG_STAT); > + for (i = 0; i < NUM_MEM_TG_CHANNELS; i++) { > + if (channel_mask & (1 << i)) { > + if (TGACTIVE(value, i)) > + continue; > + printf("TG channel %d test %s\n", i, > + TGPASS(value, i) ? "pass" : > + TGTIMEOUT(value, i) ? "timeout" : > + TGFAIL(value, i) ? "fail" : "error"); > + channel_mask &= ~(1 << i); > + } > + } > + if (!channel_mask) > + break; > + rte_delay_ms(MEM_TG_POLL_INTERVAL_MS); > + t += MEM_TG_POLL_INTERVAL_MS; > + } > + > + if (channel_mask) { > + IFPGA_RAWDEV_PMD_ERR("Timeout 0x%04lx", (unsigned > long)value); > + return channel_mask; > + } > + > + return 0; > +} > + > +static int he_mem_tg_init(struct afu_rawdev *dev) { > + struct he_mem_tg_priv *priv = NULL; > + struct he_mem_tg_ctx *ctx = NULL; > + > + if (!dev) > + return -EINVAL; > + > + priv = (struct he_mem_tg_priv *)dev->priv; > + if (!priv) { > + priv = rte_zmalloc(NULL, sizeof(struct he_mem_tg_priv), 0); > + if (!priv) > + return -ENOMEM; > + dev->priv = priv; > + } > + > + ctx = &priv->he_mem_tg_ctx; > + ctx->addr = (uint8_t *)dev->addr; > + > + return 0; > +} > + > +static int he_mem_tg_config(struct afu_rawdev *dev, void *config, > + size_t config_size) > +{ > + struct he_mem_tg_priv *priv = NULL; > + > + if (!dev || !config || !config_size) > + return -EINVAL; > + > + priv = (struct he_mem_tg_priv *)dev->priv; > + if (!priv) > + return -ENOENT; > + > + if (config_size != sizeof(struct rte_pmd_afu_he_mem_tg_cfg)) > + return -EINVAL; > + >
[PATCH v2] vdpa/ifc: fix null pointer dereference
Fix null pointer dereference reported in coverity scan. Coverity issue: 378882 Fixes: 5d75517beffe ("vdpa/ifc/base: access correct register for blk device") Signed-off-by: Andy Pei Acked-by: Xiao Wang --- drivers/vdpa/ifc/base/ifcvf.c | 9 + 1 file changed, 9 insertions(+) diff --git a/drivers/vdpa/ifc/base/ifcvf.c b/drivers/vdpa/ifc/base/ifcvf.c index dd475a7..0a9f71a 100644 --- a/drivers/vdpa/ifc/base/ifcvf.c +++ b/drivers/vdpa/ifc/base/ifcvf.c @@ -255,6 +255,10 @@ u32 ring_state; cfg = hw->common_cfg; + if (!cfg) { + DEBUGOUT("common_cfg in HW is NULL.\n"); + return; + } IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg->msix_config); for (i = 0; i < hw->nr_vring; i++) { @@ -262,6 +266,11 @@ IFCVF_WRITE_REG16(0, &cfg->queue_enable); IFCVF_WRITE_REG16(IFCVF_MSI_NO_VECTOR, &cfg->queue_msix_vector); + if (!hw->lm_cfg) { + DEBUGOUT("live migration cfg in HW is NULL.\n"); + continue; + } + if (hw->device_type == IFCVF_BLK) ring_state = *(u32 *)(hw->lm_cfg + IFCVF_LM_RING_STATE_OFFSET + -- 1.8.3.1
Re: [PATCH] config: remove explicit undef of unset values
On Thu, Dec 16, 2021 at 12:15 PM Bruce Richardson wrote: > > Rather than explicitly clearing any setting of undefined values in our > rte_config.h file, it's better to instead just add a comment that the > value is not set. Using a comment allows the user to set the value using > CFLAGS or similar mechanism without the config file clearing the value > again. > > The text used " is not set" is modelled after the kernel approach > of doing the same thing. > > Signed-off-by: Bruce Richardson Acked-by: Morten Brørup Acked-by: Tyler Retzlaff Rebased, added check as agreed, and applied. Thanks Bruce. -- David Marchand
RE: [PATCH v7 5/5] raw/ifpga: add HE-HSSI AFU driver
Hi Wei, > -Original Message- > From: Huang, Wei > Sent: Thursday, June 09, 2022 15:37 > To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com; > hemant.agra...@nxp.com > Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei > ; Zhang, Qi Z ; Huang, Wei > > Subject: [PATCH v7 5/5] raw/ifpga: add HE-HSSI AFU driver > > HE-HSSI is one of the host exerciser modules in OFS FPGA, which is used to > test HSSI (High Speed Serial Interface). > This driver initialize the module and report test result. > > Signed-off-by: Wei Huang > --- > v2: move source files to ifpga and rename, refine code > --- > drivers/raw/ifpga/afu_pmd_he_hssi.c | 371 > > drivers/raw/ifpga/afu_pmd_he_hssi.h | 109 +++ > drivers/raw/ifpga/meson.build | 3 +- > drivers/raw/ifpga/rte_pmd_afu.h | 18 ++ > 4 files changed, 500 insertions(+), 1 deletion(-) create mode 100644 > drivers/raw/ifpga/afu_pmd_he_hssi.c > create mode 100644 drivers/raw/ifpga/afu_pmd_he_hssi.h > > diff --git a/drivers/raw/ifpga/afu_pmd_he_hssi.c > b/drivers/raw/ifpga/afu_pmd_he_hssi.c > new file mode 100644 > index 000..102de50 > --- /dev/null > +++ b/drivers/raw/ifpga/afu_pmd_he_hssi.c > @@ -0,0 +1,371 @@ > +/* SPDX-License-Identifier: BSD-3-Clause > + * Copyright(c) 2022 Intel Corporation > + */ > + > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > + > +#include > +#include > +#include > +#include > +#include > +#include > +#include > +#include > + > +#include "afu_pmd_core.h" > +#include "afu_pmd_he_hssi.h" > + > +static int he_hssi_indirect_write(struct he_hssi_ctx *ctx, uint32_t addr, > + uint32_t value) > +{ > + struct traffic_ctrl_cmd cmd; > + struct traffic_ctrl_data data; > + uint32_t i = 0; > + > + IFPGA_RAWDEV_PMD_DEBUG("Indirect write 0x%x, value 0x%08x", > addr, > +value); > + > + if (!ctx) > + return -EINVAL; > + > + data.write_data = value; > + rte_write64(data.csr, ctx->addr + TRAFFIC_CTRL_DATA); > + > + cmd.csr = 0; > + cmd.write_cmd = 1; > + cmd.afu_cmd_addr = addr; > + rte_write64(cmd.csr, ctx->addr + TRAFFIC_CTRL_CMD); > + > + while (i < MAILBOX_TIMEOUT_MS) { > + rte_delay_ms(MAILBOX_POLL_INTERVAL_MS); > + cmd.csr = rte_read64(ctx->addr + TRAFFIC_CTRL_CMD); > + if (cmd.ack_trans) > + break; > + i += MAILBOX_POLL_INTERVAL_MS; > + } > + if (i >= MAILBOX_TIMEOUT_MS) > + return -ETIMEDOUT; > + > + i = 0; > + cmd.csr = 0; > + while (i < MAILBOX_TIMEOUT_MS) { > + cmd.ack_trans = 1; > + rte_write64(cmd.csr, ctx->addr + TRAFFIC_CTRL_CMD); > + rte_delay_ms(MAILBOX_POLL_INTERVAL_MS); > + cmd.csr = rte_read64(ctx->addr + TRAFFIC_CTRL_CMD); > + if (!cmd.ack_trans) > + break; > + i += MAILBOX_POLL_INTERVAL_MS; > + } > + if (i >= MAILBOX_TIMEOUT_MS) > + return -ETIMEDOUT; > + > + return 0; > +} > + > +static int he_hssi_indirect_read(struct he_hssi_ctx *ctx, uint32_t addr, > + uint32_t *value) > +{ > + struct traffic_ctrl_cmd cmd; > + struct traffic_ctrl_data data; > + uint32_t i = 0; > + > + if (!ctx) > + return -EINVAL; > + > + cmd.csr = 0; > + cmd.read_cmd = 1; > + cmd.afu_cmd_addr = addr; > + rte_write64(cmd.csr, ctx->addr + TRAFFIC_CTRL_CMD); > + > + while (i < MAILBOX_TIMEOUT_MS) { > + rte_delay_ms(MAILBOX_POLL_INTERVAL_MS); > + cmd.csr = rte_read64(ctx->addr + TRAFFIC_CTRL_CMD); > + if (cmd.ack_trans) { > + data.csr = rte_read64(ctx->addr + > TRAFFIC_CTRL_DATA); > + *value = data.read_data; > + break; > + } > + i += MAILBOX_POLL_INTERVAL_MS; > + } > + if (i >= MAILBOX_TIMEOUT_MS) > + return -ETIMEDOUT; > + > + i = 0; > + cmd.csr = 0; > + while (i < MAILBOX_TIMEOUT_MS) { > + cmd.ack_trans = 1; > + rte_write64(cmd.csr, ctx->addr + TRAFFIC_CTRL_CMD); > + rte_delay_ms(MAILBOX_POLL_INTERVAL_MS); > + cmd.csr = rte_read64(ctx->addr + TRAFFIC_CTRL_CMD); > + if (!cmd.ack_trans) > + break; > + i += MAILBOX_POLL_INTERVAL_MS; > + } > + if (i >= MAILBOX_TIMEOUT_MS) > + return -ETIMEDOUT; > + > + IFPGA_RAWDEV_PMD_DEBUG("Indirect read 0x%x, value 0x%08x", > addr, *value); > + return 0; > +} > + > +static void he_hssi_report(struct he_hssi_ctx *ctx) { > + uint32_t val = 0; > + uint64_t v64 = 0; > + int ret = 0; > + > + ret = he_hssi_indirect_read(ctx, TM_PKT_GOOD, &val); > + if (ret) > + return; > + printf("Number of good packets received: %u\n", val); > + > + ret = he_hssi_indir
[Bug 1035] __rte_raw_cksum() crash with misaligned pointer
https://bugs.dpdk.org/show_bug.cgi?id=1035 Bug ID: 1035 Summary: __rte_raw_cksum() crash with misaligned pointer Product: DPDK Version: 21.11 Hardware: All OS: All Status: UNCONFIRMED Severity: normal Priority: Normal Component: ethdev Assignee: dev@dpdk.org Reporter: emil.b...@ericsson.com Target Milestone: --- See rte_raw_cksum() in rte_ip.h, which is part of the public API. See also the subfunction __rte_raw_cksum(). _rte_raw_cksum assumes that the buffer over which the checksum is calculated is an even address (divisible by two). See for example this stack overflow post: https://stackoverflow.com/questions/46790550/c-undefined-behavior-strict-aliasing-rule-or-incorrect-alignment The post explains that there is undefined behavior in C11 when "conversion between two pointer types produces a result that is incorrectly aligned". When the buf argument starts on an odd address we thus have undefined behavior, since a pointer is cast from void* to uint16_t*. In most cases (at least on x86) that isn't a problem, but with higher optimization levels it may break due to vector instructions. This new function seems to be easier to optimize by the compiler, resulting in a crash when the buf argument is odd. Please note that the undefined behavior is present in earlier versions of dpdk as well. Now you're probably thinking: "Just align your buffers". The problem is that we have a packet buffer which is aligned. The checksum is calculated on a subset of that aligned packet buffer, and that sometimes lies on odd addresses. The question remains if this is an issue with dpdk or not. Maybe you do the assumption that odd addresses are never passed in when calculating the checksum, and that's all right. But perhaps a public API comment should be added in that case. Or perhaps you use it incorrectly as well, and in that case something should be done about it. Triggering this bug seems to require high optimization levels as well as odd values of buf. Of interest may be that in older version of dpdk you added the a comment: /* workaround gcc strict-aliasing warning */ showing that you willfully ignored the warning. -- You are receiving this mail because: You are the assignee for the bug.
RE: [PATCH v1] raw/ifpga: free file handle before function return
Hi Wei, > -Original Message- > From: Huang, Wei > Sent: Thursday, June 09, 2022 16:50 > To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com; > hemant.agra...@nxp.com > Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei > ; Zhang, Qi Z ; Huang, Wei > > Subject: [PATCH v1] raw/ifpga: free file handle before function return > > Coverity issue: 379064 > Fixes: 673c897f4d73 ("raw/ifpga: support OFS card probing") > > Signed-off-by: Wei Huang > --- > drivers/raw/ifpga/base/ifpga_enumerate.c | 28 +++ > - > 1 file changed, 19 insertions(+), 9 deletions(-) > > diff --git a/drivers/raw/ifpga/base/ifpga_enumerate.c > b/drivers/raw/ifpga/base/ifpga_enumerate.c > index 7a5d264..61eb660 100644 > --- a/drivers/raw/ifpga/base/ifpga_enumerate.c > +++ b/drivers/raw/ifpga/base/ifpga_enumerate.c > @@ -837,8 +837,10 @@ static int find_dfls_by_vsec(struct > dfl_fpga_enum_info *info) > vndr_hdr = 0; > ret = pread(fd, &vndr_hdr, sizeof(vndr_hdr), > voff + PCI_VNDR_HEADER); > - if (ret < 0) > - return -EIO; > + if (ret < 0) { > + ret = -EIO; > + goto free_handle; > + } > if (PCI_VNDR_HEADER_ID(vndr_hdr) == > PCI_VSEC_ID_INTEL_DFLS && > pci_data->vendor_id == PCI_VENDOR_ID_INTEL) > break; > @@ -846,19 +848,23 @@ static int find_dfls_by_vsec(struct > dfl_fpga_enum_info *info) > > if (!voff) { > dev_debug(hw, "%s no DFL VSEC found\n", __func__); > - return -ENODEV; > + ret = -ENODEV; > + goto free_handle; > } > > dfl_cnt = 0; > ret = pread(fd, &dfl_cnt, sizeof(dfl_cnt), voff + > PCI_VNDR_DFLS_CNT); > - if (ret < 0) > - return -EIO; > + if (ret < 0) { > + ret = -EIO; > + goto free_handle; > + } > > dfl_res_off = voff + PCI_VNDR_DFLS_RES; > if (dfl_res_off + (dfl_cnt * sizeof(u32)) > PCI_CFG_SPACE_EXP_SIZE) > { > dev_err(hw, "%s DFL VSEC too big for PCIe config space\n", > __func__); > - return -EINVAL; > + ret = -EINVAL; > + goto free_handle; > } > > for (i = 0; i < dfl_cnt; i++, dfl_res_off += sizeof(u32)) { @@ -868,7 > +874,8 @@ static int find_dfls_by_vsec(struct dfl_fpga_enum_info *info) > if (bir >= PCI_MAX_RESOURCE) { > dev_err(hw, "%s bad bir number %d\n", > __func__, bir); > - return -EINVAL; > + ret = -EINVAL; > + goto free_handle; > } > > len = pci_data->region[bir].len; > @@ -876,7 +883,8 @@ static int find_dfls_by_vsec(struct > dfl_fpga_enum_info *info) > if (offset >= len) { > dev_err(hw, "%s bad offset %u >= %"PRIu64"\n", > __func__, offset, len); > - return -EINVAL; > + ret = -EINVAL; > + goto free_handle; > } > > dev_debug(hw, "%s BAR %d offset 0x%x\n", __func__, bir, > offset); @@ -886,7 +894,9 @@ static int find_dfls_by_vsec(struct > dfl_fpga_enum_info *info) > dfl_fpga_enum_info_add_dfl(info, start, len, addr); > } > > - return 0; > +free_handle: > + close(fd); > + return ret; > } > > /* default method of finding dfls starting at offset 0 of bar 0 */ > -- > 1.8.3.1 It looks good for me. Reviewed-by: Rosen Xu
Re: [PATCH 0/3] Fix xmm_t to rte_xmm_t scalar conversion
On Thu, Jun 9, 2022 at 2:17 PM Stanislaw Kardach wrote: > > As David noticed in [1] there is an issue with C++ compilation of the > rte_vect.h header in RISC-V. Upon closer inspection, the problem appears on > all architectures due to the type conversion rules in C++. > More precisely a union type rte_xmm_t requires a conversion constructor > from xmm_t type. > The most obvious fix is to use a structure initializer for such copies > (since rte_xmm_t union contains xmm_t anyway). The generated assembly > at -O2 is exactly the same, so there's no real impact. > > The bigger question is whether accessing bits of the architecture specific > xmm_t type in an array fashion is always correct? All current architectures > define rte_xmm_t in the same manner implying that. Copying other arch maintainers. > > Additionally change RISC-V CI settings to use crossbuild-essential-riscv64 > package which provides tools that enable C++ checks. > > [1] http://mails.dpdk.org/archives/dev/2022-June/243683.html > > Stanislaw Kardach (3): > eal/riscv: fix xmm_t casting for C++ > lpm: fix xmm_t casting for C++ in scalar version > ci: use crossbuild-essential-riscv64 for compiling In any case, this series looks good. Series applied, thanks Stanislaw. -- David Marchand
[PATCH] net/igc: fix segmentation fault in secondary dpdk-symmetric_mp
From: Zhichao Zeng In the secondary dpdk-symmetric_mp process, the "smp_port_init" was skipped, which cause some function pointers not to be initialized, and a segmentation fault occurred when calling these function pointers. This patch assigns initial values to rx_pkt_burst, tx_pkt_burst and tx_pkt_prepare pointers to avoid calling null function pointer. Signed-off-by: Zhichao Zeng --- drivers/net/igc/igc_ethdev.c | 3 +++ drivers/net/igc/igc_ethdev.h | 5 + drivers/net/igc/igc_txrx.c | 6 +++--- 3 files changed, 11 insertions(+), 3 deletions(-) diff --git a/drivers/net/igc/igc_ethdev.c b/drivers/net/igc/igc_ethdev.c index b9933b395d..dd4a028ea1 100644 --- a/drivers/net/igc/igc_ethdev.c +++ b/drivers/net/igc/igc_ethdev.c @@ -1234,6 +1234,9 @@ eth_igc_dev_init(struct rte_eth_dev *dev) dev->rx_queue_count = eth_igc_rx_queue_count; dev->rx_descriptor_status = eth_igc_rx_descriptor_status; dev->tx_descriptor_status = eth_igc_tx_descriptor_status; + dev->rx_pkt_burst = &igc_recv_pkts; + dev->tx_pkt_burst = &igc_xmit_pkts; + dev->tx_pkt_prepare = ð_igc_prep_pkts; /* * for secondary processes, we don't initialize any further as primary diff --git a/drivers/net/igc/igc_ethdev.h b/drivers/net/igc/igc_ethdev.h index f56cad79e9..2fa3d51bba 100644 --- a/drivers/net/igc/igc_ethdev.h +++ b/drivers/net/igc/igc_ethdev.h @@ -263,6 +263,11 @@ struct igc_adapter { #define IGC_DEV_PRIVATE_FLOW_LIST(_dev) \ (&((struct igc_adapter *)(_dev)->data->dev_private)->flow_list) +uint16_t igc_recv_pkts(void *rx_queue, struct rte_mbuf **rx_pkts, uint16_t nb_pkts); +uint16_t igc_xmit_pkts(void *tx_queue, struct rte_mbuf **tx_pkts, uint16_t nb_pkts); +uint16_t eth_igc_prep_pkts(__rte_unused void *tx_queue, struct rte_mbuf **tx_pkts, + uint16_t nb_pkts); + static inline void igc_read_reg_check_set_bits(struct igc_hw *hw, uint32_t reg, uint32_t bits) { diff --git a/drivers/net/igc/igc_txrx.c b/drivers/net/igc/igc_txrx.c index e48d5df11a..753ac8a411 100644 --- a/drivers/net/igc/igc_txrx.c +++ b/drivers/net/igc/igc_txrx.c @@ -345,7 +345,7 @@ rx_desc_get_pkt_info(struct igc_rx_queue *rxq, struct rte_mbuf *rxm, rxm->packet_type = rx_desc_pkt_info_to_pkt_type(pkt_info); } -static uint16_t +uint16_t igc_recv_pkts(void *rx_queue, struct rte_mbuf **rx_pkts, uint16_t nb_pkts) { struct igc_rx_queue * const rxq = rx_queue; @@ -1397,7 +1397,7 @@ eth_igc_rx_queue_setup(struct rte_eth_dev *dev, } /* prepare packets for transmit */ -static uint16_t +uint16_t eth_igc_prep_pkts(__rte_unused void *tx_queue, struct rte_mbuf **tx_pkts, uint16_t nb_pkts) { @@ -1604,7 +1604,7 @@ tx_desc_cksum_flags_to_olinfo(uint64_t ol_flags) return tmp; } -static uint16_t +uint16_t igc_xmit_pkts(void *tx_queue, struct rte_mbuf **tx_pkts, uint16_t nb_pkts) { struct igc_tx_queue * const txq = tx_queue; -- 2.25.1
RE: [PATCH V2] vdpa/mlx5: return correct error code after rte_intr_instance_alloc failed
Hi Shiqi From: 835703...@qq.com <835703...@qq.com> > From: Shiqi Liu <835703...@qq.com> > > To Maxime, > I am so sorry too late to reply your comment.My name is Shiqi Liu. > I got stuck in how to reply the comment before.Now, it seenms to be > ok!Maybe there will be some mistakes I have taken , when I reply,please tell > me how to do it correctly! > Thanks, > Shiqi Liu > > diff --git a/drivers/vdpa/mlx5/mlx5_vdpa_event.c > b/drivers/vdpa/mlx5/mlx5_vdpa_event.c > index 7167a98db0..6223afaae8 100644 > --- a/drivers/vdpa/mlx5/mlx5_vdpa_event.c > +++ b/drivers/vdpa/mlx5/mlx5_vdpa_event.c > @@ -395,6 +395,7 @@ mlx5_vdpa_err_event_setup(struct mlx5_vdpa_priv > *priv) > rte_intr_instance_alloc(RTE_INTR_INSTANCE_F_SHARED); > if (priv->err_intr_handle == NULL) { > DRV_LOG(ERR, "Fail to allocate intr_handle"); > + rte_errno = ENOMEM; Shouldn't rte_intr_instance_alloc set it? > goto error; > } > if (rte_intr_fd_set(priv->err_intr_handle, priv->err_chnl->fd)) > --
RE: [PATCH v7] app/testpmd: add Host Shaper command
From: Spike Du > Add command line options to support host shaper configure. > - Command syntax: > mlx5 set port host_shaper avail_thresh_triggered <0|1> rate > > > - Example commands: > To enable avail_thresh_triggered on port 1 and disable current host > shaper: > testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0 > > To disable avail_thresh_triggered and current host shaper on port 1: > testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0 > > The rate unit is 100Mbps. > To disable avail_thresh_triggered and configure a shaper of 5Gbps on port 1: > testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50 > > Add sample code to handle rxq available descriptor threshold event, it delays > a while so that rxq empties, then disables host shaper and rearms available > descriptor threshold event. > > Signed-off-by: Spike Du Acked-by: Matan Azrad
DPDK Compilation consuming too much memory
Hi, I've been facing an issue while building DPDK (mostly noticeable when building cross compilation on arm64) as follows: meson --werror --buildtype=debugoptimized --cross-file config/arm/arm64_armv8_linux_gcc -Dexamples=all /tmp/build-aarch64-gcc; ninja -C /tmp/build-aarch64-gcc/ I can see the memory footprint for it is around 52G which basically took all my system memory. It happens when linking CNXK drivers. Kindest regards, Raslan Darawsheh
Re: [PATCH v2] net/mlx5: fix build with clang 14
11/05/2022 18:41, Ali Alnubani: > Use fgets instead of fscanf to resolve the following warning > reported by clang 14.0.0 in Fedora 37 (Rawhide): > > drivers/net/mlx5/linux/mlx5_ethdev_os.c:1137:52: error: > 'fscanf' may overflow; destination buffer in argument 3 has size 16, > but the corresponding specifier may require size 17 > [-Werror,-Wfortify-source] > ret = fscanf(file, "%" RTE_STR(IF_NAMESIZE) "s", port_name); Some other patches are proposing to declare the variable of size IF_NAMESIZE+1 but I think it's wrong because IF_NAMESIZE includes a terminating null byte. > Fixes: 63d1db710fbc ("net/mlx5: fix unlimited parsing of switch info") > Cc: michae...@mellanox.com > Cc: sta...@dpdk.org > > Signed-off-by: Ali Alnubani [...] > - ret = fscanf(file, "%" RTE_STR(IF_NAMESIZE) "s", port_name); > - fclose(file); > - if (ret == 1) > + if (fgets(port_name, IF_NAMESIZE, file) != NULL) > mlx5_translate_port_name(port_name, &data); > + fclose(file); Acked-by: Thomas Monjalon
event_eth_rx_intr_adapter_autotest failing
Hi dpdk-dev, Test command: DPDK_TEST=event_eth_rx_intr_adapter_autotest ./dpdk-test event_eth_rx_intr_adapter_autotest is failing while adding interrupt queue to adapter instance. It's failing in rte_eth_dev_rx_intr_ctl_q(), at dev->intr_handle == NULL check 5248 if (!dev->intr_handle) { 5249 RTE_ETHDEV_LOG(ERR, "Rx Intr handle unset\n"); 5250 return -ENOTSUP; 5251 } "net_null" PMD is used for the test. Does "net-null" supports interrupt queue? I'm able to run event_eth_rx_adapter_autotest with 'DPDK_TEST=event_eth_rx_adapter_autotest ./dpdk-test' Could you please let me know how to run event_eth_rx_intr_adapter_autotest? Thanks, Ganapati
RE: [PATCH v2] net/mlx5: fix build with clang 14
> -Original Message- > From: Ali Alnubani > Sent: Wednesday, May 11, 2022 19:41 > To: dev@dpdk.org > Cc: Michael Baum ; sta...@dpdk.org > Subject: [PATCH v2] net/mlx5: fix build with clang 14 > > Use fgets instead of fscanf to resolve the following warning reported by clang > 14.0.0 in Fedora 37 (Rawhide): > > drivers/net/mlx5/linux/mlx5_ethdev_os.c:1137:52: error: > 'fscanf' may overflow; destination buffer in argument 3 has size 16, > but the corresponding specifier may require size 17 > [-Werror,-Wfortify-source] > ret = fscanf(file, "%" RTE_STR(IF_NAMESIZE) "s", port_name); > > Fixes: 63d1db710fbc ("net/mlx5: fix unlimited parsing of switch info") > Cc: michae...@mellanox.com > Cc: sta...@dpdk.org > > Signed-off-by: Ali Alnubani Acked-by: Viacheslav Ovsiienko
[Bug 857] gcc12 build error, lib/mbuf
https://bugs.dpdk.org/show_bug.cgi?id=857 David Marchand (david.march...@redhat.com) changed: What|Removed |Added CC||david.march...@redhat.com Resolution|--- |WORKSFORME Status|UNCONFIRMED |RESOLVED --- Comment #4 from David Marchand (david.march...@redhat.com) --- I can't reproduce with released gcc 12. -- You are receiving this mail because: You are the assignee for the bug.
[Bug 858] gcc12 build error, net/failsafe
https://bugs.dpdk.org/show_bug.cgi?id=858 David Marchand (david.march...@redhat.com) changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|--- |WORKSFORME CC||david.march...@redhat.com --- Comment #1 from David Marchand (david.march...@redhat.com) --- Same as 822, I can't reproduce with released gcc 12. -- You are receiving this mail because: You are the assignee for the bug.
[Bug 860] gcc12 build error, net/ngbe
https://bugs.dpdk.org/show_bug.cgi?id=860 David Marchand (david.march...@redhat.com) changed: What|Removed |Added CC||david.march...@redhat.com Status|UNCONFIRMED |RESOLVED Resolution|--- |WORKSFORME --- Comment #2 from David Marchand (david.march...@redhat.com) --- I can't reproduce with released gcc 12. -- You are receiving this mail because: You are the assignee for the bug.
[Bug 1016] [dpdk-22.07*] drivers/net/ice meson build failed with gcc12.1.1 on Fedora36/64
https://bugs.dpdk.org/show_bug.cgi?id=1016 David Marchand (david.march...@redhat.com) changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED CC||david.march...@redhat.com Resolution|--- |DUPLICATE --- Comment #4 from David Marchand (david.march...@redhat.com) --- Please, don't mix several issues in the same report. I'll focus on the initial report, about net/ice which is a duplicate of bug 852. *** This bug has been marked as a duplicate of bug 852 *** -- You are receiving this mail because: You are the assignee for the bug.
[Bug 849] gcc12 build error, net/ena
https://bugs.dpdk.org/show_bug.cgi?id=849 David Marchand (david.march...@redhat.com) changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED CC||david.march...@redhat.com Resolution|--- |FIXED --- Comment #1 from David Marchand (david.march...@redhat.com) --- Fixed in https://git.dpdk.org/dpdk/commit/?id=244994958466 -- You are receiving this mail because: You are the assignee for the bug.
[Bug 855] gcc12 build error, vdpa/ifc
https://bugs.dpdk.org/show_bug.cgi?id=855 David Marchand (david.march...@redhat.com) changed: What|Removed |Added CC||david.march...@redhat.com Status|UNCONFIRMED |RESOLVED Resolution|--- |FIXED --- Comment #2 from David Marchand (david.march...@redhat.com) --- Fixed in https://git.dpdk.org/dpdk/commit/?id=2a213b794fdd -- You are receiving this mail because: You are the assignee for the bug.
[Bug 856] gcc12 build error, app/test-flow-perf
https://bugs.dpdk.org/show_bug.cgi?id=856 David Marchand (david.march...@redhat.com) changed: What|Removed |Added CC||david.march...@redhat.com Status|UNCONFIRMED |RESOLVED Resolution|--- |FIXED --- Comment #2 from David Marchand (david.march...@redhat.com) --- Fixed in https://git.dpdk.org/dpdk/commit/?id=5fc2eece8d45 -- You are receiving this mail because: You are the assignee for the bug.
Re: [PATCH v2] net/mlx5: fix build with clang 14
On Wed, Jun 15, 2022 at 10:17 AM Thomas Monjalon wrote: > > 11/05/2022 18:41, Ali Alnubani: > > Use fgets instead of fscanf to resolve the following warning > > reported by clang 14.0.0 in Fedora 37 (Rawhide): > > > > drivers/net/mlx5/linux/mlx5_ethdev_os.c:1137:52: error: > > 'fscanf' may overflow; destination buffer in argument 3 has size 16, > > but the corresponding specifier may require size 17 > > [-Werror,-Wfortify-source] > > ret = fscanf(file, "%" RTE_STR(IF_NAMESIZE) "s", port_name); > > Some other patches are proposing to declare the variable > of size IF_NAMESIZE+1 but I think it's wrong because > IF_NAMESIZE includes a terminating null byte. Ack, I rejected my patch accordingly. -- David Marchand
dev@dpdk.org
https://bugs.dpdk.org/show_bug.cgi?id=1033 David Marchand (david.march...@redhat.com) changed: What|Removed |Added CC||david.march...@redhat.com Status|UNCONFIRMED |RESOLVED Resolution|--- |DUPLICATE --- Comment #2 from David Marchand (david.march...@redhat.com) --- Already reported earlier. *** This bug has been marked as a duplicate of bug 856 *** -- You are receiving this mail because: You are the assignee for the bug.
[Bug 1008] Building DPDK with gcc 12.1.1 throws warnings on Fedora 36
https://bugs.dpdk.org/show_bug.cgi?id=1008 David Marchand (david.march...@redhat.com) changed: What|Removed |Added Resolution|--- |INVALID CC||david.march...@redhat.com Status|UNCONFIRMED |RESOLVED --- Comment #1 from David Marchand (david.march...@redhat.com) --- Thanks, there were several other reports for the various issues, I prefer to close this one. Feel free to open a bugzilla for any remaining issue that has no assigned bz. -- You are receiving this mail because: You are the assignee for the bug.
RE: [EXT] Re: [PATCH v2] devtools: ensure proper tag sequence
> -Original Message- > From: Thomas Monjalon > Sent: Wednesday, June 15, 2022 8:21 AM > To: Jakub Palider > Cc: dev@dpdk.org; david.march...@redhat.com > Subject: [EXT] Re: [PATCH v2] devtools: ensure proper tag sequence > > External Email > > -- > 14/06/2022 00:21, jpali...@marvell.com: > > + SEQ[0] = "Coverity issue"; > > + SEQ[1] = "Bugzilla ID"; > > + SEQ[2] = "Fixes"; > > + SEQ[3] = "Cc"; > > + SEQ[4] = "^$"; > > + SEQ[5] = "Suggested-by"; > > + SEQ[6] = "Reported-by"; > > + SEQ[7] = "Signed-off-by"; > > + latest = 0; > > Do you think you could check that Review, Ack and Test are added after the > first Signed-off? Ok, thank you for clarification. So any further Signed-offs need not to be in a particular sequence as long as the first one precedes Review/Ack/Test (these 3 in any order). > > > +Tag order > > +~ > > + > > +There is a pattern indicating how certain tags should relate to each other. > > + > > +Example of proper tag sequence:: > > + > > + Coverity issue: > > + Bugzilla ID: > > + Fixes: > > + Cc: > > + > > + Suggested-by: > > + Reported-by: > > + Signed-off-by: > > Given it is an example, you could add Reviewed-by, Acked-by and Tested-by. Sure, I will restore it. > > + > > +Between first and second tag section there is and empty line. > > + > > +While ``Signed-off-by:`` is an obligatory tag and must exists in each > > +commit, all other tags are optional. Any tag, as long as it is in > > +proper location to other adjacent tags (if present), may occur multiple > times. > > + > > +Other tags shall be laid out in a chronological order. > > Yes, after the first Signed-off-by. Side question: what about Sponsored-by, I noticed it appeared just recently?
Re: [PATCH 00/12] Fix compilation with gcc 12
On Wed, May 18, 2022 at 12:17 PM David Marchand wrote: > > Fedora 36 is out since early may and comes with gcc 12. > This series fixes compilation or waives some checks. > > There might be something fishy with rte_memcpy on x86 but, for now, > the rte_memcpy related fixes are on the caller side. > > Some "base" drivers have issues, I chose the simple solution of waiving > the checks for them. > > Compilation is the only thing checked. > Please driver maintainers, check nothing got broken. I applied the patches that got acked and that had no objection or comment from maintainers (i.e. patch 3, 4, 5, 6, 9, 11). I also cleaned the mess in bugzilla where we had multiple reports of the same issues, or stale bugs that I can't reproduce with released gcc 12. I'll respin separately the patches for which I have clear comments, and drop my patches waiving the compiler checks. We still need to agree on the best approach to handle the new checks. We have two rfc series from Stephen, how do we move forward? -- David Marchand
RE: [PATCH v7 1/5] raw/ifpga: introduce AFU driver framework
> -Original Message- > From: Xu, Rosen > Sent: Wednesday, June 15, 2022 14:55 > To: Huang, Wei ; dev@dpdk.org; > tho...@monjalon.net; nipun.gu...@nxp.com; hemant.agra...@nxp.com > Cc: sta...@dpdk.org; Zhang, Tianfei ; Zhang, Qi Z > > Subject: RE: [PATCH v7 1/5] raw/ifpga: introduce AFU driver framework > > Hi Wei, > > Some comments. > > Thanks, > Rosen > > > -Original Message- > > From: Huang, Wei > > Sent: Thursday, June 09, 2022 15:37 > > To: dev@dpdk.org; tho...@monjalon.net; nipun.gu...@nxp.com; > > hemant.agra...@nxp.com > > Cc: sta...@dpdk.org; Xu, Rosen ; Zhang, Tianfei > > ; Zhang, Qi Z ; Huang, > > Wei > > Subject: [PATCH v7 1/5] raw/ifpga: introduce AFU driver framework > > > > AFU (Acceleration Function Unit) is part of FPGA and enumerated by > > ifpga driver. > > This driver implements common AFU device interfaces and exposes them > > to application as standard raw device APIs. > > Normally application can operate specified AFU as below, 1. call > > rte_rawdev_pmd_get_named_dev() to find AFU device. > > 2. call rte_rawdev_configure() to initialize AFU device. > > 3. call rte_rawdev_selftest() to test AFU device. > > > > Signed-off-by: Wei Huang > > --- > > v2: fix typo > > --- > > v3: fix build error in FreeBSD13-64, UB2004-32 and UB2204-32 > > --- > > v4: fix coding style issue and build error in FreeBSD13-64 > > --- > > v5: split patch into several patches > > --- > > v6: move source files to ifpga and rename, use spinlock > > --- > > drivers/raw/ifpga/afu_pmd_core.c | 453 > > +++ > > drivers/raw/ifpga/afu_pmd_core.h | 76 +++ > > drivers/raw/ifpga/meson.build| 2 +- > > 3 files changed, 530 insertions(+), 1 deletion(-) create mode 100644 > > drivers/raw/ifpga/afu_pmd_core.c create mode 100644 > > drivers/raw/ifpga/afu_pmd_core.h > > > > diff --git a/drivers/raw/ifpga/afu_pmd_core.c > > b/drivers/raw/ifpga/afu_pmd_core.c > > new file mode 100644 > > index 000..a24b517 > > --- /dev/null > > +++ b/drivers/raw/ifpga/afu_pmd_core.c > > @@ -0,0 +1,453 @@ > > +/* SPDX-License-Identifier: BSD-3-Clause > > + * Copyright 2022 Intel Corporation > > + * > > + * AFU is Acceleration Function Unit in FPGA, it is enumerated by ifpga > pmd. > > + * Suppose AFU is found in FPGA at PCI address 31:00.0, then you can > > +create > > + * and test a AFU device by following steps in application. > > + * 1. rte_vdev_init("ifpga_rawdev_cfg0", "ifpga=31:00.0,port=0") > > + * 2. rawdev = rte_rawdev_pmd_get_named_dev("afu_0|31:00.0") > > + * 3. rte_rawdev_configure(rawdev->dev_id, &cfg, sizeof(cfg)) > > + * 4. rte_rawdev_selftest(rawdev->dev_id) > > + * 5. rte_vdev_uninit("ifpga_rawdev_cfg0") > > + * > > + * AFU device name format used in rte_rawdev_pmd_get_named_dev is > > + * "afu_[port]|[BDF]". Please refer to OPAE documentation for the > > +meaning of > > + * port. Each AFU device has specific configuration data, they are > > +defined > > + * in rte_pmd_afu.h. > > + * > > + */ > > + > Could you split these notes? General speaking we just announce license in > head of source file. > Agree > > +#include > > +#include > > +#include > > +#include > > +#include > > +#include > > +#include > > +#include > > +#include > > + > > +#include > > +#include > > +#include > > +#include > > + > > +#include "afu_pmd_core.h" > > + > > +static struct rte_afu_uuid > > afu_pmd_uuid_map[AFU_RAWDEV_MAX_DRVS+1]; > > +TAILQ_HEAD(afu_drv_list, afu_rawdev_drv); static struct afu_drv_list > > +afu_pmd_list = TAILQ_HEAD_INITIALIZER(afu_pmd_list); > > + > > +static inline int afu_rawdev_trylock(struct afu_rawdev *dev) { > > + if (!dev || !dev->sd) > > + return 0; > > + > > + return rte_spinlock_trylock(&dev->sd->lock); > > +} > > + > > +static inline void afu_rawdev_unlock(struct afu_rawdev *dev) { > > + if (!dev || !dev->sd) > > + return; > > + > > + rte_spinlock_unlock(&dev->sd->lock); > > +} > > + > > +static int afu_rawdev_configure(const struct rte_rawdev *rawdev, > > + rte_rawdev_obj_t config, size_t config_size) { > > + struct afu_rawdev *dev = NULL; > > + int ret = 0; > > + > > + IFPGA_RAWDEV_PMD_FUNC_TRACE(); > > + > > + dev = afu_rawdev_get_priv(rawdev); > > + if (!dev) > > + return -ENODEV; > > + > > + if (dev->ops && dev->ops->config) > > + ret = (*dev->ops->config)(dev, config, config_size); > > + > > + return ret; > > +} > > + > > +static int afu_rawdev_start(struct rte_rawdev *rawdev) { > > + struct afu_rawdev *dev = NULL; > > + int ret = 0; > > + > > + IFPGA_RAWDEV_PMD_FUNC_TRACE(); > > + > > + dev = afu_rawdev_get_priv(rawdev); > > + if (!dev) > > + return -ENODEV; > > + > > + ret = afu_rawdev_trylock(dev); > > + if (!ret) { > > + IFPGA_RAWDEV_PMD_WARN("AFU is busy, please start it > > later"); > > + return ret; > > + } > > + > > + if (dev->ops && dev->ops->start) > > + ret = (*dev->ops->start)(dev); > > + > > +
[PATCH v3] vdpa/mlx5: workaround var offset within page
vDPA driver first uses kernel driver to allocate doorbell(VAR) area for each device. Then uses var->mmap_off and var->length to mmap uverbs device file as doorbell userspace virtual address. Current kernel driver provides var->mmap_off equal to page start of VAR. It's fine with x86 4K page server, because VAR physical address is only 4K aligned thus locate in 4K page start. But with aarch64 64K page server, the actual VAR physical address has offset within page(not locate in 64K page start). So vDPA driver need add this within page offset(caps.doorbell_bar_offset) to get right VAR virtual address. Fixes: 62c813706e4 ("vdpa/mlx5: map doorbell") Cc: sta...@dpdk.org Signed-off-by: Yajun Wu Acked-by: Matan Azrad Reviewed-by: Maxime Coquelin --- drivers/vdpa/mlx5/mlx5_vdpa.c | 8 +++- 1 file changed, 7 insertions(+), 1 deletion(-) diff --git a/drivers/vdpa/mlx5/mlx5_vdpa.c b/drivers/vdpa/mlx5/mlx5_vdpa.c index 76fa5d4299..8a33a0c9a1 100644 --- a/drivers/vdpa/mlx5/mlx5_vdpa.c +++ b/drivers/vdpa/mlx5/mlx5_vdpa.c @@ -14,6 +14,7 @@ #include #include #include +#include #include #include @@ -560,6 +561,9 @@ mlx5_vdpa_create_dev_resources(struct mlx5_vdpa_priv *priv) rte_errno = errno; return -rte_errno; } + /* Add within page offset for 64K page system. */ + priv->virtq_db_addr = (char *)priv->virtq_db_addr + + ((rte_mem_page_size() - 1) & priv->caps.doorbell_bar_offset); DRV_LOG(DEBUG, "VAR address of doorbell mapping is %p.", priv->virtq_db_addr); priv->td = mlx5_devx_cmd_create_td(ctx); @@ -705,7 +709,9 @@ mlx5_vdpa_release_dev_resources(struct mlx5_vdpa_priv *priv) if (priv->td) claim_zero(mlx5_devx_cmd_destroy(priv->td)); if (priv->virtq_db_addr) - claim_zero(munmap(priv->virtq_db_addr, priv->var->length)); + /* Mask out the within page offset for munmap. */ + claim_zero(munmap((void *)((uintptr_t)priv->virtq_db_addr & + ~(rte_mem_page_size() - 1)), priv->var->length)); if (priv->var) mlx5_glue->dv_free_var(priv->var); } -- 2.30.2
RE: [PATCH v2] net/mlx5: fix build with clang 14
Hi, > -Original Message- > From: Ali Alnubani > Sent: Wednesday, May 11, 2022 7:41 PM > To: dev@dpdk.org > Cc: Michael Baum ; sta...@dpdk.org > Subject: [PATCH v2] net/mlx5: fix build with clang 14 > > Use fgets instead of fscanf to resolve the following warning reported by > clang 14.0.0 in Fedora 37 (Rawhide): > > drivers/net/mlx5/linux/mlx5_ethdev_os.c:1137:52: error: > 'fscanf' may overflow; destination buffer in argument 3 has size 16, > but the corresponding specifier may require size 17 > [-Werror,-Wfortify-source] > ret = fscanf(file, "%" RTE_STR(IF_NAMESIZE) "s", port_name); > > Fixes: 63d1db710fbc ("net/mlx5: fix unlimited parsing of switch info") > Cc: michae...@mellanox.com > Cc: sta...@dpdk.org > > Signed-off-by: Ali Alnubani > --- > Changes in v2: > - Removed unnecessary variable. > Patch applied to next-net-mlx, Kindest regards, Raslan Darawsheh
[PATCH v2] net/nfp: initialize the flbufsz
When the testpmd start-up, it will check MTU range, if MTU > flubfsz, it will lead testpmd start fail. Because the hw->flbufsz doesn't have the initialized value, so it will lead the bug. Fixes: 97a30c4d4c7b ("net/nfp: make sure MTU is never larger than mbuf size") Cc: sta...@dpdk.org Signed-off-by: Peng Zhang Signed-off-by: Chaoyong He --- drivers/net/nfp/nfp_ethdev.c| 1 + drivers/net/nfp/nfp_ethdev_vf.c | 1 + 2 files changed, 2 insertions(+) diff --git a/drivers/net/nfp/nfp_ethdev.c b/drivers/net/nfp/nfp_ethdev.c index 9166f65da3..1a9f7581a7 100644 --- a/drivers/net/nfp/nfp_ethdev.c +++ b/drivers/net/nfp/nfp_ethdev.c @@ -502,6 +502,7 @@ nfp_net_init(struct rte_eth_dev *eth_dev) hw->cap = nn_cfg_readl(hw, NFP_NET_CFG_CAP); hw->max_mtu = nn_cfg_readl(hw, NFP_NET_CFG_MAX_MTU); hw->mtu = RTE_ETHER_MTU; + hw->flbufsz = RTE_ETHER_MTU; /* VLAN insertion is incompatible with LSOv2 */ if (hw->cap & NFP_NET_CFG_CTRL_LSO2) diff --git a/drivers/net/nfp/nfp_ethdev_vf.c b/drivers/net/nfp/nfp_ethdev_vf.c index 0034d68ea6..0781f34764 100644 --- a/drivers/net/nfp/nfp_ethdev_vf.c +++ b/drivers/net/nfp/nfp_ethdev_vf.c @@ -369,6 +369,7 @@ nfp_netvf_init(struct rte_eth_dev *eth_dev) hw->cap = nn_cfg_readl(hw, NFP_NET_CFG_CAP); hw->max_mtu = nn_cfg_readl(hw, NFP_NET_CFG_MAX_MTU); hw->mtu = RTE_ETHER_MTU; + hw->flbufsz = RTE_ETHER_MTU; /* VLAN insertion is incompatible with LSOv2 */ if (hw->cap & NFP_NET_CFG_CTRL_LSO2) -- 2.27.0
Re: [PATCH 2/2] common/cnxk: add include for macro definition
On Wed, Jun 15, 2022 at 09:06:26AM +0200, Thomas Monjalon wrote: > 14/06/2022 14:29, Bruce Richardson: > > The header file "roc_io.h" uses the "__plt_always_inline" macro but > > don't include "roc_platform.h" to get the definition of it. This > > inclusion is not necessary for compilation, but the lack of it can > > confuse some indexers - such as those in eclipse, which reports the > > lines: > > > > "static __plt_always_inline uint64_t" > > > > as possible definitions of a variable called "uint64_t". This confusion > > leads to uint64_t being flagged as an unknown type in all other parts of > > the project being indexed, e.g. across all of DPDK code. > > > > Adding in the include of rte_common.h makes it clear to the indexer that > > those lines are part of a function definition, and that allows eclipse > > to correctly recognise uint64_t as a type from stdint.h > > It is not rte_common.h > Looks like a copy past of the first patch. > Yes, it's a copy paste error. I got most instances replaced but missed one (or at least I don't see a second miss).
Re: [PATCH v7] app/testpmd: add Host Shaper command
Few comments below, 14/06/2022 14:01, Spike Du: > Add command line options to support host shaper configure. > - Command syntax: > mlx5 set port host_shaper avail_thresh_triggered <0|1> rate > > > - Example commands: > To enable avail_thresh_triggered on port 1 and disable current host > shaper: > testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0 > > To disable avail_thresh_triggered and current host shaper on port 1: > testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0 > > The rate unit is 100Mbps. > To disable avail_thresh_triggered and configure a shaper of 5Gbps on > port 1: > testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50 > > Add sample code to handle rxq available descriptor threshold event, it > delays a while so that rxq empties, then disables host shaper and > rearms available descriptor threshold event. > > Signed-off-by: Spike Du [...] > @@ -3659,6 +3662,10 @@ struct pmd_test_command { > break; > printf("Received avail_thresh event, port:%d > rxq_id:%d\n", > port_id, rxq_id); > + > +#ifdef RTE_NET_MLX5 > + mlx5_test_avail_thresh_event_handler(port_id, rxq_id); > +#endif This function includes a check of the driver in use for the port, so I assume it is OK to call it unconditionally here. [...] > +How to use available descriptor threshold and Host Shaper > +-- Underlining should be of same size as the title. > + > +There are sample command lines to configure available descriptor threshold > in testpmd. Simpler: There is a command to configure available descriptor threshold in testpmd. You could add a link here to the doc of the command. > +Testpmd also contains sample logic to handle available descriptor threshold > event. > +The typical workflow is: testpmd configure available descriptor threshold > for Rx queues, enable > +avail_thresh_triggered in host shaper and register a callback, when traffic > from host is > +too high and Rx queue emptiness is below available descriptor threshold, PMD > receives an event and > +firmware configures a 100Mbps shaper on host port automatically, then PMD > call > +the callback registered previously, which will delay a while to let Rx queue > +empty, then disable host shaper. > + > +Let's assume we have a simple Blue Field 2 setup: port 0 is uplink, port 1 There's no space in BlueField. > +is VF representor. Each port has 2 Rx queues. > +In order to control traffic from host to ARM, we can enable available > descriptor threshold in testpmd by: > + > +.. code-block:: console > + > + testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0 > + testpmd> set port 1 rxq 0 avail_thresh 70 > + testpmd> set port 1 rxq 1 avail_thresh 70 > + > +The first command disables current host shaper, and enables available > descriptor threshold triggered mode. > +The left commands configure available descriptor threshold to 70% of Rx > queue size for both Rx queues, left -> other > +When traffic from host is too high, you can see testpmd console prints log > +about available descriptor threshold event receiving, then host shaper is > disabled. > +The traffic rate from host is controlled and less drop happens in Rx queues. > + > +When disable available descriptor threshold and avail_thresh_triggered, we > can invoke below commands in testpmd: Reword: The threshold event and shaper can be disabled like this: > + > +.. code-block:: console > + > + testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0 > + testpmd> set port 1 rxq 0 avail_thresh 0 > + testpmd> set port 1 rxq 1 avail_thresh 0 > + > +It's recommended an application disables available descriptor threshold and > avail_thresh_triggered before exit, > +if it enables them before. > + > +We can also configure the shaper with a value, the rate unit is 100Mbps, > below > +command sets current shaper to 5Gbps and disables avail_thresh_triggered. > + > +.. code-block:: console > + > + testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50
[PATCH] net/mlx5: block negative integrity flows
Negative integrity flows are supported, but malfunctioning due to a bug, and an API change is necessary to fix. Drop negative flow support for now. Signed-off-by: Raja Zidane Acked-by: Matan Azrad Acked-by: Gregory Etelson --- doc/guides/nics/mlx5.rst| 4 +++- drivers/net/mlx5/mlx5_flow_dv.c | 6 ++ 2 files changed, 9 insertions(+), 1 deletion(-) diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst index f41703277e..f28a874006 100644 --- a/doc/guides/nics/mlx5.rst +++ b/doc/guides/nics/mlx5.rst @@ -467,7 +467,9 @@ Limitations TCP or UDP, must be in the rule pattern as well:: flow create 0 ingress pattern integrity level is 0 value mask l3_ok value spec l3_ok / eth / ipv6 / end … - or + +Negative integrity flows are not supported. i.e. + flow create 0 ingress pattern integrity level is 0 value mask l4_ok value spec 0 / eth / ipv4 proto is udp / end … - Connection tracking: diff --git a/drivers/net/mlx5/mlx5_flow_dv.c b/drivers/net/mlx5/mlx5_flow_dv.c index 65b02b20ce..f82986ac31 100644 --- a/drivers/net/mlx5/mlx5_flow_dv.c +++ b/drivers/net/mlx5/mlx5_flow_dv.c @@ -6809,6 +6809,12 @@ flow_dv_validate_item_integrity(struct rte_eth_dev *dev, RTE_FLOW_ERROR_TYPE_ITEM, integrity_item, "unsupported integrity filter"); + if ((mask->l3_ok & !spec->l3_ok) || (mask->l4_ok & !spec->l4_ok) || + (mask->ipv4_csum_ok & !spec->ipv4_csum_ok) || + (mask->l4_csum_ok & !spec->l4_csum_ok)) + return rte_flow_error_set(error, EINVAL, + RTE_FLOW_ERROR_TYPE_ITEM, + NULL, "negative integrity flow is not supported"); if (spec->level > 1) { if (pattern_flags & MLX5_FLOW_ITEM_INNER_INTEGRITY) return rte_flow_error_set -- 2.21.0
[Bug 1036] vhost primary and secondary process symmetric test meet issue
https://bugs.dpdk.org/show_bug.cgi?id=1036 Bug ID: 1036 Summary: vhost primary and secondary process symmetric test meet issue Product: DPDK Version: 22.03 Hardware: x86 OS: Linux Status: UNCONFIRMED Severity: normal Priority: Normal Component: vhost/virtio Assignee: dev@dpdk.org Reporter: xingguang...@intel.com Target Milestone: --- Prerequisites - 1. Compile symmetric_mp example application:: meson configure -Dexamples=multi_process/symmetric_mp x86_64-native-linuxapp-gcc ninja -C x86_64-native-linuxapp-gcc Steps - 1. Launch vhost testpmd as primary process:: ./x86_64-native-linuxapp-gcc/examples/dpdk-symmetric_mp -l 1 -n 4 \ --vdev 'net_vhost,iface=vhost-net,queues=2,client=1' \ --vdev 'net_vhost1,iface=vhost-net1,queues=2,client=1' \ --proc-type=auto -- -p 3 --num-procs=2 --proc-id=0 2. Launch vhost testpmd as secondary process:: ./x86_64-native-linuxapp-gcc/examples/dpdk-symmetric_mp -l 2 -n 4 \ --vdev 'net_vhost,iface=vhost-net,queues=2,client=1' \ --vdev 'net_vhost1,iface=vhost-net1,queues=2,client=1' \ --proc-type=auto -- -p 3 --num-procs=2 --proc-id=1 3. Launch virtio-user:: ./x86_64-native-linuxapp-gcc/app/dpdk-testpmd -n 4 -l 7-8 --no-pci --file-prefix=virtio1 \ --vdev=net_virtio_user1,mac=00:01:02:03:04:05,path=./vhost-net,queues=2,server=1,mrg_rxbuf=0,in_order=0 \ -- -i --nb-cores=1 --rxq=2 --txq=2 --txd=1024 --rxd=1024 when launch virtio-user: VHOST_CONFIG: (0) device not found. -- You are receiving this mail because: You are the assignee for the bug.
RE: [PATCH v7] net/i40e: add outer VLAN processing
> -Original Message- > From: Zhang, Yuying > Sent: Tuesday, June 14, 2022 4:40 PM > To: Liu, KevinX ; dev@dpdk.org > Cc: Xing, Beilei ; Yang, SteveX > ; Zhang, RobinX > Subject: RE: [PATCH v7] net/i40e: add outer VLAN processing > > Hi Kevin, > > Workaround should be replaced when root cause be found. > > Best regards, > Yuying > > > -Original Message- > > From: Liu, KevinX > > Sent: Tuesday, June 14, 2022 11:07 AM > > To: Zhang, Yuying ; dev@dpdk.org > > Cc: Xing, Beilei ; Yang, SteveX > > ; Zhang, RobinX > > Subject: RE: [PATCH v7] net/i40e: add outer VLAN processing > > > > Hi, Yuying > > > > > -Original Message- > > > From: Zhang, Yuying > > > Sent: 2022年6月14日 10:44 > > > To: Liu, KevinX ; dev@dpdk.org > > > Cc: Xing, Beilei ; Yang, SteveX > > > ; Zhang, RobinX > > > Subject: RE: [PATCH v7] net/i40e: add outer VLAN processing > > > > > > Hi Kevin, > > > > > > > -Original Message- > > > > From: Liu, KevinX > > > > Sent: Saturday, June 11, 2022 12:30 AM > > > > To: dev@dpdk.org > > > > Cc: Zhang, Yuying ; Xing, Beilei > > > > ; Yang, SteveX ; > > > > Zhang, RobinX ; Liu, KevinX > > > > > > > > Subject: [PATCH v7] net/i40e: add outer VLAN processing > > > > > > > > From: Robin Zhang > > > > > > > > Outer VLAN processing is supported after firmware v8.4, kernel > > > > driver also > > > > > > Since this patch can only be enabled with firmware v8.6, should you > > > sync with dpdk here? > > OK, I'll revise it here. > > > > > > > change the default behavior to support this feature. To align with > > > > kernel driver, add support for outer VLAN processing in DPDK. > > > > > > > > But it is forbidden for firmware to change the Inner/Outer VLAN > > > > configuration while there are MAC/VLAN filters in the switch table. > > > > Therefore, we need to clear the MAC table before setting config, > > > > and then restore the MAC table after setting. > > > > > > > > This will not impact on an old firmware. > > > > > > > > Signed-off-by: Robin Zhang > > > > Signed-off-by: Kevin Liu > > Acked-by: Yuying Zhang Applied to dpdk-next-net-intel. Thanks Qi
Re: [PATCH 1/2] common/cnxk: decrypt packet count register update
On Fri, May 20, 2022 at 10:53 AM Rahul Bhansali wrote: > > Corrects the CPT decrypt packet counter register. > > Signed-off-by: Rahul Bhansali Updated the git commit as follows and applied to dpdk-next-net-mrvl/for-next-net. Thanks common/cnxk: fix decrypt packet count register update Corrects the CPT decrypt packet counter register. Fixes: b1a22e5d4f ("common/cnxk: add CPT diagnostics") Cc: sta...@dpdk.org Signed-off-by: Rahul Bhansali > --- > drivers/common/cnxk/roc_cpt_debug.c | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > > diff --git a/drivers/common/cnxk/roc_cpt_debug.c > b/drivers/common/cnxk/roc_cpt_debug.c > index 847d969268..be6ddb56aa 100644 > --- a/drivers/common/cnxk/roc_cpt_debug.c > +++ b/drivers/common/cnxk/roc_cpt_debug.c > @@ -200,7 +200,7 @@ cpt_lf_print(struct roc_cpt_lf *lf) > reg_val = plt_read64(lf->rbase + CPT_LF_CTX_DEC_BYTE_CNT); > plt_print("Decrypted byte count:\t%" PRIu64, reg_val); > > - reg_val = plt_read64(lf->rbase + CPT_LF_CTX_ENC_PKT_CNT); > + reg_val = plt_read64(lf->rbase + CPT_LF_CTX_DEC_PKT_CNT); > plt_print("Decrypted packet count:\t%" PRIu64, reg_val); > } > > -- > 2.25.1 >
Re: [PATCH 2/2] common/cnxk: print NIX inline outbound CPT LF registers
On Tue, May 24, 2022 at 8:43 PM Ray Kinsella wrote: > > > Rahul Bhansali writes: > > > This add the support to dump NIX inline outbound CPT LF > > registers. > > > > Signed-off-by: Rahul Bhansali > > --- > > drivers/common/cnxk/roc_nix_debug.c | 14 ++ > > drivers/common/cnxk/roc_nix_inl.h | 1 + > > drivers/common/cnxk/version.map | 1 + > > 3 files changed, 16 insertions(+) > > > > Acked-by: Ray Kinsella Applied to dpdk-next-net-mrvl/for-next-net. Thanks
Re: [dpdk-dev] [PATCH] common/cnxk: add ROC API to free MCAM entry
On Fri, Jun 3, 2022 at 9:06 AM wrote: > > From: Satheesh Paul > > Add ROC API to free the given MCAM entry. If the MCAM > entry has flow counter associated, this API will clear > and free the flow counter. > > Signed-off-by: Satheesh Paul > Reviewed-by: Jerin Jacob Kollanukkaran > --- > drivers/common/cnxk/roc_npc.c | 63 +++-- > drivers/common/cnxk/roc_npc.h | 7 > drivers/common/cnxk/roc_npc_mcam.c | 39 ++ > drivers/common/cnxk/roc_npc_priv.h | 2 + > drivers/common/cnxk/roc_npc_utils.c | 15 +++ > drivers/common/cnxk/version.map | 3 ++ > 6 files changed, 100 insertions(+), 29 deletions(-) > > diff --git a/drivers/common/cnxk/roc_npc.c b/drivers/common/cnxk/roc_npc.c > index da5b96242e..02624856e0 100644 > --- a/drivers/common/cnxk/roc_npc.c > +++ b/drivers/common/cnxk/roc_npc.c > @@ -55,6 +55,53 @@ roc_npc_mcam_free_entry(struct roc_npc *roc_npc, uint32_t > entry) > return npc_mcam_free_entry(npc, entry); > } > > +int > +roc_npc_mcam_free(struct roc_npc *roc_npc, struct roc_npc_flow *mcam) > +{ > + int rc = 0; > + > + if (mcam->use_ctr) { > + rc = roc_npc_mcam_clear_counter(roc_npc, mcam->ctr_id); > + if (rc) > + return rc; > + > + rc = roc_npc_mcam_free_counter(roc_npc, mcam->ctr_id); > + if (rc) > + return rc; > + } > + > + return roc_npc_mcam_free_entry(roc_npc, mcam->mcam_id); > +} > + > +int > +roc_npc_mcam_init(struct roc_npc *roc_npc, struct roc_npc_flow *flow, > + int mcam_id) > +{ > + struct npc *npc = roc_npc_to_npc_priv(roc_npc); > + int rc = 0; No need to assign to zero as the next line is overwriting it. > + > + rc = npc_mcam_init(npc, flow, mcam_id); > + if (rc != 0) { > + plt_err("npc: mcam initialisation write failed"); > + return rc; > + } > + return 0; > +} > + > +int > +roc_npc_mcam_move(struct roc_npc *roc_npc, uint16_t old_ent, uint16_t > new_ent) > +{ > + struct npc *npc = roc_npc_to_npc_priv(roc_npc); > + struct mbox *mbox = npc->mbox; > + int rc = -ENOSPC; No need to assign to value as the next line is overwriting it. > + > + rc = npc_mcam_move(mbox, old_ent, new_ent); > + if (rc) > + return rc; > + > + return 0; > +} > + > int > roc_npc_mcam_free_all_resources(struct roc_npc *roc_npc) > { > @@ -383,7 +430,7 @@ npc_parse_actions(struct roc_npc *roc_npc, const struct > roc_npc_attr *attr, > > case ROC_NPC_ACTION_TYPE_COUNT: > /* Indicates, need a counter */ > - flow->ctr_id = 1; > + flow->use_ctr = 1; > req_act |= ROC_NPC_ACTION_TYPE_COUNT; > break; > > @@ -1268,7 +1315,7 @@ roc_npc_flow_create(struct roc_npc *roc_npc, const > struct roc_npc_attr *attr, > return flow; > > set_rss_failed: > - rc = npc_mcam_free_entry(npc, flow->mcam_id); > + rc = roc_npc_mcam_free_entry(roc_npc, flow->mcam_id); > if (rc != 0) { > *errcode = rc; > plt_free(flow); > @@ -1314,17 +1361,7 @@ roc_npc_flow_destroy(struct roc_npc *roc_npc, struct > roc_npc_flow *flow) > return rc; > } > > - if (flow->ctr_id != NPC_COUNTER_NONE) { > - rc = roc_npc_mcam_clear_counter(roc_npc, flow->ctr_id); > - if (rc != 0) > - return rc; > - > - rc = npc_mcam_free_counter(npc, flow->ctr_id); > - if (rc != 0) > - return rc; > - } > - > - rc = npc_mcam_free_entry(npc, flow->mcam_id); > + rc = roc_npc_mcam_free(roc_npc, flow); > if (rc != 0) > return rc; > > diff --git a/drivers/common/cnxk/roc_npc.h b/drivers/common/cnxk/roc_npc.h > index f92c2a633c..1b4e5521cb 100644 > --- a/drivers/common/cnxk/roc_npc.h > +++ b/drivers/common/cnxk/roc_npc.h > @@ -246,6 +246,7 @@ struct roc_npc_flow { > uint8_t nix_intf; > uint8_t enable; > uint32_t mcam_id; > + uint8_t use_ctr; > int32_t ctr_id; > uint32_t priority; > uint32_t mtr_id; > @@ -329,6 +330,8 @@ roc_npc_flow_create(struct roc_npc *roc_npc, const struct > roc_npc_attr *attr, > const struct roc_npc_action actions[], int *errcode); > int __roc_api roc_npc_flow_destroy(struct roc_npc *roc_npc, >struct roc_npc_flow *flow); > +int __roc_api roc_npc_mcam_free(struct roc_npc *roc_npc, > + struct roc_npc_flow *mcam); > int __roc_api roc_npc_mcam_free_entry(struct roc_npc *roc_npc, uint32_t > entry); > int __roc_api roc_npc_mcam_enable_all_entries(struct roc_npc *roc_npc, > bool enable); > @
[PATCH v8 2/6] common/mlx5: share interrupt management
There are many duplicate code of creating and initializing rte_intr_handle. Add a new mlx5_os API to do this, replace all PMD related code with this API. Signed-off-by: Spike Du --- drivers/common/mlx5/linux/mlx5_common_os.c | 131 ++ drivers/common/mlx5/linux/mlx5_common_os.h | 11 +++ drivers/common/mlx5/version.map | 2 + drivers/common/mlx5/windows/mlx5_common_os.h | 24 + drivers/net/mlx5/linux/mlx5_ethdev_os.c | 71 -- drivers/net/mlx5/linux/mlx5_os.c | 132 ++- drivers/net/mlx5/linux/mlx5_socket.c | 53 ++- drivers/net/mlx5/mlx5.h | 2 - drivers/net/mlx5/mlx5_txpp.c | 28 ++ drivers/net/mlx5/windows/mlx5_ethdev_os.c| 22 - drivers/vdpa/mlx5/mlx5_vdpa_virtq.c | 48 ++ 11 files changed, 217 insertions(+), 307 deletions(-) diff --git a/drivers/common/mlx5/linux/mlx5_common_os.c b/drivers/common/mlx5/linux/mlx5_common_os.c index d40cfd5..f10a981 100644 --- a/drivers/common/mlx5/linux/mlx5_common_os.c +++ b/drivers/common/mlx5/linux/mlx5_common_os.c @@ -11,6 +11,7 @@ #endif #include #include +#include #include #include @@ -964,3 +965,133 @@ claim_zero(mlx5_glue->dereg_mr(pmd_mr->obj)); memset(pmd_mr, 0, sizeof(*pmd_mr)); } + +/** + * Rte_intr_handle create and init helper. + * + * @param[in] mode + * interrupt instance can be shared between primary and secondary + * processes or not. + * @param[in] set_fd_nonblock + * Whether to set fd to O_NONBLOCK. + * @param[in] fd + * Fd to set in created intr_handle. + * @param[in] cb + * Callback to register for intr_handle. + * @param[in] cb_arg + * Callback argument for cb. + * + * @return + * - Interrupt handle on success. + * - NULL on failure, with rte_errno set. + */ +struct rte_intr_handle * +mlx5_os_interrupt_handler_create(int mode, bool set_fd_nonblock, int fd, +rte_intr_callback_fn cb, void *cb_arg) +{ + struct rte_intr_handle *tmp_intr_handle; + int ret, flags; + + tmp_intr_handle = rte_intr_instance_alloc(mode); + if (!tmp_intr_handle) { + rte_errno = ENOMEM; + goto err; + } + if (set_fd_nonblock) { + flags = fcntl(fd, F_GETFL); + ret = fcntl(fd, F_SETFL, flags | O_NONBLOCK); + if (ret) { + rte_errno = errno; + goto err; + } + } + ret = rte_intr_fd_set(tmp_intr_handle, fd); + if (ret) + goto err; + ret = rte_intr_type_set(tmp_intr_handle, RTE_INTR_HANDLE_EXT); + if (ret) + goto err; + ret = rte_intr_callback_register(tmp_intr_handle, cb, cb_arg); + if (ret) { + rte_errno = -ret; + goto err; + } + return tmp_intr_handle; +err: + if (tmp_intr_handle) + rte_intr_instance_free(tmp_intr_handle); + return NULL; +} + +/* Safe unregistration for interrupt callback. */ +static void +mlx5_intr_callback_unregister(const struct rte_intr_handle *handle, + rte_intr_callback_fn cb_fn, void *cb_arg) +{ + uint64_t twait = 0; + uint64_t start = 0; + + do { + int ret; + + ret = rte_intr_callback_unregister(handle, cb_fn, cb_arg); + if (ret >= 0) + return; + if (ret != -EAGAIN) { + DRV_LOG(INFO, "failed to unregister interrupt" + " handler (error: %d)", ret); + MLX5_ASSERT(false); + return; + } + if (twait) { + struct timespec onems; + + /* Wait one millisecond and try again. */ + onems.tv_sec = 0; + onems.tv_nsec = NS_PER_S / MS_PER_S; + nanosleep(&onems, 0); + /* Check whether one second elapsed. */ + if ((rte_get_timer_cycles() - start) <= twait) + continue; + } else { + /* +* We get the amount of timer ticks for one second. +* If this amount elapsed it means we spent one +* second in waiting. This branch is executed once +* on first iteration. +*/ + twait = rte_get_timer_hz(); + MLX5_ASSERT(twait); + } + /* +* Timeout elapsed, show message (once a second) and retry. +* We have no other acceptable option here, if we ignore +* the unregistering return code the handler will not +* be unr
[PATCH v8 0/6] introduce per-queue available descriptor threshold and host shaper
available descriptor threshold(ADT for short) is per RX queue attribute, when RX queue available descriptors for HW is below ADT, HW sends an event to application. Host shaper can configure shaper rate and avail_thresh-triggered for a host port. The shaper limits the rate of traffic from host port to embedded ARM rx port on Nvidia BlueField 2 NIC. If avail_thresh-triggered is enabled, a 100Mbps shaper is enabled automatically when one of the host port's Rx queues receives available descriptor threshold event. These two features can combine to control traffic from host port to wire port for BlueField 2 NIC. The traffic flows from host to embedded ARM, then to the physical port. The work flow is on the ARM system, configure available descriptor threshold to RX queue and enable avail_thresh-triggered flag in host shaper, after receiving available descriptor threshold event, delay a while until RX queue is empty , then disable the shaper. We recycle this work flow to reduce RX queue drops on ARM system. Add new libethdev API to set available descriptor threshold, add rte event RTE_ETH_EVENT_RX_AVAIL_THRESH to handle available descriptor threshold event. For host shaper, because it doesn't align to existing DPDK framework and is specific to Nvidia NIC, use PMD private API. For integration with testpmd, put the private cmdline function and available descriptor threshold event handler in mlx5 PMD directory by adding a new file mlx5_testpmd.c. Follow David Marchand's driver specific commands framework to add mlx5 specific commands. Spike Du (6): net/mlx5: add LWM support for Rxq common/mlx5: share interrupt management net/mlx5: add LWM event handling support net/mlx5: support Rx queue based available descriptor threshold net/mlx5: add private API to config host port shaper app/testpmd: add Host Shaper command app/test-pmd/testpmd.c | 7 + doc/guides/nics/mlx5.rst | 93 + doc/guides/rel_notes/release_22_07.rst | 2 + drivers/common/mlx5/linux/meson.build| 13 ++ drivers/common/mlx5/linux/mlx5_common_os.c | 131 drivers/common/mlx5/linux/mlx5_common_os.h | 11 + drivers/common/mlx5/mlx5_prm.h | 26 +++ drivers/common/mlx5/version.map | 2 + drivers/common/mlx5/windows/mlx5_common_os.h | 24 +++ drivers/net/mlx5/linux/mlx5_ethdev_os.c | 71 --- drivers/net/mlx5/linux/mlx5_os.c | 132 +++- drivers/net/mlx5/linux/mlx5_socket.c | 53 + drivers/net/mlx5/meson.build | 4 + drivers/net/mlx5/mlx5.c | 68 +++ drivers/net/mlx5/mlx5.h | 12 +- drivers/net/mlx5/mlx5_devx.c | 60 +- drivers/net/mlx5/mlx5_devx.h | 1 + drivers/net/mlx5/mlx5_rx.c | 288 +++ drivers/net/mlx5/mlx5_rx.h | 13 ++ drivers/net/mlx5/mlx5_testpmd.c | 205 +++ drivers/net/mlx5/mlx5_testpmd.h | 26 +++ drivers/net/mlx5/mlx5_txpp.c | 28 +-- drivers/net/mlx5/rte_pmd_mlx5.h | 30 +++ drivers/net/mlx5/version.map | 2 + drivers/net/mlx5/windows/mlx5_ethdev_os.c| 22 -- drivers/vdpa/mlx5/mlx5_vdpa_virtq.c | 48 + 26 files changed, 1064 insertions(+), 308 deletions(-) create mode 100644 drivers/net/mlx5/mlx5_testpmd.c create mode 100644 drivers/net/mlx5/mlx5_testpmd.h -- 1.8.3.1
[PATCH v8 5/6] net/mlx5: add private API to config host port shaper
Host port shaper can be configured with QSHR(QoS Shaper Host Register). Add check in build files to enable this function or not. The host shaper configuration affects all the ethdev ports belonging to the same host port. Host shaper can configure shaper rate and lwm-triggered for a host port. The shaper limits the rate of traffic from host port to wire port. If lwm-triggered is enabled, a 100Mbps shaper is enabled automatically when one of the host port's Rx queues receives available descriptor threshold event. Signed-off-by: Spike Du --- doc/guides/nics/mlx5.rst | 35 +++ doc/guides/rel_notes/release_22_07.rst | 1 + drivers/common/mlx5/linux/meson.build | 13 + drivers/common/mlx5/mlx5_prm.h | 25 drivers/net/mlx5/mlx5.h| 2 + drivers/net/mlx5/mlx5_rx.c | 104 + drivers/net/mlx5/rte_pmd_mlx5.h| 30 ++ drivers/net/mlx5/version.map | 2 + 8 files changed, 212 insertions(+) diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst index cceaddf..5f7b060 100644 --- a/doc/guides/nics/mlx5.rst +++ b/doc/guides/nics/mlx5.rst @@ -94,6 +94,7 @@ Features - Sub-Function representors. - Sub-Function. - Rx queue available descriptor threshold configuration. +- Host shaper support. Limitations @@ -525,6 +526,12 @@ Limitations - Doesn't support shared Rx queue and Hairpin Rx queue. +- Host shaper: + + - Support BlueField series NIC from BlueField 2. + - When configure host shaper with MLX5_HOST_SHAPER_FLAG_AVAIL_THRESH_TRIGGERED flag set, +only rate 0 and 100Mbps are supported. + Statistics -- @@ -1692,3 +1699,31 @@ Available descriptor threshold is a per Rx queue attribute, it should be configu a percentage of the Rx queue size. When Rx queue available descriptors for hardware are below the threshold, an event is sent to PMD. +Host shaper introduction + + +Host shaper register is per host port register which sets a shaper +on the host port. +All VF/hostPF representors belonging to one host port share one host shaper. +For example, if representor 0 and representor 1 belong to same host port, +and a host shaper rate of 1Gbps is configured, the shaper throttles both +representors' traffic from host. +Host shaper has two modes for setting the shaper, immediate and deferred to +available descriptor threshold event trigger. In immediate mode, the rate limit is configured +immediately to host shaper. When deferring to available descriptor threshold trigger, the shaper +is not set until an available descriptor threshold event is received by any Rx queue in a VF +representor belonging to the host port. The only rate supported for deferred +mode is 100Mbps (there is no limit on the supported rates for immediate mode). +In deferred mode, the shaper is set on the host port by the firmware upon +receiving the available descriptor threshold event, which allows throttling host traffic on +available descriptor threshold events at minimum latency, preventing excess drops in the +Rx queue. + +Host shaper dependency for mstflint package +--- + +In order to configure host shaper register, ``librte_net_mlx5`` depends on ``libmtcr_ul`` +which can be installed from OFED mstflint package. +Meson detects ``libmtcr_ul`` existence at configure stage. +If the library is detected, the application must link with ``-lmtcr_ul``, +as done by the pkg-config file libdpdk.pc. diff --git a/doc/guides/rel_notes/release_22_07.rst b/doc/guides/rel_notes/release_22_07.rst index 46fd73a..3349cda 100644 --- a/doc/guides/rel_notes/release_22_07.rst +++ b/doc/guides/rel_notes/release_22_07.rst @@ -90,6 +90,7 @@ New Features * Added support for MTU on Windows. * Added matching and RSS on IPsec ESP. * Added Rx queue available descriptor threshold support. + * Added host shaper support. * **Updated Marvell cnxk crypto driver.** diff --git a/drivers/common/mlx5/linux/meson.build b/drivers/common/mlx5/linux/meson.build index 5335f5b..51c6e5d 100644 --- a/drivers/common/mlx5/linux/meson.build +++ b/drivers/common/mlx5/linux/meson.build @@ -45,6 +45,13 @@ if static_ibverbs ext_deps += declare_dependency(link_args:ibv_ldflags.split()) endif +libmtcr_ul_found = false +lib = cc.find_library('mtcr_ul', required:false) +if lib.found() and run_command('meson', '--version').stdout().version_compare('>= 0.49.2') +libmtcr_ul_found = true +ext_deps += lib +endif + sources += files('mlx5_nl.c') sources += files('mlx5_common_auxiliary.c') sources += files('mlx5_common_os.c') @@ -207,6 +214,12 @@ has_sym_args = [ [ 'HAVE_MLX5_IBV_IMPORT_CTX_PD_AND_MR', 'infiniband/verbs.h', 'ibv_import_device' ], ] +if libmtcr_ul_found +has_sym_args += [ +[ 'HAVE_MLX5_MSTFLINT', 'mstflint/mtcr.h', +'mopen'], +] +endif config = configuration_data() fo
[PATCH v8 1/6] net/mlx5: add LWM support for Rxq
Add lwm(Limit WaterMark) field to Rxq object which indicates the percentage of RX queue size used by HW to raise LWM event to the user. Allow LWM setting in modify_rq command. Allow the LWM configuration dynamically by adding RDY2RDY state change. Signed-off-by: Spike Du --- drivers/net/mlx5/mlx5.h | 1 + drivers/net/mlx5/mlx5_devx.c | 13 - drivers/net/mlx5/mlx5_devx.h | 1 + drivers/net/mlx5/mlx5_rx.h | 1 + 4 files changed, 15 insertions(+), 1 deletion(-) diff --git a/drivers/net/mlx5/mlx5.h b/drivers/net/mlx5/mlx5.h index ef755ee..305edff 100644 --- a/drivers/net/mlx5/mlx5.h +++ b/drivers/net/mlx5/mlx5.h @@ -1395,6 +1395,7 @@ enum mlx5_rxq_modify_type { MLX5_RXQ_MOD_RST2RDY, /* modify state from reset to ready. */ MLX5_RXQ_MOD_RDY2ERR, /* modify state from ready to error. */ MLX5_RXQ_MOD_RDY2RST, /* modify state from ready to reset. */ + MLX5_RXQ_MOD_RDY2RDY, /* modify state from ready to ready. */ }; enum mlx5_txq_modify_type { diff --git a/drivers/net/mlx5/mlx5_devx.c b/drivers/net/mlx5/mlx5_devx.c index 4b48f94..c918a50 100644 --- a/drivers/net/mlx5/mlx5_devx.c +++ b/drivers/net/mlx5/mlx5_devx.c @@ -62,7 +62,7 @@ * @return * 0 on success, a negative errno value otherwise and rte_errno is set. */ -static int +int mlx5_devx_modify_rq(struct mlx5_rxq_priv *rxq, uint8_t type) { struct mlx5_devx_modify_rq_attr rq_attr; @@ -76,6 +76,11 @@ case MLX5_RXQ_MOD_RST2RDY: rq_attr.rq_state = MLX5_RQC_STATE_RST; rq_attr.state = MLX5_RQC_STATE_RDY; + if (rxq->lwm) { + rq_attr.modify_bitmask |= + MLX5_MODIFY_RQ_IN_MODIFY_BITMASK_WQ_LWM; + rq_attr.lwm = rxq->lwm; + } break; case MLX5_RXQ_MOD_RDY2ERR: rq_attr.rq_state = MLX5_RQC_STATE_RDY; @@ -85,6 +90,12 @@ rq_attr.rq_state = MLX5_RQC_STATE_RDY; rq_attr.state = MLX5_RQC_STATE_RST; break; + case MLX5_RXQ_MOD_RDY2RDY: + rq_attr.rq_state = MLX5_RQC_STATE_RDY; + rq_attr.state = MLX5_RQC_STATE_RDY; + rq_attr.modify_bitmask |= MLX5_MODIFY_RQ_IN_MODIFY_BITMASK_WQ_LWM; + rq_attr.lwm = rxq->lwm; + break; default: break; } diff --git a/drivers/net/mlx5/mlx5_devx.h b/drivers/net/mlx5/mlx5_devx.h index a95207a..ebd1da4 100644 --- a/drivers/net/mlx5/mlx5_devx.h +++ b/drivers/net/mlx5/mlx5_devx.h @@ -11,6 +11,7 @@ int mlx5_txq_devx_modify(struct mlx5_txq_obj *obj, enum mlx5_txq_modify_type type, uint8_t dev_port); void mlx5_txq_devx_obj_release(struct mlx5_txq_obj *txq_obj); +int mlx5_devx_modify_rq(struct mlx5_rxq_priv *rxq, uint8_t type); extern struct mlx5_obj_ops devx_obj_ops; diff --git a/drivers/net/mlx5/mlx5_rx.h b/drivers/net/mlx5/mlx5_rx.h index e715ed6..25a5f2c 100644 --- a/drivers/net/mlx5/mlx5_rx.h +++ b/drivers/net/mlx5/mlx5_rx.h @@ -175,6 +175,7 @@ struct mlx5_rxq_priv { struct mlx5_devx_rq devx_rq; struct rte_eth_hairpin_conf hairpin_conf; /* Hairpin configuration. */ uint32_t hairpin_status; /* Hairpin binding status. */ + uint32_t lwm:16; }; /* External RX queue descriptor. */ -- 1.8.3.1
[PATCH v8 3/6] net/mlx5: add LWM event handling support
When LWM meets RQ WQE, the kernel driver raises an event to SW. Use devx event_channel to catch this and to notify the user. Allocate this channel per shared device. The channel has a cookie that informs the specific event port and queue. Signed-off-by: Spike Du --- drivers/net/mlx5/mlx5.c | 66 drivers/net/mlx5/mlx5.h | 7 + drivers/net/mlx5/mlx5_devx.c | 47 +++ drivers/net/mlx5/mlx5_rx.c | 33 ++ drivers/net/mlx5/mlx5_rx.h | 7 + 5 files changed, 160 insertions(+) diff --git a/drivers/net/mlx5/mlx5.c b/drivers/net/mlx5/mlx5.c index f098871..e04a666 100644 --- a/drivers/net/mlx5/mlx5.c +++ b/drivers/net/mlx5/mlx5.c @@ -9,6 +9,7 @@ #include #include #include +#include #include #include @@ -22,6 +23,7 @@ #include #include #include +#include #include #include @@ -1525,6 +1527,69 @@ struct mlx5_dev_ctx_shared * } /** + * Create LWM event_channel and interrupt handle for shared device + * context. All rxqs sharing the device context share the event_channel. + * A callback is registered in interrupt thread to receive the LWM event. + * + * @param[in] priv + * Pointer to mlx5_priv instance. + * + * @return + * 0 on success, negative with rte_errno set. + */ +int +mlx5_lwm_setup(struct mlx5_priv *priv) +{ + int fd_lwm; + + pthread_mutex_init(&priv->sh->lwm_config_lock, NULL); + priv->sh->devx_channel_lwm = mlx5_os_devx_create_event_channel + (priv->sh->cdev->ctx, +MLX5DV_DEVX_CREATE_EVENT_CHANNEL_FLAGS_OMIT_EV_DATA); + if (!priv->sh->devx_channel_lwm) + goto err; + fd_lwm = mlx5_os_get_devx_channel_fd(priv->sh->devx_channel_lwm); + priv->sh->intr_handle_lwm = mlx5_os_interrupt_handler_create + (RTE_INTR_INSTANCE_F_SHARED, true, +fd_lwm, mlx5_dev_interrupt_handler_lwm, priv); + if (!priv->sh->intr_handle_lwm) + goto err; + return 0; +err: + if (priv->sh->devx_channel_lwm) { + mlx5_os_devx_destroy_event_channel + (priv->sh->devx_channel_lwm); + priv->sh->devx_channel_lwm = NULL; + } + pthread_mutex_destroy(&priv->sh->lwm_config_lock); + return -rte_errno; +} + +/** + * Destroy LWM event_channel and interrupt handle for shared device + * context before free this context. The interrupt handler is also + * unregistered. + * + * @param[in] sh + * Pointer to shared device context. + */ +void +mlx5_lwm_unset(struct mlx5_dev_ctx_shared *sh) +{ + if (sh->intr_handle_lwm) { + mlx5_os_interrupt_handler_destroy(sh->intr_handle_lwm, + mlx5_dev_interrupt_handler_lwm, (void *)-1); + sh->intr_handle_lwm = NULL; + } + if (sh->devx_channel_lwm) { + mlx5_os_devx_destroy_event_channel + (sh->devx_channel_lwm); + sh->devx_channel_lwm = NULL; + } + pthread_mutex_destroy(&sh->lwm_config_lock); +} + +/** * Free shared IB device context. Decrement counter and if zero free * all allocated resources and close handles. * @@ -1601,6 +1666,7 @@ struct mlx5_dev_ctx_shared * claim_zero(mlx5_devx_cmd_destroy(sh->td)); MLX5_ASSERT(sh->geneve_tlv_option_resource == NULL); pthread_mutex_destroy(&sh->txpp.mutex); + mlx5_lwm_unset(sh); mlx5_free(sh); return; exit: diff --git a/drivers/net/mlx5/mlx5.h b/drivers/net/mlx5/mlx5.h index 7ebb2cc..a76f2fe 100644 --- a/drivers/net/mlx5/mlx5.h +++ b/drivers/net/mlx5/mlx5.h @@ -1268,6 +1268,9 @@ struct mlx5_dev_ctx_shared { struct mlx5_lb_ctx self_lb; /* QP to enable self loopback for Devx. */ unsigned int flow_max_priority; enum modify_reg flow_mreg_c[MLX5_MREG_C_NUM]; + void *devx_channel_lwm; + struct rte_intr_handle *intr_handle_lwm; + pthread_mutex_t lwm_config_lock; /* Availability of mreg_c's. */ struct mlx5_dev_shared_port port[]; /* per device port data array. */ }; @@ -1405,6 +1408,7 @@ enum mlx5_txq_modify_type { }; struct mlx5_rxq_priv; +struct mlx5_priv; /* HW objects operations structure. */ struct mlx5_obj_ops { @@ -1413,6 +1417,7 @@ struct mlx5_obj_ops { int (*rxq_event_get)(struct mlx5_rxq_obj *rxq_obj); int (*rxq_obj_modify)(struct mlx5_rxq_priv *rxq, uint8_t type); void (*rxq_obj_release)(struct mlx5_rxq_priv *rxq); + int (*rxq_event_get_lwm)(struct mlx5_priv *priv, int *rxq_idx, int *port_id); int (*ind_table_new)(struct rte_eth_dev *dev, const unsigned int log_n, struct mlx5_ind_table_obj *ind_tbl); int (*ind_table_modify)(struct rte_eth_dev *dev, @@ -1603,6 +1608,8 @@ int mlx5_udp_tunnel_port_add(struct rte_eth_dev *dev, bool mlx5_is_hpf(struct rte_eth_dev *dev); bool mlx5_is_sf_repr
[PATCH v8 4/6] net/mlx5: support Rx queue based available descriptor threshold
Add mlx5 specific available descriptor threshold configuration and query handler. In mlx5 PMD, available descriptor threshold is also called LWM(limit watermark). While the Rx queue fullness reaches the LWM limit, the driver catches an HW event and invokes the user callback. The query handler finds the next RX queue with pending LWM event if any, starting from the given RX queue index. Signed-off-by: Spike Du --- doc/guides/nics/mlx5.rst | 12 +++ doc/guides/rel_notes/release_22_07.rst | 1 + drivers/common/mlx5/mlx5_prm.h | 1 + drivers/net/mlx5/mlx5.c| 2 + drivers/net/mlx5/mlx5_rx.c | 151 + drivers/net/mlx5/mlx5_rx.h | 5 ++ 6 files changed, 172 insertions(+) diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst index d83c56d..cceaddf 100644 --- a/doc/guides/nics/mlx5.rst +++ b/doc/guides/nics/mlx5.rst @@ -93,6 +93,7 @@ Features - Connection tracking. - Sub-Function representors. - Sub-Function. +- Rx queue available descriptor threshold configuration. Limitations @@ -520,6 +521,9 @@ Limitations - The NIC egress flow rules on representor port are not supported. +- Available descriptor threshold: + + - Doesn't support shared Rx queue and Hairpin Rx queue. Statistics -- @@ -1680,3 +1684,11 @@ The procedure below is an example of using a ConnectX-5 adapter card (pf0) with #. For each VF PCIe, using the following command to bind the driver:: $ echo ":82:00.2" >> /sys/bus/pci/drivers/mlx5_core/bind + +Available descriptor threshold introduction +--- + +Available descriptor threshold is a per Rx queue attribute, it should be configured as +a percentage of the Rx queue size. +When Rx queue available descriptors for hardware are below the threshold, an event is sent to PMD. + diff --git a/doc/guides/rel_notes/release_22_07.rst b/doc/guides/rel_notes/release_22_07.rst index 0ed4f92..46fd73a 100644 --- a/doc/guides/rel_notes/release_22_07.rst +++ b/doc/guides/rel_notes/release_22_07.rst @@ -89,6 +89,7 @@ New Features * Added support for promiscuous mode on Windows. * Added support for MTU on Windows. * Added matching and RSS on IPsec ESP. + * Added Rx queue available descriptor threshold support. * **Updated Marvell cnxk crypto driver.** diff --git a/drivers/common/mlx5/mlx5_prm.h b/drivers/common/mlx5/mlx5_prm.h index 630b2c5..3b5e605 100644 --- a/drivers/common/mlx5/mlx5_prm.h +++ b/drivers/common/mlx5/mlx5_prm.h @@ -3293,6 +3293,7 @@ struct mlx5_aso_wqe { enum { MLX5_EVENT_TYPE_OBJECT_CHANGE = 0x27, + MLX5_EVENT_TYPE_SRQ_LIMIT_REACHED = 0x14, }; enum { diff --git a/drivers/net/mlx5/mlx5.c b/drivers/net/mlx5/mlx5.c index e04a666..998846a 100644 --- a/drivers/net/mlx5/mlx5.c +++ b/drivers/net/mlx5/mlx5.c @@ -2071,6 +2071,8 @@ struct mlx5_dev_ctx_shared * .dev_supported_ptypes_get = mlx5_dev_supported_ptypes_get, .vlan_filter_set = mlx5_vlan_filter_set, .rx_queue_setup = mlx5_rx_queue_setup, + .rx_queue_avail_thresh_set = mlx5_rx_queue_lwm_set, + .rx_queue_avail_thresh_query = mlx5_rx_queue_lwm_query, .rx_hairpin_queue_setup = mlx5_rx_hairpin_queue_setup, .tx_queue_setup = mlx5_tx_queue_setup, .tx_hairpin_queue_setup = mlx5_tx_hairpin_queue_setup, diff --git a/drivers/net/mlx5/mlx5_rx.c b/drivers/net/mlx5/mlx5_rx.c index 197d708..2cb7006 100644 --- a/drivers/net/mlx5/mlx5_rx.c +++ b/drivers/net/mlx5/mlx5_rx.c @@ -25,6 +25,7 @@ #include "mlx5.h" #include "mlx5_utils.h" #include "mlx5_rxtx.h" +#include "mlx5_devx.h" #include "mlx5_rx.h" @@ -128,6 +129,16 @@ return RTE_ETH_RX_DESC_AVAIL; } +/* Get rxq lwm percentage according to lwm number. */ +static uint8_t +mlx5_rxq_lwm_to_percentage(struct mlx5_rxq_priv *rxq) +{ + struct mlx5_rxq_data *rxq_data = &rxq->ctrl->rxq; + uint32_t wqe_cnt = 1 << (rxq_data->elts_n - rxq_data->sges_n); + + return rxq->lwm * 100 / wqe_cnt; +} + /** * DPDK callback to get the RX queue information. * @@ -150,6 +161,7 @@ { struct mlx5_rxq_ctrl *rxq_ctrl = mlx5_rxq_ctrl_get(dev, rx_queue_id); struct mlx5_rxq_data *rxq = mlx5_rxq_data_get(dev, rx_queue_id); + struct mlx5_rxq_priv *rxq_priv = mlx5_rxq_get(dev, rx_queue_id); if (!rxq) return; @@ -169,6 +181,8 @@ qinfo->nb_desc = mlx5_rxq_mprq_enabled(rxq) ? RTE_BIT32(rxq->elts_n) * RTE_BIT32(rxq->log_strd_num) : RTE_BIT32(rxq->elts_n); + qinfo->avail_thresh = rxq_priv ? + mlx5_rxq_lwm_to_percentage(rxq_priv) : 0; } /** @@ -1188,6 +1202,34 @@ int mlx5_get_monitor_addr(void *rx_queue, struct rte_power_monitor_cond *pmc) return -ENOTSUP; } +int +mlx5_rx_queue_lwm_query(struct rte_eth_dev *dev, + uint16_t *queue_id, uint8_t *lwm) +{ + struct mlx5_priv *priv = dev->dat
[PATCH v8 6/6] app/testpmd: add Host Shaper command
Add command line options to support host shaper configure. - Command syntax: mlx5 set port host_shaper avail_thresh_triggered <0|1> rate - Example commands: To enable avail_thresh_triggered on port 1 and disable current host shaper: testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0 To disable avail_thresh_triggered and current host shaper on port 1: testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0 The rate unit is 100Mbps. To disable avail_thresh_triggered and configure a shaper of 5Gbps on port 1: testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50 Add sample code to handle rxq available descriptor threshold event, it delays a while so that rxq empties, then disables host shaper and rearms available descriptor threshold event. Signed-off-by: Spike Du --- app/test-pmd/testpmd.c | 7 ++ doc/guides/nics/mlx5.rst| 46 + drivers/net/mlx5/meson.build| 4 + drivers/net/mlx5/mlx5_testpmd.c | 205 drivers/net/mlx5/mlx5_testpmd.h | 26 + 5 files changed, 288 insertions(+) create mode 100644 drivers/net/mlx5/mlx5_testpmd.c create mode 100644 drivers/net/mlx5/mlx5_testpmd.h diff --git a/app/test-pmd/testpmd.c b/app/test-pmd/testpmd.c index 33d9b85..b491719 100644 --- a/app/test-pmd/testpmd.c +++ b/app/test-pmd/testpmd.c @@ -69,6 +69,9 @@ #ifdef RTE_NET_BOND #include #endif +#ifdef RTE_NET_MLX5 +#include "mlx5_testpmd.h" +#endif #include "testpmd.h" @@ -3659,6 +3662,10 @@ struct pmd_test_command { break; printf("Received avail_thresh event, port:%d rxq_id:%d\n", port_id, rxq_id); + +#ifdef RTE_NET_MLX5 + mlx5_test_avail_thresh_event_handler(port_id, rxq_id); +#endif } break; default: diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst index 5f7b060..64eaddf 100644 --- a/doc/guides/nics/mlx5.rst +++ b/doc/guides/nics/mlx5.rst @@ -1727,3 +1727,49 @@ which can be installed from OFED mstflint package. Meson detects ``libmtcr_ul`` existence at configure stage. If the library is detected, the application must link with ``-lmtcr_ul``, as done by the pkg-config file libdpdk.pc. + +How to use available descriptor threshold and Host Shaper +- + +There is a command to configure available descriptor threshold in testpmd. +Testpmd also contains sample logic to handle available descriptor threshold event. +The typical workflow is: testpmd configure available descriptor threshold for Rx queues, enable +avail_thresh_triggered in host shaper and register a callback, when traffic from host is +too high and Rx queue emptiness is below available descriptor threshold, PMD receives an event and +firmware configures a 100Mbps shaper on host port automatically, then PMD call +the callback registered previously, which will delay a while to let Rx queue +empty, then disable host shaper. + +Let's assume we have a simple BlueField 2 setup: port 0 is uplink, port 1 +is VF representor. Each port has 2 Rx queues. +In order to control traffic from host to ARM, we can enable available descriptor threshold in testpmd by: + +.. code-block:: console + + testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0 + testpmd> set port 1 rxq 0 avail_thresh 70 + testpmd> set port 1 rxq 1 avail_thresh 70 + +The first command disables current host shaper, and enables available descriptor threshold triggered mode. +The other commands configure available descriptor threshold to 70% of Rx queue size for both Rx queues, +When traffic from host is too high, you can see testpmd console prints log +about available descriptor threshold event receiving, then host shaper is disabled. +The traffic rate from host is controlled and less drop happens in Rx queues. + +The threshold event and shaper can be disabled like this: + +.. code-block:: console + + testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0 + testpmd> set port 1 rxq 0 avail_thresh 0 + testpmd> set port 1 rxq 1 avail_thresh 0 + +It's recommended an application disables available descriptor threshold and avail_thresh_triggered before exit, +if it enables them before. + +We can also configure the shaper with a value, the rate unit is 100Mbps, below +command sets current shaper to 5Gbps and disables avail_thresh_triggered. + +.. code-block:: console + + testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50 diff --git a/drivers/net/mlx5/meson.build b/drivers/net/mlx5/meson.build index 99210fd..941642b 100644 --- a/drivers/net/mlx5/meson.build +++ b/drivers/net/mlx5/meson.build @@ -68,4 +68,8 @@ if get_option('buildtype').contains('debug') else cflags += [ '-UPEDANTIC' ] endif + +testpmd_sources += files('mlx5_testpmd.c') +testpmd_drivers_deps += 'net_mlx5' + subdir(ex
[PATCH] crypto/scheduler: fix qp_id in scheduler failover
This commit fixes wrong qp_id value in cryptodev scheduler in failover mode. Fixes: 37f075dad1e9 ("crypto/scheduler: add fail-over scheduling mode") Cc: roy.fan.zh...@intel.com Signed-off-by: Jakub Wysocki --- drivers/crypto/scheduler/scheduler_failover.c | 3 +++ 1 file changed, 3 insertions(+) diff --git a/drivers/crypto/scheduler/scheduler_failover.c b/drivers/crypto/scheduler/scheduler_failover.c index 5023577ef8..247aeb2f22 100644 --- a/drivers/crypto/scheduler/scheduler_failover.c +++ b/drivers/crypto/scheduler/scheduler_failover.c @@ -157,6 +157,9 @@ scheduler_start(struct rte_cryptodev *dev) ((struct scheduler_qp_ctx *) dev->data->queue_pairs[i])->private_qp_ctx; +sched_ctx->workers[PRIMARY_WORKER_IDX].qp_id = i; +sched_ctx->workers[SECONDARY_WORKER_IDX].qp_id = i; + rte_memcpy(&qp_ctx->primary_worker, &sched_ctx->workers[PRIMARY_WORKER_IDX], sizeof(struct scheduler_worker)); -- 2.17.1 -- Intel Research and Development Ireland Limited Registered in Ireland Registered Office: Collinstown Industrial Park, Leixlip, County Kildare Registered Number: 308263 This e-mail and any attachments may contain confidential material for the sole use of the intended recipient(s). Any review or distribution by others is strictly prohibited. If you are not the intended recipient, please contact the sender and delete all copies.
RE: [PATCH 1/5] usertools: use non-strict when json-loads in telemetry
> From: Chengwen Feng [mailto:fengcheng...@huawei.com] > Sent: Wednesday, 15 June 2022 09.39 > > Use 'strict=False' in json-loads, it will ignore control characters > (e.g. '\n\t'), this patch is prepared for the support of telemetry dump > in the future. > > Signed-off-by: Chengwen Feng > --- > usertools/dpdk-telemetry.py | 2 +- > 1 file changed, 1 insertion(+), 1 deletion(-) > > diff --git a/usertools/dpdk-telemetry.py b/usertools/dpdk-telemetry.py > index a81868a547..63f8004566 100755 > --- a/usertools/dpdk-telemetry.py > +++ b/usertools/dpdk-telemetry.py > @@ -27,7 +27,7 @@ def read_socket(sock, buf_len, echo=True): > """ Read data from socket and return it in JSON format """ > reply = sock.recv(buf_len).decode() > try: > -ret = json.loads(reply) > +ret = json.loads(reply, strict=False) > except json.JSONDecodeError: > print("Error in reply: ", reply) > sock.close() > -- > 2.33.0 > As I understand this patch, it accepts non-JSON data from the telemetry socket. Isn't it is a major protocol violation if the telemetry socket produces output requiring this modification? Doing that would break other applications expecting strictly JSON formatted output from the telemetry socket.
[dpdk-dev] [PATCH v2] common/cnxk: add ROC API to free MCAM entry
From: Satheesh Paul Add ROC API to free the given MCAM entry. If the MCAM entry has flow counter associated, this API will clear and free the flow counter. Signed-off-by: Satheesh Paul Reviewed-by: Jerin Jacob --- v2: * Removed unnecessary variable initialization drivers/common/cnxk/roc_npc.c | 63 +++-- drivers/common/cnxk/roc_npc.h | 7 drivers/common/cnxk/roc_npc_mcam.c | 39 ++ drivers/common/cnxk/roc_npc_priv.h | 2 + drivers/common/cnxk/roc_npc_utils.c | 15 +++ drivers/common/cnxk/version.map | 3 ++ 6 files changed, 100 insertions(+), 29 deletions(-) diff --git a/drivers/common/cnxk/roc_npc.c b/drivers/common/cnxk/roc_npc.c index da5b96242e..b38389b18a 100644 --- a/drivers/common/cnxk/roc_npc.c +++ b/drivers/common/cnxk/roc_npc.c @@ -55,6 +55,53 @@ roc_npc_mcam_free_entry(struct roc_npc *roc_npc, uint32_t entry) return npc_mcam_free_entry(npc, entry); } +int +roc_npc_mcam_free(struct roc_npc *roc_npc, struct roc_npc_flow *mcam) +{ + int rc = 0; + + if (mcam->use_ctr) { + rc = roc_npc_mcam_clear_counter(roc_npc, mcam->ctr_id); + if (rc) + return rc; + + rc = roc_npc_mcam_free_counter(roc_npc, mcam->ctr_id); + if (rc) + return rc; + } + + return roc_npc_mcam_free_entry(roc_npc, mcam->mcam_id); +} + +int +roc_npc_mcam_init(struct roc_npc *roc_npc, struct roc_npc_flow *flow, + int mcam_id) +{ + struct npc *npc = roc_npc_to_npc_priv(roc_npc); + int rc; + + rc = npc_mcam_init(npc, flow, mcam_id); + if (rc != 0) { + plt_err("npc: mcam initialisation write failed"); + return rc; + } + return 0; +} + +int +roc_npc_mcam_move(struct roc_npc *roc_npc, uint16_t old_ent, uint16_t new_ent) +{ + struct npc *npc = roc_npc_to_npc_priv(roc_npc); + struct mbox *mbox = npc->mbox; + int rc; + + rc = npc_mcam_move(mbox, old_ent, new_ent); + if (rc) + return rc; + + return 0; +} + int roc_npc_mcam_free_all_resources(struct roc_npc *roc_npc) { @@ -383,7 +430,7 @@ npc_parse_actions(struct roc_npc *roc_npc, const struct roc_npc_attr *attr, case ROC_NPC_ACTION_TYPE_COUNT: /* Indicates, need a counter */ - flow->ctr_id = 1; + flow->use_ctr = 1; req_act |= ROC_NPC_ACTION_TYPE_COUNT; break; @@ -1268,7 +1315,7 @@ roc_npc_flow_create(struct roc_npc *roc_npc, const struct roc_npc_attr *attr, return flow; set_rss_failed: - rc = npc_mcam_free_entry(npc, flow->mcam_id); + rc = roc_npc_mcam_free_entry(roc_npc, flow->mcam_id); if (rc != 0) { *errcode = rc; plt_free(flow); @@ -1314,17 +1361,7 @@ roc_npc_flow_destroy(struct roc_npc *roc_npc, struct roc_npc_flow *flow) return rc; } - if (flow->ctr_id != NPC_COUNTER_NONE) { - rc = roc_npc_mcam_clear_counter(roc_npc, flow->ctr_id); - if (rc != 0) - return rc; - - rc = npc_mcam_free_counter(npc, flow->ctr_id); - if (rc != 0) - return rc; - } - - rc = npc_mcam_free_entry(npc, flow->mcam_id); + rc = roc_npc_mcam_free(roc_npc, flow); if (rc != 0) return rc; diff --git a/drivers/common/cnxk/roc_npc.h b/drivers/common/cnxk/roc_npc.h index f92c2a633c..1b4e5521cb 100644 --- a/drivers/common/cnxk/roc_npc.h +++ b/drivers/common/cnxk/roc_npc.h @@ -246,6 +246,7 @@ struct roc_npc_flow { uint8_t nix_intf; uint8_t enable; uint32_t mcam_id; + uint8_t use_ctr; int32_t ctr_id; uint32_t priority; uint32_t mtr_id; @@ -329,6 +330,8 @@ roc_npc_flow_create(struct roc_npc *roc_npc, const struct roc_npc_attr *attr, const struct roc_npc_action actions[], int *errcode); int __roc_api roc_npc_flow_destroy(struct roc_npc *roc_npc, struct roc_npc_flow *flow); +int __roc_api roc_npc_mcam_free(struct roc_npc *roc_npc, + struct roc_npc_flow *mcam); int __roc_api roc_npc_mcam_free_entry(struct roc_npc *roc_npc, uint32_t entry); int __roc_api roc_npc_mcam_enable_all_entries(struct roc_npc *roc_npc, bool enable); @@ -367,4 +370,8 @@ int __roc_api roc_npc_mcam_merge_base_steering_rule(struct roc_npc *roc_npc, struct roc_npc_flow *flow); int __roc_api roc_npc_validate_portid_action(struct roc_npc *roc_npc_src, struct roc_npc *roc_npc_dst); +int __roc_api roc_npc_mcam_init(struct roc_npc *roc_npc, +
Re: [PATCH v2 3/3] test: support trace-autotest when enable trace
On Tue, Jun 14, 2022 at 8:06 AM Chengwen Feng wrote: > > There are a bug[1] when exit application while enable tracing, this > bug has not been discovered for a long time, to quickly detect such > bugs, this patch was introduced. > > This patch adds a testcase with trace enabling, it also depends on > patch[2] because it has a long file-prefix. > > [1] eal: fix segment fault when exit trace > [2] eal: fix trace init fail with long file-prefix This commitlog feels more like a cover letter. Please describe what the impact of the patch is, like mention that the trace_autotest unit test is being called twice, once with traces disabled, and once with traces enabled. And that the traces file is being written in a directory part of the build directory. > > Signed-off-by: Chengwen Feng > --- > app/test/meson.build | 11 +++ > 1 file changed, 11 insertions(+) > > diff --git a/app/test/meson.build b/app/test/meson.build > index 7fe261cae8..eb37aa632a 100644 > --- a/app/test/meson.build > +++ b/app/test/meson.build > @@ -509,6 +509,17 @@ foreach arg : fast_tests > is_parallel : false, > suite : 'fast-tests') > endif > + > +if run_test and arg[0] == 'trace_autotest' and (not is_windows) Calling this test on Windows should not be an issue. Is it not the case? > +test_args += ['--trace=.*'] > +test_args += ['--trace-dir=@0@'.format(meson.current_build_dir())] > +test(arg[0], dpdk_test, 58/68 DPDK:fast-tests / trace_autotest OK 0.16s 59/68 DPDK:fast-tests / trace_autotest OK 0.16s By using the same test name, it's hard to tell what the difference is between those two lines. And practically, as a developer reproducing/troubleshooting a test failure, you can't call only one of the test case. I am not sure what could be done to enhance this.., how about using a dedicated test name? Like below snippet, replacing the whole proposed patch: @@ -508,6 +508,16 @@ foreach arg : fast_tests timeout : timeout_seconds_fast, is_parallel : false, suite : 'fast-tests') +if arg[0] == 'trace_autotest' +test_args += ['--trace=.*'] +test_args += ['--trace-dir=@0@'.format(meson.current_build_dir())] +test(arg[0] + '_with_traces', dpdk_test, +env : ['DPDK_TEST=' + arg[0]], +args : test_args, +timeout : timeout_seconds_fast, +is_parallel : false, +suite : 'fast-tests') +endif endif endforeach -- David Marchand
Re: [PATCH v2 2/3] eal: fix trace init fail with long file-prefix
On Tue, Jun 14, 2022 at 8:06 AM Chengwen Feng wrote: > > Bug scenario: > 1. start testpmd: > dpdk-testpmd -l 4-6 -a :7d:00.0 --trace=.* \ > -file-prefix=trace_autotest -- -i > 2. then observed: > EAL: eal_trace_init():94 failed to initialize trace [File exists] This is not directly related to the issue being fixed, but this error log is really obscure.. Can we enhance this, maybe in a followup patch? > EAL: FATAL: Cannot init trace > EAL: Cannot init trace > EAL: Error - exiting with code: 1 > > The root cause it that the offset set wrong with long file-prefix and > then lead the strftime return failed. > > Fixes: 321dd5f8fa62 ("trace: add internal init and fini interface") > Cc: sta...@dpdk.org > > Signed-off-by: Chengwen Feng Reviewed-by: David Marchand And another topic, I see no reason for the limitation/truncation on the file prefix. Testing with the last patch of the series, I can see odd directory, like EAL: Trace dir: /home/dmarchan/builds/main/build-gcc-shared/app/test/trace_autot-2022-06-15-PM-03-18-39 I'll send a cleanup series for this, later. -- David Marchand
RE: [EXT] [PATCH v7 0/6] drivers/baseband: PMD to support ACC100/ACC101 devices
> v7: updates suggested by Akhil and Maxime. Removed RTE prefix for internal > #define, removed > build-time single FEC engine configuration mode, documentation fix, coding > guideline fix, renamed new file without rte prefix. > > v6: use formally an unified driver for AC100 and AC101. Exposing single > unified > function for the PF device configuration used by bbdev-test. > > v5: minor fix to doc and intermediate commit caught in CI > > v4: incremental updates based on reviews Changed order of patchset to include > two ACC100 specific commits first: updating formally the companion configure > function from PF and the protection adjustment. > Then next 3 commits for the actual changes specific to ACC101: the device > specific function for FCW 5GUL generation used as a local function, combined > documentations into a single .rst common to both devices, updated comments, > removed unused #defines. > > v3: Feedback from Tom Rix: missing copyright, refactor bbdev-test section > calling the configure companion function for ACC100/101, taking the Pmon > commit out which is not directly required. > > v2: Based on good feedback from Thomas and David, now implementing the > ACC101 PMD as a close derivative from existing ACC100 PMD with hooks to > have different behaviour based on variant. > This prevents code duplication and only rely on different functions and > behaviour when hardware requires or support differences. > Note that these are pending changes for ACC100 which would be specific to that > device and not ACC101 but these can be managed based on the new > implementation, ie. is_acc100() etc... (such incremental changes for ACC100 > trending 22.11 but confirming this is future proof). > The serie also includes commits which were meant for ACC101 but are also > valuable for ACC100. > > v1: This serie introduces the PMD for the new bbdev device ACC101 (aka Mount > Cirrus). > This is a derivative from previous Mount Bryce ACC100 which includes silicon > improvement, bug fixes, capacity improvement for 5GNR and feature > improvement. > > > Nicolas Chautru (6): > baseband/acc100: update companion PF configure function > baseband/acc100: add protection for some negative scenario > baseband/acc100: remove RTE prefix for internal macro > baseband/acc100: introduce PMD for ACC101 > baseband/acc100: modify validation code for ACC101 > baseband/acc100: configuration of ACC101 from PF > > MAINTAINERS | 1 + > app/test-bbdev/test_bbdev_perf.c | 6 +- > doc/guides/bbdevs/acc100.rst | 37 +- > doc/guides/bbdevs/features/acc101.ini| 13 + > doc/guides/rel_notes/release_22_07.rst | 3 + > drivers/baseband/acc100/acc100_pf_enum.h | 18 + > drivers/baseband/acc100/acc101_pmd.h | 50 +++ > drivers/baseband/acc100/rte_acc100_cfg.h | 4 +- > drivers/baseband/acc100/rte_acc100_pmd.c | 687 > --- > drivers/baseband/acc100/rte_acc100_pmd.h | 37 +- > drivers/baseband/acc100/version.map | 3 +- > 11 files changed, 768 insertions(+), 91 deletions(-) > create mode 100644 doc/guides/bbdevs/features/acc101.ini > create mode 100644 drivers/baseband/acc100/acc101_pmd.h Series applied to dpdk-next-crypto Thanks.
Re: [PATCH 0/3] bugfix and feature for skeleton DMA
08/06/2022 10:50, Chengwen Feng: > This patch set contains three patches for skeleton DMA. > > Chengwen Feng (2): > dma/skeleton: fix return last-idx when no memcopy completed > test/dma: reset last-idx before invoke DMA completed ops > > Sivaprasad Tummala (1): > dma/skeleton: support multiple instances Applied, thanks.
Re: [PATCH] dma/hisilicon: fix header file self contained
14/06/2022 11:33, Chengwen Feng: > From: "Min Hu (Connor)" > > Header files should be self contained. This patch fixed it. > > Fixes: 9e16317a38f9 ("dma/hisilicon: add probing") > Cc: sta...@dpdk.org > > Signed-off-by: Min Hu (Connor) > Signed-off-by: Chengwen Feng Applied, thanks.
Re: [dpdk-dev] [RFC PATCH v2] raw/ptdma: introduce ptdma driver
28/10/2021 16:54, Sebastian, Selwin: > I am working on making ptdma driver as a dmadev. Will submit new patch for > review. Any update?
RE: [Bug 1035] __rte_raw_cksum() crash with misaligned pointer
> From: bugzi...@dpdk.org [mailto:bugzi...@dpdk.org] > Sent: Wednesday, 15 June 2022 09.16 > > https://bugs.dpdk.org/show_bug.cgi?id=1035 > > Bug ID: 1035 >Summary: __rte_raw_cksum() crash with misaligned pointer >Product: DPDK >Version: 21.11 > Hardware: All > OS: All > Status: UNCONFIRMED > Severity: normal > Priority: Normal > Component: ethdev > Assignee: dev@dpdk.org > Reporter: emil.b...@ericsson.com > Target Milestone: --- > > See rte_raw_cksum() in rte_ip.h, which is part of the public API. See > also the > subfunction __rte_raw_cksum(). > > _rte_raw_cksum assumes that the buffer over which the checksum is > calculated is > an even address (divisible by two). See for example this stack overflow > post: > https://stackoverflow.com/questions/46790550/c-undefined-behavior- > strict-aliasing-rule-or-incorrect-alignment > > The post explains that there is undefined behavior in C11 when > "conversion > between two pointer types produces a result that is incorrectly > aligned". When > the buf argument starts on an odd address we thus have undefined > behavior, > since a pointer is cast from void* to uint16_t*. > > In most cases (at least on x86) that isn't a problem, but with higher > optimization levels it may break due to vector instructions. This new > function > seems to be easier to optimize by the compiler, resulting in a crash > when the > buf argument is odd. Please note that the undefined behavior is present > in > earlier versions of dpdk as well. > > Now you're probably thinking: "Just align your buffers". The problem is > that we > have a packet buffer which is aligned. The checksum is calculated on a > subset > of that aligned packet buffer, and that sometimes lies on odd > addresses. > > The question remains if this is an issue with dpdk or not. I can imagine other systems doing what you describe too. So it needs to be addressed. Off the top of my head, an easy fix would be updating __rte_raw_cksum() like this: static inline uint32_t __rte_raw_cksum(const void *buf, size_t len, uint32_t sum) { if (likely((buf & 1) == 0)) { /* The buffer is 16 bit aligned. */ Keep the existing, optimized implementation here. } else { /* The buffer is not 16 bit aligned. */ Add a new odd-buf tolerant implementation here. } } However, I'm not sure that it covers your scenario! The checksum is 16 bit wide, so if you calculate the checksum of e.g. 4 bytes of memory starting at offset 1 in a 6 byte packet buffer, the memory block can be treated as either 4 or 6 bytes relative to the data covered by the checksum, i.e.: A: XX [01 02] [03 04] XX --> cksum = [04 06] B: [XX 01] [02 03] [04 XX] --> cksum = [06 04] Which one do you need? Perhaps an additional function is required to support your use case, and the documentation for rte_raw_cksum() and __rte_raw_cksum() needs to reflect that the buffer must be 16 bit aligned. Or the rte_raw_cksum() function can be modified to support an odd buffer pointer as outlined above, with documentation added about alignment of the running checksum.
[PATCH v9 1/6] net/mlx5: add LWM support for Rxq
Add lwm(Limit WaterMark) field to Rxq object which indicates the percentage of RX queue size used by HW to raise LWM event to the user. Allow LWM setting in modify_rq command. Allow the LWM configuration dynamically by adding RDY2RDY state change. Signed-off-by: Spike Du --- drivers/net/mlx5/mlx5.h | 1 + drivers/net/mlx5/mlx5_devx.c | 13 - drivers/net/mlx5/mlx5_devx.h | 1 + drivers/net/mlx5/mlx5_rx.h | 1 + 4 files changed, 15 insertions(+), 1 deletion(-) diff --git a/drivers/net/mlx5/mlx5.h b/drivers/net/mlx5/mlx5.h index ef755ee..305edff 100644 --- a/drivers/net/mlx5/mlx5.h +++ b/drivers/net/mlx5/mlx5.h @@ -1395,6 +1395,7 @@ enum mlx5_rxq_modify_type { MLX5_RXQ_MOD_RST2RDY, /* modify state from reset to ready. */ MLX5_RXQ_MOD_RDY2ERR, /* modify state from ready to error. */ MLX5_RXQ_MOD_RDY2RST, /* modify state from ready to reset. */ + MLX5_RXQ_MOD_RDY2RDY, /* modify state from ready to ready. */ }; enum mlx5_txq_modify_type { diff --git a/drivers/net/mlx5/mlx5_devx.c b/drivers/net/mlx5/mlx5_devx.c index 4b48f94..c918a50 100644 --- a/drivers/net/mlx5/mlx5_devx.c +++ b/drivers/net/mlx5/mlx5_devx.c @@ -62,7 +62,7 @@ * @return * 0 on success, a negative errno value otherwise and rte_errno is set. */ -static int +int mlx5_devx_modify_rq(struct mlx5_rxq_priv *rxq, uint8_t type) { struct mlx5_devx_modify_rq_attr rq_attr; @@ -76,6 +76,11 @@ case MLX5_RXQ_MOD_RST2RDY: rq_attr.rq_state = MLX5_RQC_STATE_RST; rq_attr.state = MLX5_RQC_STATE_RDY; + if (rxq->lwm) { + rq_attr.modify_bitmask |= + MLX5_MODIFY_RQ_IN_MODIFY_BITMASK_WQ_LWM; + rq_attr.lwm = rxq->lwm; + } break; case MLX5_RXQ_MOD_RDY2ERR: rq_attr.rq_state = MLX5_RQC_STATE_RDY; @@ -85,6 +90,12 @@ rq_attr.rq_state = MLX5_RQC_STATE_RDY; rq_attr.state = MLX5_RQC_STATE_RST; break; + case MLX5_RXQ_MOD_RDY2RDY: + rq_attr.rq_state = MLX5_RQC_STATE_RDY; + rq_attr.state = MLX5_RQC_STATE_RDY; + rq_attr.modify_bitmask |= MLX5_MODIFY_RQ_IN_MODIFY_BITMASK_WQ_LWM; + rq_attr.lwm = rxq->lwm; + break; default: break; } diff --git a/drivers/net/mlx5/mlx5_devx.h b/drivers/net/mlx5/mlx5_devx.h index a95207a..ebd1da4 100644 --- a/drivers/net/mlx5/mlx5_devx.h +++ b/drivers/net/mlx5/mlx5_devx.h @@ -11,6 +11,7 @@ int mlx5_txq_devx_modify(struct mlx5_txq_obj *obj, enum mlx5_txq_modify_type type, uint8_t dev_port); void mlx5_txq_devx_obj_release(struct mlx5_txq_obj *txq_obj); +int mlx5_devx_modify_rq(struct mlx5_rxq_priv *rxq, uint8_t type); extern struct mlx5_obj_ops devx_obj_ops; diff --git a/drivers/net/mlx5/mlx5_rx.h b/drivers/net/mlx5/mlx5_rx.h index e715ed6..25a5f2c 100644 --- a/drivers/net/mlx5/mlx5_rx.h +++ b/drivers/net/mlx5/mlx5_rx.h @@ -175,6 +175,7 @@ struct mlx5_rxq_priv { struct mlx5_devx_rq devx_rq; struct rte_eth_hairpin_conf hairpin_conf; /* Hairpin configuration. */ uint32_t hairpin_status; /* Hairpin binding status. */ + uint32_t lwm:16; }; /* External RX queue descriptor. */ -- 1.8.3.1
[PATCH v9 0/6] introduce per-queue available descriptor threshold and host shaper
available descriptor threshold(ADT for short) is per RX queue attribute, when RX queue available descriptors for HW is below ADT, HW sends an event to application. Host shaper can configure shaper rate and avail_thresh-triggered for a host port. The shaper limits the rate of traffic from host port to embedded ARM rx port on Nvidia BlueField 2 NIC. If avail_thresh-triggered is enabled, a 100Mbps shaper is enabled automatically when one of the host port's Rx queues receives available descriptor threshold event. These two features can combine to control traffic from host port to wire port for BlueField 2 NIC. The traffic flows from host to embedded ARM, then to the physical port. The work flow is on the ARM system, configure available descriptor threshold to RX queue and enable avail_thresh-triggered flag in host shaper, after receiving available descriptor threshold event, delay a while until RX queue is empty , then disable the shaper. We recycle this work flow to reduce RX queue drops on ARM system. Add new libethdev API to set available descriptor threshold, add rte event RTE_ETH_EVENT_RX_AVAIL_THRESH to handle available descriptor threshold event. For host shaper, because it doesn't align to existing DPDK framework and is specific to Nvidia NIC, use PMD private API. For integration with testpmd, put the private cmdline function and available descriptor threshold event handler in mlx5 PMD directory by adding a new file mlx5_testpmd.c. Follow David Marchand's driver specific commands framework to add mlx5 specific commands. Spike Du (6): net/mlx5: add LWM support for Rxq common/mlx5: share interrupt management net/mlx5: add LWM event handling support net/mlx5: support Rx queue based available descriptor threshold net/mlx5: add private API to config host port shaper app/testpmd: add Host Shaper command app/test-pmd/testpmd.c | 7 + doc/guides/nics/mlx5.rst | 93 + doc/guides/rel_notes/release_22_07.rst | 2 + drivers/common/mlx5/linux/meson.build| 13 ++ drivers/common/mlx5/linux/mlx5_common_os.c | 131 drivers/common/mlx5/linux/mlx5_common_os.h | 11 + drivers/common/mlx5/mlx5_prm.h | 26 +++ drivers/common/mlx5/version.map | 2 + drivers/common/mlx5/windows/mlx5_common_os.h | 24 +++ drivers/net/mlx5/linux/mlx5_ethdev_os.c | 71 --- drivers/net/mlx5/linux/mlx5_os.c | 132 +++- drivers/net/mlx5/linux/mlx5_socket.c | 53 + drivers/net/mlx5/meson.build | 4 + drivers/net/mlx5/mlx5.c | 68 +++ drivers/net/mlx5/mlx5.h | 12 +- drivers/net/mlx5/mlx5_devx.c | 60 +- drivers/net/mlx5/mlx5_devx.h | 1 + drivers/net/mlx5/mlx5_rx.c | 288 +++ drivers/net/mlx5/mlx5_rx.h | 13 ++ drivers/net/mlx5/mlx5_testpmd.c | 205 +++ drivers/net/mlx5/mlx5_testpmd.h | 26 +++ drivers/net/mlx5/mlx5_txpp.c | 28 +-- drivers/net/mlx5/rte_pmd_mlx5.h | 30 +++ drivers/net/mlx5/version.map | 2 + drivers/net/mlx5/windows/mlx5_ethdev_os.c| 22 -- drivers/vdpa/mlx5/mlx5_vdpa_virtq.c | 48 + 26 files changed, 1064 insertions(+), 308 deletions(-) create mode 100644 drivers/net/mlx5/mlx5_testpmd.c create mode 100644 drivers/net/mlx5/mlx5_testpmd.h -- 1.8.3.1
[PATCH v9 2/6] common/mlx5: share interrupt management
There are many duplicate code of creating and initializing rte_intr_handle. Add a new mlx5_os API to do this, replace all PMD related code with this API. Signed-off-by: Spike Du --- drivers/common/mlx5/linux/mlx5_common_os.c | 131 ++ drivers/common/mlx5/linux/mlx5_common_os.h | 11 +++ drivers/common/mlx5/version.map | 2 + drivers/common/mlx5/windows/mlx5_common_os.h | 24 + drivers/net/mlx5/linux/mlx5_ethdev_os.c | 71 -- drivers/net/mlx5/linux/mlx5_os.c | 132 ++- drivers/net/mlx5/linux/mlx5_socket.c | 53 ++- drivers/net/mlx5/mlx5.h | 2 - drivers/net/mlx5/mlx5_txpp.c | 28 ++ drivers/net/mlx5/windows/mlx5_ethdev_os.c| 22 - drivers/vdpa/mlx5/mlx5_vdpa_virtq.c | 48 ++ 11 files changed, 217 insertions(+), 307 deletions(-) diff --git a/drivers/common/mlx5/linux/mlx5_common_os.c b/drivers/common/mlx5/linux/mlx5_common_os.c index d40cfd5..f10a981 100644 --- a/drivers/common/mlx5/linux/mlx5_common_os.c +++ b/drivers/common/mlx5/linux/mlx5_common_os.c @@ -11,6 +11,7 @@ #endif #include #include +#include #include #include @@ -964,3 +965,133 @@ claim_zero(mlx5_glue->dereg_mr(pmd_mr->obj)); memset(pmd_mr, 0, sizeof(*pmd_mr)); } + +/** + * Rte_intr_handle create and init helper. + * + * @param[in] mode + * interrupt instance can be shared between primary and secondary + * processes or not. + * @param[in] set_fd_nonblock + * Whether to set fd to O_NONBLOCK. + * @param[in] fd + * Fd to set in created intr_handle. + * @param[in] cb + * Callback to register for intr_handle. + * @param[in] cb_arg + * Callback argument for cb. + * + * @return + * - Interrupt handle on success. + * - NULL on failure, with rte_errno set. + */ +struct rte_intr_handle * +mlx5_os_interrupt_handler_create(int mode, bool set_fd_nonblock, int fd, +rte_intr_callback_fn cb, void *cb_arg) +{ + struct rte_intr_handle *tmp_intr_handle; + int ret, flags; + + tmp_intr_handle = rte_intr_instance_alloc(mode); + if (!tmp_intr_handle) { + rte_errno = ENOMEM; + goto err; + } + if (set_fd_nonblock) { + flags = fcntl(fd, F_GETFL); + ret = fcntl(fd, F_SETFL, flags | O_NONBLOCK); + if (ret) { + rte_errno = errno; + goto err; + } + } + ret = rte_intr_fd_set(tmp_intr_handle, fd); + if (ret) + goto err; + ret = rte_intr_type_set(tmp_intr_handle, RTE_INTR_HANDLE_EXT); + if (ret) + goto err; + ret = rte_intr_callback_register(tmp_intr_handle, cb, cb_arg); + if (ret) { + rte_errno = -ret; + goto err; + } + return tmp_intr_handle; +err: + if (tmp_intr_handle) + rte_intr_instance_free(tmp_intr_handle); + return NULL; +} + +/* Safe unregistration for interrupt callback. */ +static void +mlx5_intr_callback_unregister(const struct rte_intr_handle *handle, + rte_intr_callback_fn cb_fn, void *cb_arg) +{ + uint64_t twait = 0; + uint64_t start = 0; + + do { + int ret; + + ret = rte_intr_callback_unregister(handle, cb_fn, cb_arg); + if (ret >= 0) + return; + if (ret != -EAGAIN) { + DRV_LOG(INFO, "failed to unregister interrupt" + " handler (error: %d)", ret); + MLX5_ASSERT(false); + return; + } + if (twait) { + struct timespec onems; + + /* Wait one millisecond and try again. */ + onems.tv_sec = 0; + onems.tv_nsec = NS_PER_S / MS_PER_S; + nanosleep(&onems, 0); + /* Check whether one second elapsed. */ + if ((rte_get_timer_cycles() - start) <= twait) + continue; + } else { + /* +* We get the amount of timer ticks for one second. +* If this amount elapsed it means we spent one +* second in waiting. This branch is executed once +* on first iteration. +*/ + twait = rte_get_timer_hz(); + MLX5_ASSERT(twait); + } + /* +* Timeout elapsed, show message (once a second) and retry. +* We have no other acceptable option here, if we ignore +* the unregistering return code the handler will not +* be unr
[PATCH v9 3/6] net/mlx5: add LWM event handling support
When LWM meets RQ WQE, the kernel driver raises an event to SW. Use devx event_channel to catch this and to notify the user. Allocate this channel per shared device. The channel has a cookie that informs the specific event port and queue. Signed-off-by: Spike Du --- drivers/net/mlx5/mlx5.c | 66 drivers/net/mlx5/mlx5.h | 7 + drivers/net/mlx5/mlx5_devx.c | 47 +++ drivers/net/mlx5/mlx5_rx.c | 33 ++ drivers/net/mlx5/mlx5_rx.h | 7 + 5 files changed, 160 insertions(+) diff --git a/drivers/net/mlx5/mlx5.c b/drivers/net/mlx5/mlx5.c index f098871..e04a666 100644 --- a/drivers/net/mlx5/mlx5.c +++ b/drivers/net/mlx5/mlx5.c @@ -9,6 +9,7 @@ #include #include #include +#include #include #include @@ -22,6 +23,7 @@ #include #include #include +#include #include #include @@ -1525,6 +1527,69 @@ struct mlx5_dev_ctx_shared * } /** + * Create LWM event_channel and interrupt handle for shared device + * context. All rxqs sharing the device context share the event_channel. + * A callback is registered in interrupt thread to receive the LWM event. + * + * @param[in] priv + * Pointer to mlx5_priv instance. + * + * @return + * 0 on success, negative with rte_errno set. + */ +int +mlx5_lwm_setup(struct mlx5_priv *priv) +{ + int fd_lwm; + + pthread_mutex_init(&priv->sh->lwm_config_lock, NULL); + priv->sh->devx_channel_lwm = mlx5_os_devx_create_event_channel + (priv->sh->cdev->ctx, +MLX5DV_DEVX_CREATE_EVENT_CHANNEL_FLAGS_OMIT_EV_DATA); + if (!priv->sh->devx_channel_lwm) + goto err; + fd_lwm = mlx5_os_get_devx_channel_fd(priv->sh->devx_channel_lwm); + priv->sh->intr_handle_lwm = mlx5_os_interrupt_handler_create + (RTE_INTR_INSTANCE_F_SHARED, true, +fd_lwm, mlx5_dev_interrupt_handler_lwm, priv); + if (!priv->sh->intr_handle_lwm) + goto err; + return 0; +err: + if (priv->sh->devx_channel_lwm) { + mlx5_os_devx_destroy_event_channel + (priv->sh->devx_channel_lwm); + priv->sh->devx_channel_lwm = NULL; + } + pthread_mutex_destroy(&priv->sh->lwm_config_lock); + return -rte_errno; +} + +/** + * Destroy LWM event_channel and interrupt handle for shared device + * context before free this context. The interrupt handler is also + * unregistered. + * + * @param[in] sh + * Pointer to shared device context. + */ +void +mlx5_lwm_unset(struct mlx5_dev_ctx_shared *sh) +{ + if (sh->intr_handle_lwm) { + mlx5_os_interrupt_handler_destroy(sh->intr_handle_lwm, + mlx5_dev_interrupt_handler_lwm, (void *)-1); + sh->intr_handle_lwm = NULL; + } + if (sh->devx_channel_lwm) { + mlx5_os_devx_destroy_event_channel + (sh->devx_channel_lwm); + sh->devx_channel_lwm = NULL; + } + pthread_mutex_destroy(&sh->lwm_config_lock); +} + +/** * Free shared IB device context. Decrement counter and if zero free * all allocated resources and close handles. * @@ -1601,6 +1666,7 @@ struct mlx5_dev_ctx_shared * claim_zero(mlx5_devx_cmd_destroy(sh->td)); MLX5_ASSERT(sh->geneve_tlv_option_resource == NULL); pthread_mutex_destroy(&sh->txpp.mutex); + mlx5_lwm_unset(sh); mlx5_free(sh); return; exit: diff --git a/drivers/net/mlx5/mlx5.h b/drivers/net/mlx5/mlx5.h index 7ebb2cc..a76f2fe 100644 --- a/drivers/net/mlx5/mlx5.h +++ b/drivers/net/mlx5/mlx5.h @@ -1268,6 +1268,9 @@ struct mlx5_dev_ctx_shared { struct mlx5_lb_ctx self_lb; /* QP to enable self loopback for Devx. */ unsigned int flow_max_priority; enum modify_reg flow_mreg_c[MLX5_MREG_C_NUM]; + void *devx_channel_lwm; + struct rte_intr_handle *intr_handle_lwm; + pthread_mutex_t lwm_config_lock; /* Availability of mreg_c's. */ struct mlx5_dev_shared_port port[]; /* per device port data array. */ }; @@ -1405,6 +1408,7 @@ enum mlx5_txq_modify_type { }; struct mlx5_rxq_priv; +struct mlx5_priv; /* HW objects operations structure. */ struct mlx5_obj_ops { @@ -1413,6 +1417,7 @@ struct mlx5_obj_ops { int (*rxq_event_get)(struct mlx5_rxq_obj *rxq_obj); int (*rxq_obj_modify)(struct mlx5_rxq_priv *rxq, uint8_t type); void (*rxq_obj_release)(struct mlx5_rxq_priv *rxq); + int (*rxq_event_get_lwm)(struct mlx5_priv *priv, int *rxq_idx, int *port_id); int (*ind_table_new)(struct rte_eth_dev *dev, const unsigned int log_n, struct mlx5_ind_table_obj *ind_tbl); int (*ind_table_modify)(struct rte_eth_dev *dev, @@ -1603,6 +1608,8 @@ int mlx5_udp_tunnel_port_add(struct rte_eth_dev *dev, bool mlx5_is_hpf(struct rte_eth_dev *dev); bool mlx5_is_sf_repr
[PATCH v9 4/6] net/mlx5: support Rx queue based available descriptor threshold
Add mlx5 specific available descriptor threshold configuration and query handler. In mlx5 PMD, available descriptor threshold is also called LWM(limit watermark). While the Rx queue fullness reaches the LWM limit, the driver catches an HW event and invokes the user callback. The query handler finds the next RX queue with pending LWM event if any, starting from the given RX queue index. Signed-off-by: Spike Du --- doc/guides/nics/mlx5.rst | 12 +++ doc/guides/rel_notes/release_22_07.rst | 1 + drivers/common/mlx5/mlx5_prm.h | 1 + drivers/net/mlx5/mlx5.c| 2 + drivers/net/mlx5/mlx5_rx.c | 151 + drivers/net/mlx5/mlx5_rx.h | 5 ++ 6 files changed, 172 insertions(+) diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst index d83c56d..cceaddf 100644 --- a/doc/guides/nics/mlx5.rst +++ b/doc/guides/nics/mlx5.rst @@ -93,6 +93,7 @@ Features - Connection tracking. - Sub-Function representors. - Sub-Function. +- Rx queue available descriptor threshold configuration. Limitations @@ -520,6 +521,9 @@ Limitations - The NIC egress flow rules on representor port are not supported. +- Available descriptor threshold: + + - Doesn't support shared Rx queue and Hairpin Rx queue. Statistics -- @@ -1680,3 +1684,11 @@ The procedure below is an example of using a ConnectX-5 adapter card (pf0) with #. For each VF PCIe, using the following command to bind the driver:: $ echo ":82:00.2" >> /sys/bus/pci/drivers/mlx5_core/bind + +Available descriptor threshold introduction +--- + +Available descriptor threshold is a per Rx queue attribute, it should be configured as +a percentage of the Rx queue size. +When Rx queue available descriptors for hardware are below the threshold, an event is sent to PMD. + diff --git a/doc/guides/rel_notes/release_22_07.rst b/doc/guides/rel_notes/release_22_07.rst index 6fc044e..7fb98cd 100644 --- a/doc/guides/rel_notes/release_22_07.rst +++ b/doc/guides/rel_notes/release_22_07.rst @@ -151,6 +151,7 @@ New Features * Added support for promiscuous mode on Windows. * Added support for MTU on Windows. * Added matching and RSS on IPsec ESP. + * Added Rx queue available descriptor threshold support. * **Updated VMware vmxnet3 networking driver.** diff --git a/drivers/common/mlx5/mlx5_prm.h b/drivers/common/mlx5/mlx5_prm.h index 654e5f4..7c4030a 100644 --- a/drivers/common/mlx5/mlx5_prm.h +++ b/drivers/common/mlx5/mlx5_prm.h @@ -3294,6 +3294,7 @@ struct mlx5_aso_wqe { enum { MLX5_EVENT_TYPE_OBJECT_CHANGE = 0x27, + MLX5_EVENT_TYPE_SRQ_LIMIT_REACHED = 0x14, }; enum { diff --git a/drivers/net/mlx5/mlx5.c b/drivers/net/mlx5/mlx5.c index e04a666..998846a 100644 --- a/drivers/net/mlx5/mlx5.c +++ b/drivers/net/mlx5/mlx5.c @@ -2071,6 +2071,8 @@ struct mlx5_dev_ctx_shared * .dev_supported_ptypes_get = mlx5_dev_supported_ptypes_get, .vlan_filter_set = mlx5_vlan_filter_set, .rx_queue_setup = mlx5_rx_queue_setup, + .rx_queue_avail_thresh_set = mlx5_rx_queue_lwm_set, + .rx_queue_avail_thresh_query = mlx5_rx_queue_lwm_query, .rx_hairpin_queue_setup = mlx5_rx_hairpin_queue_setup, .tx_queue_setup = mlx5_tx_queue_setup, .tx_hairpin_queue_setup = mlx5_tx_hairpin_queue_setup, diff --git a/drivers/net/mlx5/mlx5_rx.c b/drivers/net/mlx5/mlx5_rx.c index 197d708..2cb7006 100644 --- a/drivers/net/mlx5/mlx5_rx.c +++ b/drivers/net/mlx5/mlx5_rx.c @@ -25,6 +25,7 @@ #include "mlx5.h" #include "mlx5_utils.h" #include "mlx5_rxtx.h" +#include "mlx5_devx.h" #include "mlx5_rx.h" @@ -128,6 +129,16 @@ return RTE_ETH_RX_DESC_AVAIL; } +/* Get rxq lwm percentage according to lwm number. */ +static uint8_t +mlx5_rxq_lwm_to_percentage(struct mlx5_rxq_priv *rxq) +{ + struct mlx5_rxq_data *rxq_data = &rxq->ctrl->rxq; + uint32_t wqe_cnt = 1 << (rxq_data->elts_n - rxq_data->sges_n); + + return rxq->lwm * 100 / wqe_cnt; +} + /** * DPDK callback to get the RX queue information. * @@ -150,6 +161,7 @@ { struct mlx5_rxq_ctrl *rxq_ctrl = mlx5_rxq_ctrl_get(dev, rx_queue_id); struct mlx5_rxq_data *rxq = mlx5_rxq_data_get(dev, rx_queue_id); + struct mlx5_rxq_priv *rxq_priv = mlx5_rxq_get(dev, rx_queue_id); if (!rxq) return; @@ -169,6 +181,8 @@ qinfo->nb_desc = mlx5_rxq_mprq_enabled(rxq) ? RTE_BIT32(rxq->elts_n) * RTE_BIT32(rxq->log_strd_num) : RTE_BIT32(rxq->elts_n); + qinfo->avail_thresh = rxq_priv ? + mlx5_rxq_lwm_to_percentage(rxq_priv) : 0; } /** @@ -1188,6 +1202,34 @@ int mlx5_get_monitor_addr(void *rx_queue, struct rte_power_monitor_cond *pmc) return -ENOTSUP; } +int +mlx5_rx_queue_lwm_query(struct rte_eth_dev *dev, + uint16_t *queue_id, uint8_t *lwm) +{ + struct mlx5_priv *priv =
[PATCH v9 5/6] net/mlx5: add private API to config host port shaper
Host port shaper can be configured with QSHR(QoS Shaper Host Register). Add check in build files to enable this function or not. The host shaper configuration affects all the ethdev ports belonging to the same host port. Host shaper can configure shaper rate and lwm-triggered for a host port. The shaper limits the rate of traffic from host port to wire port. If lwm-triggered is enabled, a 100Mbps shaper is enabled automatically when one of the host port's Rx queues receives available descriptor threshold event. Signed-off-by: Spike Du --- doc/guides/nics/mlx5.rst | 35 +++ doc/guides/rel_notes/release_22_07.rst | 1 + drivers/common/mlx5/linux/meson.build | 13 + drivers/common/mlx5/mlx5_prm.h | 25 drivers/net/mlx5/mlx5.h| 2 + drivers/net/mlx5/mlx5_rx.c | 104 + drivers/net/mlx5/rte_pmd_mlx5.h| 30 ++ drivers/net/mlx5/version.map | 2 + 8 files changed, 212 insertions(+) diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst index cceaddf..5f7b060 100644 --- a/doc/guides/nics/mlx5.rst +++ b/doc/guides/nics/mlx5.rst @@ -94,6 +94,7 @@ Features - Sub-Function representors. - Sub-Function. - Rx queue available descriptor threshold configuration. +- Host shaper support. Limitations @@ -525,6 +526,12 @@ Limitations - Doesn't support shared Rx queue and Hairpin Rx queue. +- Host shaper: + + - Support BlueField series NIC from BlueField 2. + - When configure host shaper with MLX5_HOST_SHAPER_FLAG_AVAIL_THRESH_TRIGGERED flag set, +only rate 0 and 100Mbps are supported. + Statistics -- @@ -1692,3 +1699,31 @@ Available descriptor threshold is a per Rx queue attribute, it should be configu a percentage of the Rx queue size. When Rx queue available descriptors for hardware are below the threshold, an event is sent to PMD. +Host shaper introduction + + +Host shaper register is per host port register which sets a shaper +on the host port. +All VF/hostPF representors belonging to one host port share one host shaper. +For example, if representor 0 and representor 1 belong to same host port, +and a host shaper rate of 1Gbps is configured, the shaper throttles both +representors' traffic from host. +Host shaper has two modes for setting the shaper, immediate and deferred to +available descriptor threshold event trigger. In immediate mode, the rate limit is configured +immediately to host shaper. When deferring to available descriptor threshold trigger, the shaper +is not set until an available descriptor threshold event is received by any Rx queue in a VF +representor belonging to the host port. The only rate supported for deferred +mode is 100Mbps (there is no limit on the supported rates for immediate mode). +In deferred mode, the shaper is set on the host port by the firmware upon +receiving the available descriptor threshold event, which allows throttling host traffic on +available descriptor threshold events at minimum latency, preventing excess drops in the +Rx queue. + +Host shaper dependency for mstflint package +--- + +In order to configure host shaper register, ``librte_net_mlx5`` depends on ``libmtcr_ul`` +which can be installed from OFED mstflint package. +Meson detects ``libmtcr_ul`` existence at configure stage. +If the library is detected, the application must link with ``-lmtcr_ul``, +as done by the pkg-config file libdpdk.pc. diff --git a/doc/guides/rel_notes/release_22_07.rst b/doc/guides/rel_notes/release_22_07.rst index 7fb98cd..199a775 100644 --- a/doc/guides/rel_notes/release_22_07.rst +++ b/doc/guides/rel_notes/release_22_07.rst @@ -152,6 +152,7 @@ New Features * Added support for MTU on Windows. * Added matching and RSS on IPsec ESP. * Added Rx queue available descriptor threshold support. + * Added host shaper support. * **Updated VMware vmxnet3 networking driver.** diff --git a/drivers/common/mlx5/linux/meson.build b/drivers/common/mlx5/linux/meson.build index 5335f5b..51c6e5d 100644 --- a/drivers/common/mlx5/linux/meson.build +++ b/drivers/common/mlx5/linux/meson.build @@ -45,6 +45,13 @@ if static_ibverbs ext_deps += declare_dependency(link_args:ibv_ldflags.split()) endif +libmtcr_ul_found = false +lib = cc.find_library('mtcr_ul', required:false) +if lib.found() and run_command('meson', '--version').stdout().version_compare('>= 0.49.2') +libmtcr_ul_found = true +ext_deps += lib +endif + sources += files('mlx5_nl.c') sources += files('mlx5_common_auxiliary.c') sources += files('mlx5_common_os.c') @@ -207,6 +214,12 @@ has_sym_args = [ [ 'HAVE_MLX5_IBV_IMPORT_CTX_PD_AND_MR', 'infiniband/verbs.h', 'ibv_import_device' ], ] +if libmtcr_ul_found +has_sym_args += [ +[ 'HAVE_MLX5_MSTFLINT', 'mstflint/mtcr.h', +'mopen'], +] +endif config = configuration_da
[PATCH v9 6/6] app/testpmd: add Host Shaper command
Add command line options to support host shaper configure. - Command syntax: mlx5 set port host_shaper avail_thresh_triggered <0|1> rate - Example commands: To enable avail_thresh_triggered on port 1 and disable current host shaper: testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0 To disable avail_thresh_triggered and current host shaper on port 1: testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0 The rate unit is 100Mbps. To disable avail_thresh_triggered and configure a shaper of 5Gbps on port 1: testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50 Add sample code to handle rxq available descriptor threshold event, it delays a while so that rxq empties, then disables host shaper and rearms available descriptor threshold event. Signed-off-by: Spike Du --- app/test-pmd/testpmd.c | 7 ++ doc/guides/nics/mlx5.rst| 46 + drivers/net/mlx5/meson.build| 4 + drivers/net/mlx5/mlx5_testpmd.c | 205 drivers/net/mlx5/mlx5_testpmd.h | 26 + 5 files changed, 288 insertions(+) create mode 100644 drivers/net/mlx5/mlx5_testpmd.c create mode 100644 drivers/net/mlx5/mlx5_testpmd.h diff --git a/app/test-pmd/testpmd.c b/app/test-pmd/testpmd.c index 205d98e..e6321bd 100644 --- a/app/test-pmd/testpmd.c +++ b/app/test-pmd/testpmd.c @@ -69,6 +69,9 @@ #ifdef RTE_NET_BOND #include #endif +#ifdef RTE_NET_MLX5 +#include "mlx5_testpmd.h" +#endif #include "testpmd.h" @@ -3726,6 +3729,10 @@ struct pmd_test_command { break; printf("Received avail_thresh event, port: %u, rxq_id: %u\n", port_id, rxq_id); + +#ifdef RTE_NET_MLX5 + mlx5_test_avail_thresh_event_handler(port_id, rxq_id); +#endif } break; } diff --git a/doc/guides/nics/mlx5.rst b/doc/guides/nics/mlx5.rst index 5f7b060..64eaddf 100644 --- a/doc/guides/nics/mlx5.rst +++ b/doc/guides/nics/mlx5.rst @@ -1727,3 +1727,49 @@ which can be installed from OFED mstflint package. Meson detects ``libmtcr_ul`` existence at configure stage. If the library is detected, the application must link with ``-lmtcr_ul``, as done by the pkg-config file libdpdk.pc. + +How to use available descriptor threshold and Host Shaper +- + +There is a command to configure available descriptor threshold in testpmd. +Testpmd also contains sample logic to handle available descriptor threshold event. +The typical workflow is: testpmd configure available descriptor threshold for Rx queues, enable +avail_thresh_triggered in host shaper and register a callback, when traffic from host is +too high and Rx queue emptiness is below available descriptor threshold, PMD receives an event and +firmware configures a 100Mbps shaper on host port automatically, then PMD call +the callback registered previously, which will delay a while to let Rx queue +empty, then disable host shaper. + +Let's assume we have a simple BlueField 2 setup: port 0 is uplink, port 1 +is VF representor. Each port has 2 Rx queues. +In order to control traffic from host to ARM, we can enable available descriptor threshold in testpmd by: + +.. code-block:: console + + testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 1 rate 0 + testpmd> set port 1 rxq 0 avail_thresh 70 + testpmd> set port 1 rxq 1 avail_thresh 70 + +The first command disables current host shaper, and enables available descriptor threshold triggered mode. +The other commands configure available descriptor threshold to 70% of Rx queue size for both Rx queues, +When traffic from host is too high, you can see testpmd console prints log +about available descriptor threshold event receiving, then host shaper is disabled. +The traffic rate from host is controlled and less drop happens in Rx queues. + +The threshold event and shaper can be disabled like this: + +.. code-block:: console + + testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 0 + testpmd> set port 1 rxq 0 avail_thresh 0 + testpmd> set port 1 rxq 1 avail_thresh 0 + +It's recommended an application disables available descriptor threshold and avail_thresh_triggered before exit, +if it enables them before. + +We can also configure the shaper with a value, the rate unit is 100Mbps, below +command sets current shaper to 5Gbps and disables avail_thresh_triggered. + +.. code-block:: console + + testpmd> mlx5 set port 1 host_shaper avail_thresh_triggered 0 rate 50 diff --git a/drivers/net/mlx5/meson.build b/drivers/net/mlx5/meson.build index 99210fd..941642b 100644 --- a/drivers/net/mlx5/meson.build +++ b/drivers/net/mlx5/meson.build @@ -68,4 +68,8 @@ if get_option('buildtype').contains('debug') else cflags += [ '-UPEDANTIC' ] endif + +testpmd_sources += files('mlx5_testpmd.c') +testpmd_drivers_deps += 'net_mlx5' + subdir(exec_e
Re: [PATCH 00/12] Fix compilation with gcc 12
On Wed, 15 Jun 2022 10:49:17 +0200 David Marchand wrote: > On Wed, May 18, 2022 at 12:17 PM David Marchand > wrote: > > > > Fedora 36 is out since early may and comes with gcc 12. > > This series fixes compilation or waives some checks. > > > > There might be something fishy with rte_memcpy on x86 but, for now, > > the rte_memcpy related fixes are on the caller side. > > > > Some "base" drivers have issues, I chose the simple solution of waiving > > the checks for them. > > > > Compilation is the only thing checked. > > Please driver maintainers, check nothing got broken. > > I applied the patches that got acked and that had no objection or > comment from maintainers (i.e. patch 3, 4, 5, 6, 9, 11). > I also cleaned the mess in bugzilla where we had multiple reports of > the same issues, or stale bugs that I can't reproduce with released > gcc 12. > > I'll respin separately the patches for which I have clear comments, > and drop my patches waiving the compiler checks. > > We still need to agree on the best approach to handle the new checks. > We have two rfc series from Stephen, how do we move forward? Lets fix all the bugs and remove any workarounds using pragma's. Some of them may mean removing rte_memcpy where it is not needed.
[dpdk-dev] [PATCH 0/8] bnxt PMD fixes
From: Kalesh AP This patchset contains bug fixes in bnxt PMD. Ajit Khaparde (1): net/bnxt: fix switch domain allocation Damodharam Ammepalli (3): net/bnxt: allow Tx only or Rx only configs in PMD net/bnxt: disallow MTU change when device is started net/bnxt: cleanups in MTU set callback Kalesh AP (2): net/bnxt: reduce the verbosity of a log net/bnxt: fix setting forced speed Somnath Kotur (2): net/bnxt: remove assert for zero data len in Tx path net/bnxt: fix the check for autoneg enablement in the PHY FW drivers/net/bnxt/bnxt_ethdev.c | 95 ++ drivers/net/bnxt/bnxt_hwrm.c | 45 +--- drivers/net/bnxt/bnxt_hwrm.h | 3 ++ drivers/net/bnxt/bnxt_rxq.c| 9 +--- drivers/net/bnxt/bnxt_txr.c| 29 +++-- 5 files changed, 101 insertions(+), 80 deletions(-) -- 2.10.1
[dpdk-dev] [PATCH 1/8] net/bnxt: remove assert for zero data len in Tx path
From: Somnath Kotur Currently the PMD tries to detect a potential 0 byte DMA by using RTE_VERIFY. But since RTE_VERIFY internally calls rte_panic() it is fatal to the application and some applications want to avoid that. So return an error from the bnxt xmit handler if such a bad pkt is encountered by logging an error message, dumping the pkt header and dump the current stack as well Signed-off-by: Somnath Kotur Reviewed-by: Ajit Khaparde --- drivers/net/bnxt/bnxt_txr.c | 29 ++--- 1 file changed, 26 insertions(+), 3 deletions(-) diff --git a/drivers/net/bnxt/bnxt_txr.c b/drivers/net/bnxt/bnxt_txr.c index 7a7196a..67e0167 100644 --- a/drivers/net/bnxt/bnxt_txr.c +++ b/drivers/net/bnxt/bnxt_txr.c @@ -123,6 +123,26 @@ bnxt_xmit_need_long_bd(struct rte_mbuf *tx_pkt, struct bnxt_tx_queue *txq) return false; } +static bool +bnxt_zero_data_len_tso_segsz(struct rte_mbuf *tx_pkt, uint8_t data_len_chk) +{ + const char *type_str = "Data len"; + uint16_t len_to_check = tx_pkt->data_len; + + if (data_len_chk == 0) { + type_str = "TSO Seg size"; + len_to_check = tx_pkt->tso_segsz; + } + + if (len_to_check == 0) { + PMD_DRV_LOG(ERR, "Error! Tx pkt %s == 0\n", type_str); + rte_pktmbuf_dump(stdout, tx_pkt, 64); + rte_dump_stack(); + return true; + } + return false; +} + static uint16_t bnxt_start_xmit(struct rte_mbuf *tx_pkt, struct bnxt_tx_queue *txq, uint16_t *coal_pkts, @@ -179,7 +199,8 @@ static uint16_t bnxt_start_xmit(struct rte_mbuf *tx_pkt, } /* Check non zero data_len */ - RTE_VERIFY(tx_pkt->data_len); + if (unlikely(bnxt_zero_data_len_tso_segsz(tx_pkt, 1))) + return -EIO; prod = RING_IDX(ring, txr->tx_raw_prod); tx_buf = &txr->tx_buf_ring[prod]; @@ -256,7 +277,8 @@ static uint16_t bnxt_start_xmit(struct rte_mbuf *tx_pkt, */ txbd1->kid_or_ts_low_hdr_size = hdr_size >> 1; txbd1->kid_or_ts_high_mss = tx_pkt->tso_segsz; - RTE_VERIFY(txbd1->kid_or_ts_high_mss); + if (unlikely(bnxt_zero_data_len_tso_segsz(tx_pkt, 0))) + return -EIO; } else if ((tx_pkt->ol_flags & PKT_TX_OIP_IIP_TCP_UDP_CKSUM) == PKT_TX_OIP_IIP_TCP_UDP_CKSUM) { @@ -330,7 +352,8 @@ static uint16_t bnxt_start_xmit(struct rte_mbuf *tx_pkt, m_seg = tx_pkt->next; while (m_seg) { /* Check non zero data_len */ - RTE_VERIFY(m_seg->data_len); + if (unlikely(bnxt_zero_data_len_tso_segsz(m_seg, 1))) + return -EIO; txr->tx_raw_prod = RING_NEXT(txr->tx_raw_prod); prod = RING_IDX(ring, txr->tx_raw_prod); -- 2.10.1
[dpdk-dev] [PATCH 2/8] net/bnxt: fix switch domain allocation
From: Ajit Khaparde Allocate switch domain after the trusted VF capability is queried from the FW. Currently we are calling the function earlier. Since the switch domain is allocated only for PFs or trusted VF, the current location of code fails to allocate the domain during init. But during cleanup we try to free the domain incorrectly. Fix the behavior by changing the sequence of function calls. Fixes: 3127f99274b67 ("net/bnxt: refactor init/uninit") Cc: sta...@dpdk.org Signed-off-by: Ajit Khaparde Reviewed-by: Somnath Kotur Reviewed-by: Kalesh AP Reviewed-by: Andy Gospodarek --- drivers/net/bnxt/bnxt_ethdev.c | 45 +- 1 file changed, 23 insertions(+), 22 deletions(-) diff --git a/drivers/net/bnxt/bnxt_ethdev.c b/drivers/net/bnxt/bnxt_ethdev.c index 0f0f40b..34f2149 100644 --- a/drivers/net/bnxt/bnxt_ethdev.c +++ b/drivers/net/bnxt/bnxt_ethdev.c @@ -5287,6 +5287,25 @@ bnxt_init_locks(struct bnxt *bp) return err; } +/* This should be called after we have queried trusted VF cap */ +static int bnxt_alloc_switch_domain(struct bnxt *bp) +{ + int rc = 0; + + if (BNXT_PF(bp) || BNXT_VF_IS_TRUSTED(bp)) { + rc = rte_eth_switch_domain_alloc(&bp->switch_domain_id); + if (rc) + PMD_DRV_LOG(ERR, + "Failed to alloc switch domain: %d\n", rc); + else + PMD_DRV_LOG(INFO, + "Switch domain allocated %d\n", + bp->switch_domain_id); + } + + return rc; +} + static int bnxt_init_resources(struct bnxt *bp, bool reconfig_dev) { int rc = 0; @@ -5295,6 +5314,10 @@ static int bnxt_init_resources(struct bnxt *bp, bool reconfig_dev) if (rc) return rc; + rc = bnxt_alloc_switch_domain(bp); + if (rc) + return rc; + if (!reconfig_dev) { rc = bnxt_setup_mac_addr(bp->eth_dev); if (rc) @@ -5734,24 +5757,6 @@ bnxt_parse_dev_args(struct bnxt *bp, struct rte_devargs *devargs) return ret; } -static int bnxt_alloc_switch_domain(struct bnxt *bp) -{ - int rc = 0; - - if (BNXT_PF(bp) || BNXT_VF_IS_TRUSTED(bp)) { - rc = rte_eth_switch_domain_alloc(&bp->switch_domain_id); - if (rc) - PMD_DRV_LOG(ERR, - "Failed to alloc switch domain: %d\n", rc); - else - PMD_DRV_LOG(INFO, - "Switch domain allocated %d\n", - bp->switch_domain_id); - } - - return rc; -} - /* Allocate and initialize various fields in bnxt struct that * need to be allocated/destroyed only once in the lifetime of the driver */ @@ -5828,10 +5833,6 @@ static int bnxt_drv_init(struct rte_eth_dev *eth_dev) if (rc) return rc; - rc = bnxt_alloc_switch_domain(bp); - if (rc) - return rc; - return rc; } -- 2.10.1
[dpdk-dev] [PATCH 3/8] net/bnxt: reduce the verbosity of a log
From: Kalesh AP Broadcom HW drops packets when there are no descriptors available. It does not matter what flag the application specifies in "rx_drop_en" when configuring the Rx ring. Reduce the verbosity of the log to print the status of the "rx_drop_en" when configuring the Rx ring. Signed-off-by: Kalesh AP Reviewed-by: Somnath Kotur Reviewed-by: Ajit Khaparde --- drivers/net/bnxt/bnxt_rxq.c | 5 ++--- 1 file changed, 2 insertions(+), 3 deletions(-) diff --git a/drivers/net/bnxt/bnxt_rxq.c b/drivers/net/bnxt/bnxt_rxq.c index ef2ef83..f4b6417 100644 --- a/drivers/net/bnxt/bnxt_rxq.c +++ b/drivers/net/bnxt/bnxt_rxq.c @@ -360,9 +360,8 @@ int bnxt_rx_queue_setup_op(struct rte_eth_dev *eth_dev, rxq->rx_free_thresh = RTE_MIN(rte_align32pow2(nb_desc) / 4, RTE_BNXT_MAX_RX_BURST); - if (rx_conf->rx_drop_en != BNXT_DEFAULT_RX_DROP_EN) - PMD_DRV_LOG(NOTICE, - "Per-queue config of drop-en is not supported.\n"); + PMD_DRV_LOG(DEBUG, + "App supplied RXQ drop_en status : %d\n", rx_conf->rx_drop_en); rxq->drop_en = BNXT_DEFAULT_RX_DROP_EN; PMD_DRV_LOG(DEBUG, "RX Buf MTU %d\n", eth_dev->data->mtu); -- 2.10.1
[dpdk-dev] [PATCH 4/8] net/bnxt: allow Tx only or Rx only configs in PMD
From: Damodharam Ammepalli Currently, we fail the init/probe of pmd if eth_dev->data->nb_tx_queues or eth_dev->data->nb_rx_queues is 0. We are removing this check. Fixes: daef48efe5e5 ("net/bnxt: support set MTU") Cc: sta...@dpdk.org Signed-off-by: Damodharam Ammepalli Reviewed-by: Ajit Khaparde Reviewed-by: Somnath Kotur --- drivers/net/bnxt/bnxt_ethdev.c | 7 +-- 1 file changed, 1 insertion(+), 6 deletions(-) diff --git a/drivers/net/bnxt/bnxt_ethdev.c b/drivers/net/bnxt/bnxt_ethdev.c index 34f2149..8181e1f 100644 --- a/drivers/net/bnxt/bnxt_ethdev.c +++ b/drivers/net/bnxt/bnxt_ethdev.c @@ -723,7 +723,7 @@ static int bnxt_alloc_prev_ring_stats(struct bnxt *bp) sizeof(struct bnxt_ring_stats) * bp->tx_cp_nr_rings, 0); - if (bp->prev_tx_ring_stats == NULL) + if (bp->tx_cp_nr_rings > 0 && bp->prev_tx_ring_stats == NULL) goto error; return 0; @@ -1567,11 +1567,6 @@ int bnxt_dev_start_op(struct rte_eth_dev *eth_dev) int vlan_mask = 0; int rc, retry_cnt = BNXT_IF_CHANGE_RETRY_COUNT; - if (!eth_dev->data->nb_tx_queues || !eth_dev->data->nb_rx_queues) { - PMD_DRV_LOG(ERR, "Queues are not configured yet!\n"); - return -EINVAL; - } - if (bp->rx_cp_nr_rings > RTE_ETHDEV_QUEUE_STAT_CNTRS) PMD_DRV_LOG(ERR, "RxQ cnt %d > RTE_ETHDEV_QUEUE_STAT_CNTRS %d\n", -- 2.10.1
[dpdk-dev] [PATCH 5/8] net/bnxt: fix setting forced speed
From: Kalesh AP The "active_fec_signal_mode" in HWRM_PORT_PHY_QCFG response does not return correct value till the link is up. Driver cannot rely on active_fec_signal_mode while setting forced speed. While setting forced speed of 50G/100G/200G, check if PAM4 speeds are supported for the port first and then populate the HWRM request accordingly. Also, If PAM4 speed is supported, use PAM4 supported speed while reporting speed capabilities. Fixes: c23f9ded0391 ("net/bnxt: support 200G PAM4 link") Cc: sta...@dpdk.org Signed-off-by: Kalesh AP Reviewed-by: Ajit Khaparde Reviewed-by: Somnath Kotur --- drivers/net/bnxt/bnxt_ethdev.c | 11 ++- drivers/net/bnxt/bnxt_hwrm.c | 27 +++ drivers/net/bnxt/bnxt_hwrm.h | 3 +++ 3 files changed, 28 insertions(+), 13 deletions(-) diff --git a/drivers/net/bnxt/bnxt_ethdev.c b/drivers/net/bnxt/bnxt_ethdev.c index 8181e1f..4e4791c 100644 --- a/drivers/net/bnxt/bnxt_ethdev.c +++ b/drivers/net/bnxt/bnxt_ethdev.c @@ -903,6 +903,7 @@ static int bnxt_shutdown_nic(struct bnxt *bp) uint32_t bnxt_get_speed_capabilities(struct bnxt *bp) { + uint32_t pam4_link_speed = 0; uint32_t link_speed = 0; uint32_t speed_capa = 0; @@ -912,8 +913,8 @@ uint32_t bnxt_get_speed_capabilities(struct bnxt *bp) link_speed = bp->link_info->support_speeds; /* If PAM4 is configured, use PAM4 supported speed */ - if (link_speed == 0 && bp->link_info->support_pam4_speeds > 0) - link_speed = bp->link_info->support_pam4_speeds; + if (bp->link_info->support_pam4_speeds > 0) + pam4_link_speed = bp->link_info->support_pam4_speeds; if (link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_LINK_SPEED_100MB) speed_capa |= RTE_ETH_LINK_SPEED_100M; @@ -935,11 +936,11 @@ uint32_t bnxt_get_speed_capabilities(struct bnxt *bp) speed_capa |= RTE_ETH_LINK_SPEED_50G; if (link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_SPEEDS_100GB) speed_capa |= RTE_ETH_LINK_SPEED_100G; - if (link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_50G) + if (pam4_link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_50G) speed_capa |= RTE_ETH_LINK_SPEED_50G; - if (link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_100G) + if (pam4_link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_100G) speed_capa |= RTE_ETH_LINK_SPEED_100G; - if (link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_200G) + if (pam4_link_speed & HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_200G) speed_capa |= RTE_ETH_LINK_SPEED_200G; if (bp->link_info->auto_mode == diff --git a/drivers/net/bnxt/bnxt_hwrm.c b/drivers/net/bnxt/bnxt_hwrm.c index 9eb8b8d..206ac20 100644 --- a/drivers/net/bnxt/bnxt_hwrm.c +++ b/drivers/net/bnxt/bnxt_hwrm.c @@ -2970,7 +2970,7 @@ static uint16_t bnxt_check_eth_link_autoneg(uint32_t conf_link) } static uint16_t bnxt_parse_eth_link_speed(uint32_t conf_link_speed, - uint16_t pam4_link) + struct bnxt_link_info *link_info) { uint16_t eth_link_speed = 0; @@ -3009,18 +3009,29 @@ static uint16_t bnxt_parse_eth_link_speed(uint32_t conf_link_speed, HWRM_PORT_PHY_CFG_INPUT_FORCE_LINK_SPEED_40GB; break; case RTE_ETH_LINK_SPEED_50G: - eth_link_speed = pam4_link ? - HWRM_PORT_PHY_CFG_INPUT_FORCE_PAM4_LINK_SPEED_50GB : - HWRM_PORT_PHY_CFG_INPUT_FORCE_LINK_SPEED_50GB; + if (link_info->support_pam4_speeds & + HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_50G) { + eth_link_speed = HWRM_PORT_PHY_CFG_INPUT_FORCE_PAM4_LINK_SPEED_50GB; + link_info->link_signal_mode = BNXT_SIG_MODE_PAM4; + } else { + eth_link_speed = HWRM_PORT_PHY_CFG_INPUT_FORCE_LINK_SPEED_50GB; + link_info->link_signal_mode = BNXT_SIG_MODE_NRZ; + } break; case RTE_ETH_LINK_SPEED_100G: - eth_link_speed = pam4_link ? - HWRM_PORT_PHY_CFG_INPUT_FORCE_PAM4_LINK_SPEED_100GB : - HWRM_PORT_PHY_CFG_INPUT_FORCE_LINK_SPEED_100GB; + if (link_info->support_pam4_speeds & + HWRM_PORT_PHY_QCFG_OUTPUT_SUPPORT_PAM4_SPEEDS_100G) { + eth_link_speed = HWRM_PORT_PHY_CFG_INPUT_FORCE_PAM4_LINK_SPEED_100GB; + link_info->link_signal_mode = BNXT_SIG_MODE_PAM4; + } else { + eth_link_speed = HWRM_PORT_PHY_CFG_INPUT_FORCE_LINK_SPEED_100GB; + link_info->link_signal_mode = BNXT_SIG_MODE_NRZ; + } break; case RTE_ETH_LINK_SPEED_200G:
[dpdk-dev] [PATCH 6/8] net/bnxt: disallow MTU change when device is started
From: Damodharam Ammepalli With this patch, bnxt_mtu_set_op() will return an error code if the device has already started. The user application will have to take care to bring down device before invoking the mtu_set() Fixes: daef48efe5e5 ("net/bnxt: support set MTU") Cc: sta...@dpdk.org Signed-off-by: Damodharam Ammepalli Reviewed-by: Andy Gospodarek Reviewed-by: Ajit Khaparde --- drivers/net/bnxt/bnxt_ethdev.c | 28 1 file changed, 8 insertions(+), 20 deletions(-) diff --git a/drivers/net/bnxt/bnxt_ethdev.c b/drivers/net/bnxt/bnxt_ethdev.c index 4e4791c..f040cdc 100644 --- a/drivers/net/bnxt/bnxt_ethdev.c +++ b/drivers/net/bnxt/bnxt_ethdev.c @@ -3019,9 +3019,7 @@ bnxt_tx_burst_mode_get(struct rte_eth_dev *dev, __rte_unused uint16_t queue_id, int bnxt_mtu_set_op(struct rte_eth_dev *eth_dev, uint16_t new_mtu) { - uint32_t overhead = BNXT_MAX_PKT_LEN - BNXT_MAX_MTU; struct bnxt *bp = eth_dev->data->dev_private; - uint32_t new_pkt_size; uint32_t rc; uint32_t i; @@ -3029,25 +3027,15 @@ int bnxt_mtu_set_op(struct rte_eth_dev *eth_dev, uint16_t new_mtu) if (rc) return rc; + /* Return if port is active */ + if (eth_dev->data->dev_started) { + PMD_DRV_LOG(ERR, "Stop port before changing MTU\n"); + return -EPERM; + } + /* Exit if receive queues are not configured yet */ if (!eth_dev->data->nb_rx_queues) - return rc; - - new_pkt_size = new_mtu + overhead; - - /* -* Disallow any MTU change that would require scattered receive support -* if it is not already enabled. -*/ - if (eth_dev->data->dev_started && - !eth_dev->data->scattered_rx && - (new_pkt_size > -eth_dev->data->min_rx_buf_size - RTE_PKTMBUF_HEADROOM)) { - PMD_DRV_LOG(ERR, - "MTU change would require scattered rx support. "); - PMD_DRV_LOG(ERR, "Stop port before changing MTU.\n"); - return -EINVAL; - } + return -ENOTSUP; if (new_mtu > RTE_ETHER_MTU) bp->flags |= BNXT_FLAG_JUMBO; @@ -3056,7 +3044,7 @@ int bnxt_mtu_set_op(struct rte_eth_dev *eth_dev, uint16_t new_mtu) /* Is there a change in mtu setting? */ if (eth_dev->data->mtu == new_mtu) - return rc; + return 0; for (i = 0; i < bp->nr_vnics; i++) { struct bnxt_vnic_info *vnic = &bp->vnic_info[i]; -- 2.10.1
[dpdk-dev] [PATCH 7/8] net/bnxt: cleanups in MTU set callback
From: Damodharam Ammepalli Minor cleanup in bnxt_mtu_set_op() to move pre-mature setting of jumbo flag post mtu check and remove a redundant mtu set operation from rxq vnic configs. Fixes: daef48efe5e5 ("net/bnxt: support set MTU") Cc: sta...@dpdk.org Signed-off-by: Damodharam Ammepalli Reviewed-by: Ajit Khaparde Reviewed-by: Somnath Kotur --- drivers/net/bnxt/bnxt_ethdev.c | 8 drivers/net/bnxt/bnxt_rxq.c| 4 2 files changed, 4 insertions(+), 8 deletions(-) diff --git a/drivers/net/bnxt/bnxt_ethdev.c b/drivers/net/bnxt/bnxt_ethdev.c index f040cdc..e275d3a 100644 --- a/drivers/net/bnxt/bnxt_ethdev.c +++ b/drivers/net/bnxt/bnxt_ethdev.c @@ -3037,15 +3037,15 @@ int bnxt_mtu_set_op(struct rte_eth_dev *eth_dev, uint16_t new_mtu) if (!eth_dev->data->nb_rx_queues) return -ENOTSUP; + /* Is there a change in mtu setting? */ + if (eth_dev->data->mtu == new_mtu) + return 0; + if (new_mtu > RTE_ETHER_MTU) bp->flags |= BNXT_FLAG_JUMBO; else bp->flags &= ~BNXT_FLAG_JUMBO; - /* Is there a change in mtu setting? */ - if (eth_dev->data->mtu == new_mtu) - return 0; - for (i = 0; i < bp->nr_vnics; i++) { struct bnxt_vnic_info *vnic = &bp->vnic_info[i]; uint16_t size = 0; diff --git a/drivers/net/bnxt/bnxt_rxq.c b/drivers/net/bnxt/bnxt_rxq.c index f4b6417..fabbbd4 100644 --- a/drivers/net/bnxt/bnxt_rxq.c +++ b/drivers/net/bnxt/bnxt_rxq.c @@ -402,10 +402,6 @@ int bnxt_rx_queue_setup_op(struct rte_eth_dev *eth_dev, rxq->rx_started = rxq->rx_deferred_start ? false : true; rxq->vnic = BNXT_GET_DEFAULT_VNIC(bp); - /* Configure mtu if it is different from what was configured before */ - if (!queue_idx) - bnxt_mtu_set_op(eth_dev, eth_dev->data->mtu); - return 0; err: bnxt_rx_queue_release_op(eth_dev, queue_idx); -- 2.10.1
[dpdk-dev] [PATCH 8/8] net/bnxt: fix the check for autoneg enablement in the PHY FW
From: Somnath Kotur The current combination of checks to determine whether autoneg is enabled in the card is a bit convoluted and may become incorrect as well in the future as one of the fields being used - auto_link_speed might become deprecated. Switch to using the 'auto_mode' field obtained from the response of HWRM_PHY_QCFG cmd as that is always deterministically set by the PHY FW. Fixed a bug in the 40G check to only look for the bit setting and not the actual value. Also, check the forced speeds first before trying to enforce the auto speeds Allow the user to set autoneg speed in all cases except for PAM4 200G as PAM4 200G will come up only in forced mode. Fixes: c23f9ded0391 ("net/bnxt: support 200G PAM4 link") Signed-off-by: Somnath Kotur Reviewed-by: Kalesh AP Reviewed-by: Ajit Khaparde --- drivers/net/bnxt/bnxt_hwrm.c | 18 +++--- 1 file changed, 11 insertions(+), 7 deletions(-) diff --git a/drivers/net/bnxt/bnxt_hwrm.c b/drivers/net/bnxt/bnxt_hwrm.c index 206ac20..9c52573 100644 --- a/drivers/net/bnxt/bnxt_hwrm.c +++ b/drivers/net/bnxt/bnxt_hwrm.c @@ -3235,9 +3235,11 @@ int bnxt_set_hwrm_link_config(struct bnxt *bp, bool link_up) if (!link_up) goto port_phy_cfg; + /* Get user requested autoneg setting */ autoneg = bnxt_check_eth_link_autoneg(dev_conf->link_speeds); + if (BNXT_CHIP_P5(bp) && - dev_conf->link_speeds == RTE_ETH_LINK_SPEED_40G) { + dev_conf->link_speeds & RTE_ETH_LINK_SPEED_40G) { /* 40G is not supported as part of media auto detect. * The speed should be forced and autoneg disabled * to configure 40G speed. @@ -3246,11 +3248,13 @@ int bnxt_set_hwrm_link_config(struct bnxt *bp, bool link_up) autoneg = 0; } - /* No auto speeds and no auto_pam4_link. Disable autoneg */ - if (bp->link_info->auto_link_speed == 0 && - bp->link_info->link_signal_mode && - bp->link_info->auto_pam4_link_speed_mask == 0) + /* Override based on current Autoneg setting in PHY for 200G */ + if (autoneg == 1 && BNXT_CHIP_P5(bp) && bp->link_info->auto_mode == 0 && + bp->link_info->force_pam4_link_speed == + HWRM_PORT_PHY_CFG_INPUT_FORCE_PAM4_LINK_SPEED_200GB) { autoneg = 0; + PMD_DRV_LOG(DEBUG, "Disabling autoneg for 200G\n"); + } speed = bnxt_parse_eth_link_speed(dev_conf->link_speeds, bp->link_info); @@ -3283,14 +3287,14 @@ int bnxt_set_hwrm_link_config(struct bnxt *bp, bool link_up) else if (bp->link_info->force_pam4_link_speed) link_req.link_speed = bp->link_info->force_pam4_link_speed; + else if (bp->link_info->force_link_speed) + link_req.link_speed = bp->link_info->force_link_speed; else if (bp->link_info->auto_pam4_link_speed_mask) link_req.link_speed = bp->link_info->auto_pam4_link_speed_mask; else if (bp->link_info->support_pam4_speeds) link_req.link_speed = bp->link_info->support_pam4_speeds; - else if (bp->link_info->force_link_speed) - link_req.link_speed = bp->link_info->force_link_speed; else link_req.link_speed = bp->link_info->auto_link_speed; /* Auto PAM4 link speed is zero, but auto_link_speed is not -- 2.10.1
Re: [PATCH 00/12] Fix compilation with gcc 12
15/06/2022 16:45, Stephen Hemminger: > On Wed, 15 Jun 2022 10:49:17 +0200 > David Marchand wrote: > > > On Wed, May 18, 2022 at 12:17 PM David Marchand > > wrote: > > > > > > Fedora 36 is out since early may and comes with gcc 12. > > > This series fixes compilation or waives some checks. > > > > > > There might be something fishy with rte_memcpy on x86 but, for now, > > > the rte_memcpy related fixes are on the caller side. > > > > > > Some "base" drivers have issues, I chose the simple solution of waiving > > > the checks for them. > > > > > > Compilation is the only thing checked. > > > Please driver maintainers, check nothing got broken. > > > > I applied the patches that got acked and that had no objection or > > comment from maintainers (i.e. patch 3, 4, 5, 6, 9, 11). > > I also cleaned the mess in bugzilla where we had multiple reports of > > the same issues, or stale bugs that I can't reproduce with released > > gcc 12. > > > > I'll respin separately the patches for which I have clear comments, > > and drop my patches waiving the compiler checks. > > > > We still need to agree on the best approach to handle the new checks. > > We have two rfc series from Stephen, how do we move forward? > > Lets fix all the bugs and remove any workarounds using pragma's. > > Some of them may mean removing rte_memcpy where it is not needed. What about your series Stephen? Please would you like to respin?
RE: [EXT] [PATCH 1/3] ipsec: fix NAT-T ports and length
> -- > Fix the UDP header fields, wrong byte order used for src and dst port > and wrong offset used when updating UDP datagram length. > > Fixes: 01eef5907fc3 ("ipsec: support NAT-T") > Cc: sta...@dpdk.org > > Signed-off-by: Radu Nicolau Series applied to dpdk-next-crypto Thanks.
RE: [EXT] [PATCH v2] test/crypto: fix warnings for optimization=1 build
> Fail IPSec ESN and antireplay cases, if there are no packets. Fixes > following warning when using optimization=1 build flag with GCC 11. > > ../app/test/test_cryptodev.c: In function ‘test_ipsec_pkt_replay’: > ../app/test/test_cryptodev.c:10074:15: warning: ‘td_outb’ may be used > uninitialized [-Wmaybe-uninitialized] > ret = test_ipsec_proto_process(td_outb, td_inb, nb_pkts, true, >^~~~ > &flags); > ~~~ > ../app/test/test_cryptodev.c:9150:1: note: by argument 1 of type ‘const struct > ipsec_test_data[]’ to ‘test_ipsec_proto_process’ declared here > test_ipsec_proto_process(const struct ipsec_test_data td[], > ^~~~ > ../app/test/test_cryptodev.c:10056:32: note: ‘td_outb’ declared here > struct ipsec_test_data td_outb[IPSEC_TEST_PACKETS_MAX]; > ^~~ > > Bugzilla ID: 1032 > Fixes: d02c6bfcb99a ("test/crypto: add ESN and antireplay cases") > Cc: sta...@dpdk.org > > Reported-by: Daxue Gao > Signed-off-by: Rahul Lakkireddy Applied to dpdk-next-crypto Thanks.
RE: [PATCH] examples/ipsec-secgw: fix ESN setting
> > > > Fix ESN option flag and initial value for the rte_ipsec library path. > > > > Fixes: 560029d5cfc9 ("examples/ipsec-secgw: define initial ESN value") > > Cc: sta...@dpdk.org > > > > Signed-off-by: Radu Nicolau > Acked-by: Fan Zhang Applied to dpdk-next-crypto Thanks.
RE: [PATCH 1/1] examples/ipsec-secgw: add option for descriptors per QP
> > Added option to configure number of queue pair descriptors via command > > line (--desc-nb NUMBER_OF_DESC). > > > > When the crypto processing takes longer durations, small queue pair size > > would result in cryptodev enqueue failures. Larger queue pair size would > > allow more packets to stay in flight simultaneously and reduce enqueue > > failures. > > > > Signed-off-by: Volodymyr Fialko > > Acked-by: Anoob Joseph Acked-by: Akhil Goyal Applied to dpdk-next-crypto
Re: [PATCH 00/12] Fix compilation with gcc 12
On Wed, 15 Jun 2022 16:59:51 +0200 Thomas Monjalon wrote: > 15/06/2022 16:45, Stephen Hemminger: > > On Wed, 15 Jun 2022 10:49:17 +0200 > > David Marchand wrote: > > > > > On Wed, May 18, 2022 at 12:17 PM David Marchand > > > wrote: > > > > > > > > Fedora 36 is out since early may and comes with gcc 12. > > > > This series fixes compilation or waives some checks. > > > > > > > > There might be something fishy with rte_memcpy on x86 but, for now, > > > > the rte_memcpy related fixes are on the caller side. > > > > > > > > Some "base" drivers have issues, I chose the simple solution of waiving > > > > the checks for them. > > > > > > > > Compilation is the only thing checked. > > > > Please driver maintainers, check nothing got broken. > > > > > > I applied the patches that got acked and that had no objection or > > > comment from maintainers (i.e. patch 3, 4, 5, 6, 9, 11). > > > I also cleaned the mess in bugzilla where we had multiple reports of > > > the same issues, or stale bugs that I can't reproduce with released > > > gcc 12. > > > > > > I'll respin separately the patches for which I have clear comments, > > > and drop my patches waiving the compiler checks. > > > > > > We still need to agree on the best approach to handle the new checks. > > > We have two rfc series from Stephen, how do we move forward? > > > > Lets fix all the bugs and remove any workarounds using pragma's. > > > > Some of them may mean removing rte_memcpy where it is not needed. > > What about your series Stephen? > Please would you like to respin? > > Yes will recollate based on current main branch.
RE: [PATCH] examples/ipsec-secgw: fix packet type parsing
> > Add new packet type flags instead of overwriting. > > > > Fixes: d04bb1c52647 ("examples/ipsec-secgw: use HW parsed packet type in > > poll mode") > > > > Signed-off-by: Radu Nicolau > Acked-by: Fan Zhang Acked-by: Akhil Goyal Applied to dpdk-next-crypto Thanks.
RE: [PATCH 0/2] add zuc-256 support
> Adds support for zuc-256 cipher in cnxk crypto PMD. > > Ankur Dwivedi (2): > common/cnxk: swap zuc-256 key > crypto/cnxk: swap zuc-256 iv > > drivers/common/cnxk/roc_se.c | 7 +-- > drivers/common/cnxk/roc_se.h | 22 ++ > drivers/crypto/cnxk/cnxk_se.h | 24 +--- > 3 files changed, 40 insertions(+), 13 deletions(-) > Applied to dpdk-next-crypto Thanks.
RE: [PATCH] crypto/cnxk: fix condition check
> > > Subject: [PATCH] crypto/cnxk: fix condition check > > > > The value of ec_mdata cannot be NULL, instead check that value of > > private_data_offset was set. > > > > > > Fixes: c3a498c01121 ("crypto/cnxk: add event metadata set operation") > > Coverity issue: 378861 Coverity issue: 378865 > > > > Signed-off-by: Volodymyr Fialko > > Acked-by: Anoob Joseph Applied to dpdk-next-crypto Thanks.
RE: [PATCH 1/2] crypto/octeontx: pass sub event type in event
> Subject: [PATCH 1/2] crypto/octeontx: pass sub event type in event > > Response info may have valid sub_event_type. Pass this to the event > generated by CPT. > > Signed-off-by: Anoob Joseph > Reviewed-by: Shijith Thotton > Reviewed-by: Jerin Jacob Kollanukkaran Series applied to dpdk-next-crypto Thanks.
RE: [EXT] [PATCH v4] app/test-crypto-perf: support SDAP for PDCP operations
> Add a command line option "--enable-sdap" to enable > Service Data Adaptation Protocol. > > example command: > ./dpdk-test-crypto-perf -c 0xc --log-level=8 -- > --devtype crypto_dpaa2_sec --optype pdcp --cipher-algo aes-ctr > --cipher-op encrypt --auth-algo zuc-eia3 --auth-op generate > --auth-key-sz 16 --ptest throughput --total-ops 10 --burst-sz 64 > --buffer-sz 64,390,1512 --pool-sz 4096 --silent --pdcp-sn-sz 12 > --pdcp-domain control --enable-sdap > > Signed-off-by: Gagandeep Singh Acked-by: Akhil Goyal Applied to dpdk-next-crypto Thanks.
Re: [PATCH 1/5] usertools: use non-strict when json-loads in telemetry
On Wed, Jun 15, 2022 at 03:54:57PM +0200, Morten Brørup wrote: > > From: Chengwen Feng [mailto:fengcheng...@huawei.com] > > Sent: Wednesday, 15 June 2022 09.39 > > > > Use 'strict=False' in json-loads, it will ignore control characters > > (e.g. '\n\t'), this patch is prepared for the support of telemetry dump > > in the future. > > > > Signed-off-by: Chengwen Feng > > --- > > usertools/dpdk-telemetry.py | 2 +- > > 1 file changed, 1 insertion(+), 1 deletion(-) > > > > diff --git a/usertools/dpdk-telemetry.py b/usertools/dpdk-telemetry.py > > index a81868a547..63f8004566 100755 > > --- a/usertools/dpdk-telemetry.py > > +++ b/usertools/dpdk-telemetry.py > > @@ -27,7 +27,7 @@ def read_socket(sock, buf_len, echo=True): > > """ Read data from socket and return it in JSON format """ > > reply = sock.recv(buf_len).decode() > > try: > > -ret = json.loads(reply) > > +ret = json.loads(reply, strict=False) > > except json.JSONDecodeError: > > print("Error in reply: ", reply) > > sock.close() > > -- > > 2.33.0 > > > > As I understand this patch, it accepts non-JSON data from the telemetry > socket. > > Isn't it is a major protocol violation if the telemetry socket produces > output requiring this modification? Doing that would break other applications > expecting strictly JSON formatted output from the telemetry socket. > I would tend to agree. As an alternative, I think you should add to the telemetry library an "escape string" function which can then be used by the telemetry functions to properly json encode the strings back from the dump functions before sending them out. /Bruce
Re: [PATCH] net/octeontx_ep: updated ethdev ops
On Thu, Jun 9, 2022 at 8:50 PM Sathesh B Edara wrote: > > Updated ethdev ops support for link_update(), > stats_get() and stats_reset() a) Split as two patches (Suggested heading) 1) net/octeontx_ep: support basic stats 2) net/octeontx_ep: support link status b) Update doc/guides/nics/features/octeontx_ep.ini See doc/guides/nics/features.rst > > Signed-off-by: Sathesh B Edara Since this is your first path, you may consider changing to "Sathesh Edara " > +static int > +otx_ep_link_update(struct rte_eth_dev *eth_dev, int wait_to_complete) > +{ > + struct rte_eth_link link; > + > + RTE_SET_USED(wait_to_complete); > + > + if (!eth_dev->data->dev_started) > + return 0; > + > + link.link_speed = RTE_ETH_SPEED_NUM_10G; Isn't this RTE_ETH_SPEED_NUM_NONE? as its speed is "Not defined" >
[PATCH v2 0/2] add includes to help when editing in Eclipse
These two small patches add in some additional header includes to 3 DPDK files, in order to improve the experience of anyone who, like me, sometimes uses eclipse-cdt as an editor for working with DPDK code. The C indexer in eclipse seems easily confused when missing some of our macro definitions, so we can reduce that confusion by ensuring that we include all needed headers to get those definitions. V2: fix copy-paste error Bruce Richardson (2): examples/l3fwd: add include for macro definition common/cnxk: add include for macro definition drivers/common/cnxk/roc_io.h | 2 ++ examples/l3fwd/l3fwd_em.h| 2 ++ examples/l3fwd/l3fwd_em_sequential.h | 2 ++ 3 files changed, 6 insertions(+) -- 2.34.1
[PATCH v2 2/2] common/cnxk: add include for macro definition
The header file "roc_io.h" uses the "__plt_always_inline" macro but don't include "roc_platform.h" to get the definition of it. This inclusion is not necessary for compilation, but the lack of it can confuse some indexers - such as those in eclipse, which reports the lines: "static __plt_always_inline uint64_t" as possible definitions of a variable called "uint64_t". This confusion leads to uint64_t being flagged as an unknown type in all other parts of the project being indexed, e.g. across all of DPDK code. Adding in the include of roc_platform.h makes it clear to the indexer that those lines are part of a function definition, and that allows eclipse to correctly recognise uint64_t as a type from stdint.h Signed-off-by: Bruce Richardson --- V2: fix copy-paste error, where header name replacement was missed --- drivers/common/cnxk/roc_io.h | 2 ++ 1 file changed, 2 insertions(+) diff --git a/drivers/common/cnxk/roc_io.h b/drivers/common/cnxk/roc_io.h index 62e98d9d00..9d73e263f7 100644 --- a/drivers/common/cnxk/roc_io.h +++ b/drivers/common/cnxk/roc_io.h @@ -5,6 +5,8 @@ #ifndef _ROC_IO_H_ #define _ROC_IO_H_ +#include "roc_platform.h" /* for __plt_always_inline macro */ + #define ROC_LMT_BASE_ID_GET(lmt_addr, lmt_id) \ do { \ /* 32 Lines per core */\ -- 2.34.1
[PATCH v2 1/2] examples/l3fwd: add include for macro definition
The header files "l3fwd_em.h" and "l3fwd_em_sequential.h" use the "__rte_always_inline" macro but don't directly include "rte_common.h" to get the definition of it. This inclusion is not necessary for compilation, but the lack of it can confuse some indexers - such as those in eclipse, which reports the lines: "static __rte_always_inline uint16_t" as possible definitions of a variable called "uint16_t". This confusion leads to uint16_t being flagged as an unknown type in all other parts of the project being indexed, e.g. across all of DPDK code. Adding in the include of rte_common.h makes it clear to the indexer that those lines are part of a function definition, and that allows eclipse to correctly recognise uint16_t as a type from stdint.h Signed-off-by: Bruce Richardson --- examples/l3fwd/l3fwd_em.h| 2 ++ examples/l3fwd/l3fwd_em_sequential.h | 2 ++ 2 files changed, 4 insertions(+) diff --git a/examples/l3fwd/l3fwd_em.h b/examples/l3fwd/l3fwd_em.h index 11bd951858..fe2ee59f6a 100644 --- a/examples/l3fwd/l3fwd_em.h +++ b/examples/l3fwd/l3fwd_em.h @@ -5,6 +5,8 @@ #ifndef __L3FWD_EM_H__ #define __L3FWD_EM_H__ +#include + static __rte_always_inline uint16_t l3fwd_em_handle_ipv4(struct rte_mbuf *m, uint16_t portid, struct rte_ether_hdr *eth_hdr, struct lcore_conf *qconf) diff --git a/examples/l3fwd/l3fwd_em_sequential.h b/examples/l3fwd/l3fwd_em_sequential.h index f426c508ef..d2f75edb8a 100644 --- a/examples/l3fwd/l3fwd_em_sequential.h +++ b/examples/l3fwd/l3fwd_em_sequential.h @@ -5,6 +5,8 @@ #ifndef __L3FWD_EM_SEQUENTIAL_H__ #define __L3FWD_EM_SEQUENTIAL_H__ +#include + /** * @file * This is an optional implementation of packet classification in Exact-Match -- 2.34.1
[PATCH] common/iavf: replace zero-length arrays with flexible ones
From: Shiqi Liu <835703...@qq.com> This patch replaces instances of zero-sized arrays i.e. those at the end of structures with "[0]" with the more standard syntax of "[]". Replacement was done using coccinelle script, with some revert and cleanup of whitespace afterwards. Signed-off-by: Shiqi Liu <835703...@qq.com> --- drivers/common/iavf/virtchnl_inline_ipsec.h | 24 ++--- drivers/common/mlx5/mlx5_prm.h | 4 ++-- 2 files changed, 14 insertions(+), 14 deletions(-) diff --git a/drivers/common/iavf/virtchnl_inline_ipsec.h b/drivers/common/iavf/virtchnl_inline_ipsec.h index 2f4bf15725..9d5c433d32 100644 --- a/drivers/common/iavf/virtchnl_inline_ipsec.h +++ b/drivers/common/iavf/virtchnl_inline_ipsec.h @@ -500,25 +500,25 @@ struct inline_ipsec_msg { union { /* IPsec request */ - struct virtchnl_ipsec_sa_cfg sa_cfg[0]; - struct virtchnl_ipsec_sp_cfg sp_cfg[0]; - struct virtchnl_ipsec_sa_update sa_update[0]; - struct virtchnl_ipsec_sa_destroy sa_destroy[0]; - struct virtchnl_ipsec_sp_destroy sp_destroy[0]; + struct virtchnl_ipsec_sa_cfg sa_cfg[]; + struct virtchnl_ipsec_sp_cfg sp_cfg[]; + struct virtchnl_ipsec_sa_update sa_update []; + struct virtchnl_ipsec_sa_destroy sa_destroy[]; + struct virtchnl_ipsec_sp_destroy sp_destroy[]; /* IPsec response */ - struct virtchnl_ipsec_sa_cfg_resp sa_cfg_resp[0]; - struct virtchnl_ipsec_sp_cfg_resp sp_cfg_resp[0]; - struct virtchnl_ipsec_cap ipsec_cap[0]; - struct virtchnl_ipsec_status ipsec_status[0]; + struct virtchnl_ipsec_sa_cfg_resp sa_cfg_resp[]; + struct virtchnl_ipsec_sp_cfg_resp sp_cfg_resp[]; + struct virtchnl_ipsec_cap ipsec_cap[]; + struct virtchnl_ipsec_status ipsec_status[]; /* response to del_sa, del_sp, update_sa */ - struct virtchnl_ipsec_resp ipsec_resp[0]; + struct virtchnl_ipsec_resp ipsec_resp[]; /* IPsec event (no req_id is required) */ - struct virtchnl_ipsec_event event[0]; + struct virtchnl_ipsec_event event[]; /* Reserved */ - struct virtchnl_ipsec_sa_read sa_read[0]; + struct virtchnl_ipsec_sa_read sa_read[]; } ipsec_data; } __rte_packed; diff --git a/drivers/common/mlx5/mlx5_prm.h b/drivers/common/mlx5/mlx5_prm.h index 654e5f44ee..c841d34216 100644 --- a/drivers/common/mlx5/mlx5_prm.h +++ b/drivers/common/mlx5/mlx5_prm.h @@ -552,8 +552,8 @@ struct mlx5_umr_wqe { struct mlx5_wqe_umr_cseg ucseg; struct mlx5_wqe_mkey_cseg mkc; union { - struct mlx5_wqe_dseg kseg[0]; - struct mlx5_wqe_umr_bsf_seg bsf[0]; + struct mlx5_wqe_dseg kseg[]; + struct mlx5_wqe_umr_bsf_seg bsf[]; }; } __rte_packed; -- 2.35.1.windows.2
[V2] common/iavf:replace zero-length arrays with flexible ones
From: Shiqi Liu <835703...@qq.com> This patch replaces instances of zero-sized arrays i.e. those at the end of structures with "[0]" with the more standard syntax of "[]". Replacement was done using coccinelle script, with some revert and cleanup of whitespace afterwards. v2: * revise a error with BRACKET_SPACE Signed-off-by: Shiqi Liu <835703...@qq.com> --- ...ace-zero-length-arrays-with-flexible.patch | 76 +++ drivers/common/iavf/virtchnl_inline_ipsec.h | 2 +- 2 files changed, 77 insertions(+), 1 deletion(-) create mode 100644 0001-common-iavf-replace-zero-length-arrays-with-flexible.patch diff --git a/0001-common-iavf-replace-zero-length-arrays-with-flexible.patch b/0001-common-iavf-replace-zero-length-arrays-with-flexible.patch new file mode 100644 index 00..25c5658791 --- /dev/null +++ b/0001-common-iavf-replace-zero-length-arrays-with-flexible.patch @@ -0,0 +1,76 @@ +From e680b5d91aad9e22d45652efdd85cd2c40105247 Mon Sep 17 00:00:00 2001 +From: Shiqi Liu <835703...@qq.com> +Date: Thu, 16 Jun 2022 01:12:47 +0800 +Subject: [PATCH] common/iavf: replace zero-length arrays with flexible ones + +This patch replaces instances of zero-sized arrays i.e. those at the +end of structures with "[0]" with the more standard syntax of "[]". +Replacement was done using coccinelle script, with some revert and +cleanup of whitespace afterwards. + +Signed-off-by: Shiqi Liu <835703...@qq.com> +--- + drivers/common/iavf/virtchnl_inline_ipsec.h | 24 ++--- + drivers/common/mlx5/mlx5_prm.h | 4 ++-- + 2 files changed, 14 insertions(+), 14 deletions(-) + +diff --git a/drivers/common/iavf/virtchnl_inline_ipsec.h b/drivers/common/iavf/virtchnl_inline_ipsec.h +index 2f4bf15725..9d5c433d32 100644 +--- a/drivers/common/iavf/virtchnl_inline_ipsec.h b/drivers/common/iavf/virtchnl_inline_ipsec.h +@@ -500,25 +500,25 @@ struct inline_ipsec_msg { + + union { + /* IPsec request */ +- struct virtchnl_ipsec_sa_cfg sa_cfg[0]; +- struct virtchnl_ipsec_sp_cfg sp_cfg[0]; +- struct virtchnl_ipsec_sa_update sa_update[0]; +- struct virtchnl_ipsec_sa_destroy sa_destroy[0]; +- struct virtchnl_ipsec_sp_destroy sp_destroy[0]; ++ struct virtchnl_ipsec_sa_cfg sa_cfg[]; ++ struct virtchnl_ipsec_sp_cfg sp_cfg[]; ++ struct virtchnl_ipsec_sa_update sa_update []; ++ struct virtchnl_ipsec_sa_destroy sa_destroy[]; ++ struct virtchnl_ipsec_sp_destroy sp_destroy[]; + + /* IPsec response */ +- struct virtchnl_ipsec_sa_cfg_resp sa_cfg_resp[0]; +- struct virtchnl_ipsec_sp_cfg_resp sp_cfg_resp[0]; +- struct virtchnl_ipsec_cap ipsec_cap[0]; +- struct virtchnl_ipsec_status ipsec_status[0]; ++ struct virtchnl_ipsec_sa_cfg_resp sa_cfg_resp[]; ++ struct virtchnl_ipsec_sp_cfg_resp sp_cfg_resp[]; ++ struct virtchnl_ipsec_cap ipsec_cap[]; ++ struct virtchnl_ipsec_status ipsec_status[]; + /* response to del_sa, del_sp, update_sa */ +- struct virtchnl_ipsec_resp ipsec_resp[0]; ++ struct virtchnl_ipsec_resp ipsec_resp[]; + + /* IPsec event (no req_id is required) */ +- struct virtchnl_ipsec_event event[0]; ++ struct virtchnl_ipsec_event event[]; + + /* Reserved */ +- struct virtchnl_ipsec_sa_read sa_read[0]; ++ struct virtchnl_ipsec_sa_read sa_read[]; + } ipsec_data; + } __rte_packed; + +diff --git a/drivers/common/mlx5/mlx5_prm.h b/drivers/common/mlx5/mlx5_prm.h +index 654e5f44ee..c841d34216 100644 +--- a/drivers/common/mlx5/mlx5_prm.h b/drivers/common/mlx5/mlx5_prm.h +@@ -552,8 +552,8 @@ struct mlx5_umr_wqe { + struct mlx5_wqe_umr_cseg ucseg; + struct mlx5_wqe_mkey_cseg mkc; + union { +- struct mlx5_wqe_dseg kseg[0]; +- struct mlx5_wqe_umr_bsf_seg bsf[0]; ++ struct mlx5_wqe_dseg kseg[]; ++ struct mlx5_wqe_umr_bsf_seg bsf[]; + }; + } __rte_packed; + +-- +2.35.1.windows.2 + diff --git a/drivers/common/iavf/virtchnl_inline_ipsec.h b/drivers/common/iavf/virtchnl_inline_ipsec.h index 9d5c433d32..0781de1fe1 100644 --- a/drivers/common/iavf/virtchnl_inline_ipsec.h +++ b/drivers/common/iavf/virtchnl_inline_ipsec.h @@ -502,7 +502,7 @@ struct inline_ipsec_msg { /* IPsec request */ struct virtchnl_ipsec_sa_cfg sa_cfg[]; struct virtchnl_ipsec_sp_cfg sp_cfg[]; - struct virtchnl_ipsec_sa_update sa_update []; + struct virtchnl_ipsec_sa_update sa_update[]; struct virtchnl_ipsec_sa_destroy sa_destroy[]; struct virtchnl_ipsec_sp_destroy sp_destroy[]; -- 2.35.1.windows.2
Re: [PATCH] common/cnxk: return on fail to init ROC Model
On Fri, Jun 10, 2022 at 1:44 PM Hanumanth Pothula wrote: > > Return with error on fail to initialize RoC Model. > > Signed-off-by: Hanumanth Pothula Updated the git commit as follows and applied to dpdk-next-net-mrvl/for-next-net. Thanks common/cnxk: handle ROC model init failure Return with error on fail to initialize ROC model. Fixes: 014a9e222bac ("common/cnxk: add model init and IO handling API") Cc: sta...@dpdk.org Signed-off-by: Hanumanth Pothula > --- > drivers/common/cnxk/roc_platform.c | 7 ++- > 1 file changed, 6 insertions(+), 1 deletion(-) > > diff --git a/drivers/common/cnxk/roc_platform.c > b/drivers/common/cnxk/roc_platform.c > index ebb6225f4d..443aa8d396 100644 > --- a/drivers/common/cnxk/roc_platform.c > +++ b/drivers/common/cnxk/roc_platform.c > @@ -37,7 +37,12 @@ roc_plt_init(void) > plt_err("Failed to reserve mem for > roc_model"); > return -ENOMEM; > } > - roc_model_init(mz->addr); > + if (roc_model_init(mz->addr)) { > + plt_err("Failed to init roc_model"); > + > + rte_memzone_free(mz); > + return -EINVAL; > + } > } > } else { > if (mz == NULL) { > -- > 2.25.1 >
RE: [PATCH 1/5] usertools: use non-strict when json-loads in telemetry
+CC Ciara Power, Telemetry lib maintainer > From: Bruce Richardson [mailto:bruce.richard...@intel.com] > Sent: Wednesday, 15 June 2022 18.55 > > On Wed, Jun 15, 2022 at 03:54:57PM +0200, Morten Brørup wrote: > > > From: Chengwen Feng [mailto:fengcheng...@huawei.com] > > > Sent: Wednesday, 15 June 2022 09.39 > > > > > > Use 'strict=False' in json-loads, it will ignore control characters > > > (e.g. '\n\t'), this patch is prepared for the support of telemetry > dump > > > in the future. > > > > > > Signed-off-by: Chengwen Feng > > > --- > > > usertools/dpdk-telemetry.py | 2 +- > > > 1 file changed, 1 insertion(+), 1 deletion(-) > > > > > > diff --git a/usertools/dpdk-telemetry.py b/usertools/dpdk- > telemetry.py > > > index a81868a547..63f8004566 100755 > > > --- a/usertools/dpdk-telemetry.py > > > +++ b/usertools/dpdk-telemetry.py > > > @@ -27,7 +27,7 @@ def read_socket(sock, buf_len, echo=True): > > > """ Read data from socket and return it in JSON format """ > > > reply = sock.recv(buf_len).decode() > > > try: > > > -ret = json.loads(reply) > > > +ret = json.loads(reply, strict=False) > > > except json.JSONDecodeError: > > > print("Error in reply: ", reply) > > > sock.close() > > > -- > > > 2.33.0 > > > > > > > As I understand this patch, it accepts non-JSON data from the > telemetry socket. > > > > Isn't it is a major protocol violation if the telemetry socket > produces output requiring this modification? Doing that would break > other applications expecting strictly JSON formatted output from the > telemetry socket. > > > > I would tend to agree. > > As an alternative, I think you should add to the telemetry library an > "escape string" function which can then be used by the telemetry > functions > to properly json encode the strings back from the dump functions before > sending them out. Instead of adding a separate JSON encode function, the rte_tel_data_string() and rte_tel_data_add_array_string() functions should simply JSON encode the provided strings if required. Their descriptions do not mention any requirements to the strings provided, and being control plane functions, I would certainly expect them to JSON encode. Warning: Although I consider such a change a bugfix, others might consider it an ABI breakage. :-( @Ciara, what is your opinion about my suggestion here? For minimal changes, RTE_TEL_MAX_STRING_LEN and RTE_TEL_MAX_SINGLE_STRING_LEN should keep their meaning, i.e. define the maximum length of the string after any JSON encoding. And optionally, new rte_tel_data_[array_]string_raw() performance optimized functions could be provided for strings known not to require any encoding.
Re: [PATCH v2 2/3] eal: fix trace init fail with long file-prefix
On Wed, Jun 15, 2022 at 7:32 PM David Marchand wrote: > > On Tue, Jun 14, 2022 at 8:06 AM Chengwen Feng wrote: > > > > Bug scenario: > > 1. start testpmd: > > dpdk-testpmd -l 4-6 -a :7d:00.0 --trace=.* \ > > -file-prefix=trace_autotest -- -i > > 2. then observed: > > EAL: eal_trace_init():94 failed to initialize trace [File exists] > > This is not directly related to the issue being fixed, but this error > log is really obscure.. > Can we enhance this, maybe in a followup patch? > > > > EAL: FATAL: Cannot init trace > > EAL: Cannot init trace > > EAL: Error - exiting with code: 1 > > > > The root cause it that the offset set wrong with long file-prefix and > > then lead the strftime return failed. > > > > Fixes: 321dd5f8fa62 ("trace: add internal init and fini interface") > > Cc: sta...@dpdk.org > > > > Signed-off-by: Chengwen Feng > > Reviewed-by: David Marchand Acked-by: Jerin Jacob
Re: [EXT] Re: [PATCH v2] devtools: ensure proper tag sequence
15/06/2022 10:46, Jakub Palider: > > -Original Message- > > From: Thomas Monjalon > > Sent: Wednesday, June 15, 2022 8:21 AM > > To: Jakub Palider > > Cc: dev@dpdk.org; david.march...@redhat.com > > Subject: [EXT] Re: [PATCH v2] devtools: ensure proper tag sequence > > > > External Email > > > > -- > > 14/06/2022 00:21, jpali...@marvell.com: > > > + SEQ[0] = "Coverity issue"; > > > + SEQ[1] = "Bugzilla ID"; > > > + SEQ[2] = "Fixes"; > > > + SEQ[3] = "Cc"; > > > + SEQ[4] = "^$"; > > > + SEQ[5] = "Suggested-by"; > > > + SEQ[6] = "Reported-by"; > > > + SEQ[7] = "Signed-off-by"; > > > + latest = 0; > > > > Do you think you could check that Review, Ack and Test are added after the > > first Signed-off? > > Ok, thank you for clarification. So any further Signed-offs need not to be in > a particular > sequence as long as the first one precedes Review/Ack/Test (these 3 in any > order). Yes > > > +Tag order > > > +~ > > > + > > > +There is a pattern indicating how certain tags should relate to each > > > other. > > > + > > > +Example of proper tag sequence:: > > > + > > > + Coverity issue: > > > + Bugzilla ID: > > > + Fixes: > > > + Cc: > > > + > > > + Suggested-by: > > > + Reported-by: > > > + Signed-off-by: > > > > Given it is an example, you could add Reviewed-by, Acked-by and Tested-by. > > Sure, I will restore it. > > > > + > > > +Between first and second tag section there is and empty line. > > > + > > > +While ``Signed-off-by:`` is an obligatory tag and must exists in each > > > +commit, all other tags are optional. Any tag, as long as it is in > > > +proper location to other adjacent tags (if present), may occur multiple > > times. > > > + > > > +Other tags shall be laid out in a chronological order. > > > > Yes, after the first Signed-off-by. > > Side question: what about Sponsored-by, I noticed it appeared just recently? It is a way to add the name of a company wishing to appear. There is no constraint on it, it can be anywhere I think. If it is in chronological order, it looks better before the Signed-off. I think we should not advertise it here, because real names are more useful.