Re: [PATCH v5 1/2] ethdev: Add link_speed lanes support

2024-09-24 Thread Damodharam Ammepalli
On Sun, Sep 22, 2024 at 10:02 AM Ferruh Yigit  wrote:
>
> On 9/4/2024 6:50 PM, Damodharam Ammepalli wrote:
> > Update the eth_dev_ops structure with new function vectors
> > to get, get capabilities and set ethernet link speed lanes.
> > Update the testpmd to provide required config and information
> > display infrastructure.
> >
> > The supporting ethernet controller driver will register callbacks
> > to avail link speed lanes config and get services. This lanes
> > configuration is applicable only when the nic is forced to fixed
> > speeds. In Autonegiation mode, the hardware automatically
> > negotiates the number of lanes.
> >
> > These are the new commands.
> >
> > testpmd> show port 0 speed_lanes capabilities
> >
> >  Supported speeds Valid lanes
> > ---
> >  10 Gbps  1
> >  25 Gbps  1
> >  40 Gbps  4
> >  50 Gbps  1 2
> >  100 Gbps 1 2 4
> >  200 Gbps 2 4
> >  400 Gbps 4 8
> > testpmd>
> >
> > testpmd>
> > testpmd> port stop 0
> > testpmd> port config 0 speed_lanes 4
> > testpmd> port config 0 speed 20 duplex full
> > testpmd> port start 0
> > testpmd>
> > testpmd> show port info 0
> >
> > * Infos for port 0  *
> > MAC address: 14:23:F2:C3:BA:D2
> > Device name: :b1:00.0
> > Driver name: net_bnxt
> > Firmware-version: 228.9.115.0
> > Connect to socket: 2
> > memory allocation on the socket: 2
> > Link status: up
> > Link speed: 200 Gbps
> > Active Lanes: 4
> > Link duplex: full-duplex
> > Autoneg status: Off
> >
> > Signed-off-by: Damodharam Ammepalli 
> > ---
> >  app/test-pmd/cmdline.c | 248 -
> >  app/test-pmd/config.c  |   4 +
> >  lib/ethdev/ethdev_driver.h |  91 ++
> >  lib/ethdev/rte_ethdev.c|  52 
> >  lib/ethdev/rte_ethdev.h|  95 ++
> >  lib/ethdev/version.map |   5 +
> >  6 files changed, 494 insertions(+), 1 deletion(-)
> >
> > diff --git a/app/test-pmd/cmdline.c b/app/test-pmd/cmdline.c
> > index b7759e38a8..643102032e 100644
> > --- a/app/test-pmd/cmdline.c
> > +++ b/app/test-pmd/cmdline.c
> > @@ -284,6 +284,9 @@ static void cmd_help_long_parsed(void *parsed_result,
> >
> >   "dump_log_types\n"
> >   "Dumps the log level for all the dpdk modules\n\n"
> > +
> > + "show port (port_id) speed_lanes capabilities"
> > + "   Show speed lanes capabilities of a port.\n\n"
> >   );
> >   }
> >
> > @@ -823,6 +826,9 @@ static void cmd_help_long_parsed(void *parsed_result,
> >   "port config (port_id) txq (queue_id) affinity 
> > (value)\n"
> >   "Map a Tx queue with an aggregated port "
> >   "of the DPDK port\n\n"
> > +
> > + "port config (port_id|all) speed_lanes (0|1|4|8)\n"
> >
>
> This help string, and the implementation, implies there has to be fixed
> lane values, like 1, 4, 8. Why not leave this part to the capability
> reporting, and not limit (and worry) those values in the command, so
> from command's perspective it can be only .
>

ok, will update the help string to 

> > + "Set number of lanes for all ports or port_id for 
> > a forced speed\n\n"
> >   );
> >   }
> >
> > @@ -1560,6 +1566,244 @@ static cmdline_parse_inst_t 
> > cmd_config_speed_specific = {
> >   },
> >  };
> >
> > +static int
> > +parse_speed_lanes_cfg(portid_t pid, uint32_t lanes)
> > +{
> > + int ret;
> > +
> > + ret = rte_eth_speed_lanes_set(pid, lanes);
> >
>
> As a sample application usage, I think it is better if it gets the
> capability and verify that 'lanes' is withing the capability and later
> set it, what do you think?
>
> <...>

Makes sense, will try out and get back to you soon.


> > diff --git a/lib/ethdev/rte_ethdev.h b/lib/ethdev/rte_ethdev.h> index
> 548fada1c7..9444e0a836 100644
> > --- a/lib/ethdev/rte_ethdev.h
> > +++ b/lib/ethdev/rte_ethdev.h
> > @@ -357,6 +357,27 @@ struct rte_eth_link {
> >  #define RTE_ETH_LINK_MAX_STR_LEN 40 /**< Max length of default link 
> > string. */
> >  /**@}*/
> >
> > +/**
> > + * Constants used to indicate the possible link speed lanes of an ethdev 
> > port.
> > + */
> > +#define RTE_ETH_SPEED_LANE_UNKNOWN  0  /**< speed lanes unsupported mode 
> > or default */
> > +#define RTE_ETH_SPEED_LANE_1  1/**< Link speed lane  1 */
> > +#define RTE_ETH_SPEED_LANE_2  2/**< Link speed lanes 2 */
> > +#define RTE_ETH_SPEED_LANE_4  4/**< Link speed lanes 4 */
> > +#define RTE_ETH_SPEED_LANE_8  8/**< Link speed lanes 8 */
> > +
> > +/* Translate from link speed lanes to speed lanes capa */
> > +#define RTE_ETH_SPEED_LANES_TO_CAPA(x) RTE_BIT32(x)
> > +
> > +/* This macro indicates link speed lanes capa mask */
> > +#define RTE_ETH

Re: [PATCH] eal: fix device unregister for event registered with device_name NULL

2024-09-24 Thread Stephen Hemminger
On Thu, 18 Jul 2024 12:37:28 -0700
lon...@linuxonhyperv.com wrote:

> From: Malcolm Bumgardner 
> 
> In the device event unregister code, it unconditionally remove all
> callbacks which are registered with device_name set to NULL. This results
> in many callbacks uncorrectly removed.
> 
> Fix this by only removing callbacks with matching cb_fn and cb_arg.
> 
> Signed-off-by: Malcolm Bumgardner 
> 
> Fixes: a753e53d517b ("eal: add device event monitor framework")
> Cc: sta...@dpdk.org
> Signed-off-by: Long Li 
> ---
>  lib/eal/common/eal_common_dev.c | 13 +++--
>  1 file changed, 7 insertions(+), 6 deletions(-)

This patch solves the problem in the netvsc driver.
The netvsc driver registers a device event callback to discover when the VF
is added/removed. When the netvsc driver is stoppped, it unregisters this 
callback.
The bug is that the rte_dev code would end up removing all callbacks, including
the callback for other netvsc devices.

Perhaps a bugzilla entry or more direct example would have gotten more attention
on this.

Reviewed-by: Stephen Hemminger 


RE: [PATCH v4] eal: add build-time option to omit trace

2024-09-24 Thread Morten Brørup
> From: Jerin Jacob [mailto:jerinjac...@gmail.com]
> Sent: Tuesday, 24 September 2024 11.42
> 
> On Tue, Sep 24, 2024 at 7:10 PM Morten Brørup 
> wrote:
> >
> > Some applications want to omit the trace feature.
> > Either to reduce the memory footprint, to reduce the exposed attack
> > surface, or for other reasons.
> >
> > This patch adds an option in rte_config.h to include or omit trace in
> the
> > build. Trace is included by default.
> >
> > Omitting trace works by omitting all trace points.
> > For API and ABI compatibility, the trace feature itself remains.
> >
> > Signed-off-by: Morten Brørup 
> > ---
> > v4:
> > * Added check for generic trace enabled when registering trace points,
> in
> >   RTE_INIT. (Jerin Jacob)
> > * Test application uses function instead of macro to check if generic
> >   trace is enabled. (Jerin Jacob)
> > * Performance test application uses function to check if generic trace
> is
> >   enabled.
> > v3:
> > * Simpler version with much fewer ifdefs. (Jerin Jacob)
> > v2:
> > * Added/modified macros required for building applications with trace
> >   omitted.
> 
> >
> > +/**
> > + * @internal
> 
> Since it is used in app/test. Can we remove it as internal and make
> symbol as rte_trace_point_is_enabled

I don't think we should make functions public if only used for test purposes.

Do you think it is useful for normal usage too? Does rte_trace_is_enabled() not 
suffice?

> 
> > + *
> > + * Test if the tracepoint compile-time option is enabled.
> > + *
> > + * @return
> > + *   true if tracepoint enabled, false otherwise.
> > + */
> > +__rte_experimental
> > +static __rte_always_inline bool
> > +__rte_trace_point_generic_is_enabled(void)
> 
> Do we need to keep _generic_

Other internal functions have _generic_ too, so I added it.
If we decide to make it public, I agree _generic_ should be removed.

> 
> Rest looks good to me.


Re: [PATCH v2] dts: add VLAN methods to testpmd shell

2024-09-24 Thread Juraj Linkeš
We should bring up the exact names of method used in this patch and talk 
through them in the call.



On 18. 9. 2024 21:41, Dean Marx wrote:

added the following methods to testpmd shell class:


Capitalize please.


vlan set filter on/off, rx vlan add/rm,
vlan set strip on/off, tx vlan set/reset,
set promisc/verbose

Fixes: 61d5bc9bf974 ("dts: add port info command to testpmd shell")


Is the fix related to the rest of the changes? It looks like it is. The 
relation to the other changes should be explained in the commit message 
as well as what the fix fixes (or in which cases the original 
implementation didn't work properly).




Signed-off-by: Dean Marx 
---
  dts/framework/remote_session/testpmd_shell.py | 175 +-
  1 file changed, 174 insertions(+), 1 deletion(-)

diff --git a/dts/framework/remote_session/testpmd_shell.py 
b/dts/framework/remote_session/testpmd_shell.py
index 8c228af39f..5c5e681841 100644
--- a/dts/framework/remote_session/testpmd_shell.py
+++ b/dts/framework/remote_session/testpmd_shell.py
@@ -102,7 +102,7 @@ def make_parser(cls) -> ParserFn:
  r"strip (?Pon|off), "
  r"filter (?Pon|off), "
  r"extend (?Pon|off), "
-r"qinq strip (?Pon|off)$",
+r"qinq strip (?Pon|off)",
  re.MULTILINE,
  named=True,
  ),
@@ -982,6 +982,179 @@ def set_port_mtu_all(self, mtu: int, verify: bool = True) 
-> None:
  for port in self.ports:
  self.set_port_mtu(port.id, mtu, verify)
  
+def vlan_filter_set(self, port: int, on: bool, verify: bool = True) -> None:


The names should follow the same naming scheme. I think the scheme says 
it doesn't necessarily have to follow testpmd commands, so I'd say let's 
name it the same way as the mtu methods - set_vlan_filter.



+"""Set vlan filter on.


Out of curiosity, what is vlan filter?


+
+Args:
+port: The port number to enable VLAN filter on, should be within 
0-32.


Where is this 0-32 coming from? And why 32 and not 31?


+on: Sets filter on if :data:`True`, otherwise turns off.


I'd use third person and add the port:
Enable the filter on `port` if :data:`True`, otherwise disable it.


+verify: If :data:`True`, the output of the command and show port 
info
+is scanned to verify that vlan filtering was enabled 
successfully.
+If not, it is considered an error.


This is not 100% clear whether this means if False or if not enabled 
successfully. Looks like this should be updated everywhere.



+
+Raises:
+InteractiveCommandExecutionError: If `verify` is :data:`True` and 
the filter
+fails to update.
+"""
+filter_cmd_output = self.send_command(f"vlan set filter {'on' if on else 
'off'} {port}")
+if verify:
+vlan_settings = self.show_port_info(port_id=port).vlan_offload
+if on ^ (vlan_settings is not None and VLANOffloadFlag.FILTER in 
vlan_settings):


This is an awesome condition.


+self._logger.debug(f"Failed to set filter on port {port}: 
\n{filter_cmd_output}")
+raise InteractiveCommandExecutionError(
+f"Testpmd failed to set VLAN filter on port {port}."
+)


We should say whether we're enabling or disabling the filter in both the 
log and error. Does filter_cmd_output contain this? What about the other 
commands (every one of them except verbose_output)?



+
+def rx_vlan(self, vlan: int, port: int, add: bool, verify: bool = True) -> 
None:


We have set as the prefix for boolean configs and configs that only set 
the value, here we probabaly want add_del_rx_vlan. We really should talk 
about these names in the call.



+"""Add specified vlan tag to the filter list on a port.


The command doesn't mention the filter. Does this mean the filter needs 
to be enabled before we can config vlans on the port?



+
+Args:
+vlan: The vlan tag to add, should be within 1-1005, 1-4094 
extended.


The method mentions extended vlans only in this place. It's not clear 
when can we use extended vlans (where is it configured or enforced).



+port: The port number to add the tag on, should be within 0-32.
+add: Adds the tag if :data:`True`, otherwise removes tag.


removes the tag


+verify: If :data:`True`, the output of the command is scanned to 
verify that
+the vlan tag was added to the filter list on the specified 
port. If not, it is
+considered an error.> +
+Raises:
+InteractiveCommandExecutionError: If `verify` is :data:`True` and 
the tag
+is not added.


I guess this would also be raised if removal is unsuccessful.


+"""
+rx_output = self.send_command(f"rx_vlan {'add' if add else 'rm'} {vlan} 
{port}")


As Patrick 

[Azure] [ConnectX-3] net_mlx4: cannot access device, is mlx4_ib loaded?

2024-09-24 Thread Rajasekhar Pulluru
Hi Team,
Unable to attach ConnectX-3 mlx4 interfaces(uio_hv_generic) in DPDK.
testpmd application reports below errors. mlx4_ib is loaded. Not sure if I
am missing anything here. Appreciate any help to resolve this.
































*EAL: Detected CPU lcores: 8EAL: Detected NUMA nodes: 1EAL: Detected static
linkage of DPDKEAL: Multi-process socket /var/run/dpdk/rte/mp_socketEAL:
Selected IOVA mode 'PA'EAL: Probe PCI driver: net_mlx4 (15b3:1004) device:
44be:00:02.0 (socket 0)net_mlx4: cannot access device, is mlx4_ib
loaded?EAL: Requested device 44be:00:02.0 cannot be usedEAL: Probe PCI
driver: net_mlx4 (15b3:1004) device: 9d69:00:02.0 (socket 0)net_mlx4:
cannot access device, is mlx4_ib loaded?EAL: Requested device 9d69:00:02.0
cannot be usedEAL: Probe PCI driver: net_mlx4 (15b3:1004) device:
a5e7:00:02.0 (socket 0)net_mlx4: cannot access device, is mlx4_ib
loaded?EAL: Requested device a5e7:00:02.0 cannot be usedEAL: Probe PCI
driver: net_mlx4 (15b3:1004) device: f25c:00:02.0 (socket 0)net_mlx4:
cannot access device, is mlx4_ib loaded?EAL: Requested device f25c:00:02.0
cannot be usedhn_vf_attach(): Couldn't find port for VFhn_vf_add(): RNDIS
reports VF but device not found, retryinghn_vf_attach(): Couldn't find port
for VFhn_vf_add(): RNDIS reports VF but device not found,
retryinghn_vf_attach(): Couldn't find port for VFhn_vf_add(): RNDIS reports
VF but device not found, retryingTELEMETRY: No legacy callbacks, legacy
socket not createdtestpmd: create a new mbuf pool : n=171456,
size=2176, socket=0testpmd: preferred mempool ops selected:
ring_mp_mchn_vf_attach(): Couldn't find port for VFhn_vf_add(): RNDIS
reports VF but device not found, retryinghn_vf_attach(): Couldn't find port
for VFhn_vf_add(): RNDIS reports VF but device not found, retryingEAL:
Error - exiting with code: 1  Cause: Creation of mbuf pool for socket 0
failed: Cannot allocate memory*

Thanks & Regards,
Rajasekhar


[PATCH v3 2/2] dts: add binding to different drivers to TG node

2024-09-24 Thread jspewock
From: Jeremy Spewock 

The DTS framework in its current state supports binding ports to
different drivers on the SUT node but not the TG node. The TG node
already has the information that it needs about the different drivers
that it has available in the configuration file, but it did not
previously have access to the devbind script, so it did not use that
information for anything.

This patch moves the location of the tmp directory as well as the method
for binding ports into the node class rather than the SUT node class and
adds an abstract method for getting the path to the devbind script into
the node class. Then, binding ports to the correct drivers is moved into
the build target setup and run on both nodes.

Bugzilla ID: 1420

Signed-off-by: Jeremy Spewock 
---
 dts/framework/runner.py |  2 +
 dts/framework/testbed_model/node.py | 49 -
 dts/framework/testbed_model/sut_node.py | 49 -
 dts/framework/testbed_model/tg_node.py  | 21 +++
 dts/framework/utils.py  |  2 +
 5 files changed, 88 insertions(+), 35 deletions(-)

diff --git a/dts/framework/runner.py b/dts/framework/runner.py
index ab98de8353..4c884fbcd4 100644
--- a/dts/framework/runner.py
+++ b/dts/framework/runner.py
@@ -486,6 +486,7 @@ def _run_build_target(
 
 try:
 sut_node.set_up_build_target(build_target_config)
+tg_node.set_up_build_target(build_target_config)
 self._result.dpdk_version = sut_node.dpdk_version
 
build_target_result.add_build_target_info(sut_node.get_build_target_info())
 build_target_result.update_setup(Result.PASS)
@@ -500,6 +501,7 @@ def _run_build_target(
 try:
 self._logger.set_stage(DtsStage.build_target_teardown)
 sut_node.tear_down_build_target()
+tg_node.tear_down_build_target()
 build_target_result.update_teardown(Result.PASS)
 except Exception as e:
 self._logger.exception("Build target teardown failed.")
diff --git a/dts/framework/testbed_model/node.py 
b/dts/framework/testbed_model/node.py
index 12a40170ac..aa26f2a2a7 100644
--- a/dts/framework/testbed_model/node.py
+++ b/dts/framework/testbed_model/node.py
@@ -13,11 +13,18 @@
 The :func:`~Node.skip_setup` decorator can be used without subclassing.
 """
 
-from abc import ABC
+
+from abc import ABC, abstractmethod
 from ipaddress import IPv4Interface, IPv6Interface
+from pathlib import PurePath
 from typing import Any, Callable, Union
 
-from framework.config import OS, NodeConfiguration, TestRunConfiguration
+from framework.config import (
+OS,
+BuildTargetConfiguration,
+NodeConfiguration,
+TestRunConfiguration,
+)
 from framework.exception import ConfigurationError
 from framework.logger import DTSLogger, get_dts_logger
 from framework.settings import SETTINGS
@@ -58,8 +65,10 @@ class Node(ABC):
 lcores: list[LogicalCore]
 ports: list[Port]
 _logger: DTSLogger
+_tmp_dir: PurePath
 _other_sessions: list[OSSession]
 _test_run_config: TestRunConfiguration
+_path_to_devbind_script: PurePath | None
 
 def __init__(self, node_config: NodeConfiguration):
 """Connect to the node and gather info during initialization.
@@ -88,6 +97,8 @@ def __init__(self, node_config: NodeConfiguration):
 
 self._other_sessions = []
 self._init_ports()
+self._tmp_dir = self.main_session.get_remote_tmp_dir()
+self._path_to_devbind_script = None
 
 def _init_ports(self) -> None:
 self.ports = [Port(self.name, port_config) for port_config in 
self.config.ports]
@@ -95,6 +106,11 @@ def _init_ports(self) -> None:
 for port in self.ports:
 self.configure_port_state(port)
 
+@property
+@abstractmethod
+def path_to_devbind_script(self) -> PurePath:
+"""The path to the dpdk-devbind.py script on the node."""
+
 def set_up_test_run(self, test_run_config: TestRunConfiguration) -> None:
 """Test run setup steps.
 
@@ -114,6 +130,20 @@ def tear_down_test_run(self) -> None:
 Additional steps can be added by extending the method in subclasses 
with the use of super().
 """
 
+def set_up_build_target(self, build_target_config: 
BuildTargetConfiguration) -> None:
+"""Bind ports to their DPDK drivers.
+
+Args:
+build_target_config: The build target test run configuration 
according to which
+the setup steps will be taken. This is unused in this method, 
but subclasses that
+extend this method may need it.
+"""
+self.bind_ports_to_driver()
+
+def tear_down_build_target(self) -> None:
+"""Bind ports to their OS drivers."""
+self.bind_ports_to_driver(for_dpdk=False)
+
 def create_session(self, name: str) -> OSSession:
 """Create and return a new OS-aware remote session.
 
@@

[PATCH v3 0/2] dts: add driver binding on TG

2024-09-24 Thread jspewock
From: Jeremy Spewock 

v3:
 * removed _remote from _remote_tmp_dir in node
 * switched to using Path where appropriate
 * added symlink to dpdk-devbind to dts/ and excluded from the
   the formatting script

Jeremy Spewock (2):
  dts: add synbolic link to dpdk-devbind script
  dts: add binding to different drivers to TG node

 devtools/dts-check-format.sh|  9 +++--
 dts/dpdk-devbind.py |  1 +
 dts/framework/runner.py |  2 +
 dts/framework/testbed_model/node.py | 49 -
 dts/framework/testbed_model/sut_node.py | 49 -
 dts/framework/testbed_model/tg_node.py  | 21 +++
 dts/framework/utils.py  |  2 +
 7 files changed, 94 insertions(+), 39 deletions(-)
 create mode 12 dts/dpdk-devbind.py

-- 
2.46.0



[PATCH v3 1/2] dts: add symbolic link to dpdk-devbind script

2024-09-24 Thread jspewock
From: Jeremy Spewock 

The devbind script is used throughout DTS to manage drivers on the
remote hosts. Currently, the only way to copy this script onto a host
is to either copy the entire DPDK directory onto a host, or reach out
of the dts directory into its parent DPDK directory to access the
script. The first is undesirable if the host doesn't require any other
DPDK tools since you would be copying extra unneeded information and
the second is undesirable since it enforces the assumption that DTS is
being run from within the DPDK directory. To solve this issue a symbolic
link is added which links the devbind script from the parent into the
DTS directory.

Since this file is not part of DTS and therefore is not expected to
follow DTS formatting rules, it is excluded from the DTS formatting
script.

Signed-off-by: Jeremy Spewock 
---
 devtools/dts-check-format.sh | 9 +
 dts/dpdk-devbind.py  | 1 +
 2 files changed, 6 insertions(+), 4 deletions(-)
 create mode 12 dts/dpdk-devbind.py

diff --git a/devtools/dts-check-format.sh b/devtools/dts-check-format.sh
index 3f43e17e88..adc199d34e 100755
--- a/devtools/dts-check-format.sh
+++ b/devtools/dts-check-format.sh
@@ -13,6 +13,7 @@ usage() {
 format=true
 lint=true
 typecheck=true
+ignore="dpdk-devbind.py"
 
 # Comments after args serve as documentation; must be present
 while getopts "hflt" arg; do
@@ -54,14 +55,14 @@ if $format; then
heading "Formatting in $directory/"
if command -v black > /dev/null; then
echo "Formatting code with black:"
-   black .
+   black --exclude "$ignore" .
else
echo "black is not installed, not formatting"
errors=$((errors + 1))
fi
if command -v isort > /dev/null; then
echo "Sorting imports with isort:"
-   isort .
+   isort --skip "$ignore" .
else
echo "isort is not installed, not sorting 
imports"
errors=$((errors + 1))
@@ -90,7 +91,7 @@ if $lint; then
fi
heading "Linting in $directory/"
if command -v pylama > /dev/null; then
-   pylama .
+   pylama --skip "$ignore" .
errors=$((errors + $?))
else
echo "pylama not found, unable to run linter"
@@ -104,7 +105,7 @@ if $typecheck; then
fi
heading "Checking types in $directory/"
if command -v mypy > /dev/null; then
-   mypy .
+   mypy --exclude "$ignore" .
errors=$((errors + $?))
else
echo "mypy not found, unable to check types"
diff --git a/dts/dpdk-devbind.py b/dts/dpdk-devbind.py
new file mode 12
index 00..9d042fad14
--- /dev/null
+++ b/dts/dpdk-devbind.py
@@ -0,0 +1 @@
+../usertools/dpdk-devbind.py
\ No newline at end of file
-- 
2.46.0



Re: [PATCH v4] eal: add build-time option to omit trace

2024-09-24 Thread Stephen Hemminger
On Tue, 24 Sep 2024 13:39:57 +
Morten Brørup  wrote:

> Some applications want to omit the trace feature.
> Either to reduce the memory footprint, to reduce the exposed attack
> surface, or for other reasons.
> 
> This patch adds an option in rte_config.h to include or omit trace in the
> build. Trace is included by default.
> 
> Omitting trace works by omitting all trace points.
> For API and ABI compatibility, the trace feature itself remains.
> 
> Signed-off-by: Morten Brørup 

This is good compact solution. In future, it would be nice if DPDK could
use static keys to isolate rarely used features. But static keys require
runtime modification of instructions.

https://docs.kernel.org/6.7/staging/static-keys.html

Acked-by: Stephen Hemminger 


Re: [PATCH v2 1/1] dts: add binding to different drivers to TG node

2024-09-24 Thread Juraj Linkeš




On 24. 9. 2024 15:57, Jeremy Spewock wrote:

On Tue, Sep 24, 2024 at 5:12 AM Juraj Linkeš  wrote:


I have some thoughts for the future:
1a. The traffic generator is specified per-node, so maybe we could also
change the binding to be for the whole lifetime of the TG node,
1b. But the same is true for the SUT node as well, right? After we do
the port update (with kernel driver), we can just bind to DPDK driver.
With SUT in the mix, this looks like a change for a different patch,


Right, these are good points. A good observation too that we only
really need the kernel driver at the start in both cases. You had
mentioned in your previous comments as well that we should only be
binding on the TG once per lifetime, but I ended up not adding it for
that very reason of I still wanted the binding to be in Node, but I
didn't want to change the process for the SUT.


2. We could add a symlink to the devbind script with the target being in
the dts directory. This way we don't have to go outside the dts
directory and if DTS ever become a python package, we could just copy
the script to the appropriate place. This is also something we don't
really need to do.


I like this idea a lot actually. It feels very weird to me having to
step out of the DTS directory and I like the idea of keeping it
together like it were a package (even if it isn't yet).



Ok, you can add that to the next version.


diff --git a/dts/framework/testbed_model/node.py 
b/dts/framework/testbed_model/node.py



@@ -58,8 +65,10 @@ class Node(ABC):
   lcores: list[LogicalCore]
   ports: list[Port]
   _logger: DTSLogger
+_remote_tmp_dir: PurePath
   _other_sessions: list[OSSession]
   _test_run_config: TestRunConfiguration
+_path_to_devbind_script: PurePath | None


A note on the naming. We have _remote_tmp_dir and
_path_to_devbind_script. Both are pointing to a remote file/dir, but
only one has the _remote prefix. They should probably be unified.


I didn't think of this but you're right, the two are very similar but
named differently.



I've thought a bit about what the right name is. Dropping the prefix
makes sense; sut_node.tmp_dir should mean the tmp dir on the SUT node
(which would make it remote from the execution host's point of view, but
not the node's view; the file is local to SUT node). This could be a
good separate patch (improving the remote/local naming scheme to make it
consistent and as sensible as possible).


I also like the sound of it without the prefix and how it actually has
a more fitting meaning from the two perspectives. I agree that there
is probably some other work to be done on this in another patch, but
since I am moving the _remote_tmp_dir variable around anyway I think
it wouldn't hurt for me to rename it.



Yes, we should pick one naming convention to be consistent in this patch 
and we can have a broader (framework-wide) look at this in a separate patch.






Re: [PATCH v3 0/2] vhost: add VDUSE reconnection support

2024-09-24 Thread Maxime Coquelin




On 9/23/24 21:51, Maxime Coquelin wrote:

This series adds support for VDUSE reconnection.

First patch introduces the reconnection file layout and
track the virtqueues available index updates in the
datapath and control queue.

Second patch adds VDUSE reconnect intialization and some
sanity checks to prevent incompatible reconnections.

Changes in v3:
==
- Fixed missing avail index updates (David)
- Fixed typos in commit message (David)
- Applied R-by's

Changes in v2:
==
- Added more sanity checks at reconnection
- Improve versionning
- Fix error loggin (Chenbo)
- Clarify why offloading device start is required (Chenbo)
- Change runtime path to /vduse instead of /dpdk/vduse

Maxime Coquelin (2):
   vhost: add logging mechanism for reconnection
   vhost: add reconnection support to VDUSE

  lib/vhost/vduse.c   | 308 +++-
  lib/vhost/vhost.c   |   2 +
  lib/vhost/vhost.h   |  41 -
  lib/vhost/vhost_user.c  |   4 +
  lib/vhost/virtio_net.c  |   8 +
  lib/vhost/virtio_net_ctrl.c |   2 +
  6 files changed, 322 insertions(+), 43 deletions(-)



Applied to next-virtio/for-next-net



[PATCH] net/r8169: add phy registers access routines

2024-09-24 Thread Howard Wang
Signed-off-by: Howard Wang 
---
 drivers/net/r8169/r8169_ethdev.h |   1 +
 drivers/net/r8169/r8169_phy.c| 219 +++
 drivers/net/r8169/r8169_phy.h|  18 +++
 3 files changed, 238 insertions(+)

diff --git a/drivers/net/r8169/r8169_ethdev.h b/drivers/net/r8169/r8169_ethdev.h
index 20dbf06c9b..9656a26eb0 100644
--- a/drivers/net/r8169/r8169_ethdev.h
+++ b/drivers/net/r8169/r8169_ethdev.h
@@ -18,6 +18,7 @@ struct rtl_hw {
u8  *mmio_addr;
u32 mcfg;
u8  HwSuppIntMitiVer;
+   u16 cur_page;
 
/* Enable Tx No Close */
u8 EnableTxNoClose;
diff --git a/drivers/net/r8169/r8169_phy.c b/drivers/net/r8169/r8169_phy.c
index f0a880eeca..cfec426ee1 100644
--- a/drivers/net/r8169/r8169_phy.c
+++ b/drivers/net/r8169/r8169_phy.c
@@ -39,3 +39,222 @@ rtl_set_mac_ocp_bit(struct rtl_hw *hw, u16 addr, u16 mask)
rtl_clear_set_mac_ocp_bit(hw, addr, 0, mask);
 }
 
+static u16
+rtl_map_phy_ocp_addr(u16 PageNum, u8 RegNum)
+{
+   u8 ocp_reg_num = 0;
+   u16 ocp_page_num = 0;
+   u16 ocp_phy_address = 0;
+
+   if (PageNum == 0) {
+   ocp_page_num = OCP_STD_PHY_BASE_PAGE + (RegNum / 8);
+   ocp_reg_num = 0x10 + (RegNum % 8);
+   } else {
+   ocp_page_num = PageNum;
+   ocp_reg_num = RegNum;
+   }
+
+   ocp_page_num <<= 4;
+
+   if (ocp_reg_num < 16)
+   ocp_phy_address = 0;
+   else {
+   ocp_reg_num -= 16;
+   ocp_reg_num <<= 1;
+
+   ocp_phy_address = ocp_page_num + ocp_reg_num;
+   }
+
+   return ocp_phy_address;
+}
+
+static u32
+rtl_mdio_real_read_phy_ocp(struct rtl_hw *hw, u32 RegAddr)
+{
+   u32 data32;
+   int i, value = 0;
+
+   data32 = RegAddr / 2;
+   data32 <<= OCPR_Addr_Reg_shift;
+
+   RTL_W32(hw, PHYOCP, data32);
+   for (i = 0; i < 100; i++) {
+   udelay(1);
+
+   if (RTL_R32(hw, PHYOCP) & OCPR_Flag)
+   break;
+   }
+   value = RTL_R32(hw, PHYOCP) & OCPDR_Data_Mask;
+
+   return value;
+}
+
+u32
+rtl_mdio_direct_read_phy_ocp(struct rtl_hw *hw, u32 RegAddr)
+{
+   return rtl_mdio_real_read_phy_ocp(hw, RegAddr);
+}
+
+static u32
+rtl_mdio_read_phy_ocp(struct rtl_hw *hw, u16 PageNum, u32 RegAddr)
+{
+   u16 ocp_addr;
+
+   ocp_addr = rtl_map_phy_ocp_addr(PageNum, RegAddr);
+
+   return rtl_mdio_direct_read_phy_ocp(hw, ocp_addr);
+}
+
+static u32
+rtl_mdio_real_read(struct rtl_hw *hw, u32 RegAddr)
+{
+   return rtl_mdio_read_phy_ocp(hw, hw->cur_page, RegAddr);
+}
+
+static void
+rtl_mdio_real_write_phy_ocp(struct rtl_hw *hw, u32 RegAddr, u32 value)
+{
+   u32 data32;
+   int i;
+
+   data32 = RegAddr / 2;
+   data32 <<= OCPR_Addr_Reg_shift;
+   data32 |= OCPR_Write | value;
+
+   RTL_W32(hw, PHYOCP, data32);
+   for (i = 0; i < 100; i++) {
+   udelay(1);
+
+   if (!(RTL_R32(hw, PHYOCP) & OCPR_Flag))
+   break;
+   }
+}
+
+void
+rtl_mdio_direct_write_phy_ocp(struct rtl_hw *hw, u32 RegAddr, u32 value)
+{
+   rtl_mdio_real_write_phy_ocp(hw, RegAddr, value);
+}
+
+static void
+rtl_mdio_write_phy_ocp(struct rtl_hw *hw, u16 PageNum, u32 RegAddr, u32 value)
+{
+   u16 ocp_addr;
+
+   ocp_addr = rtl_map_phy_ocp_addr(PageNum, RegAddr);
+
+   rtl_mdio_direct_write_phy_ocp(hw, ocp_addr, value);
+}
+
+static void
+rtl_mdio_real_write(struct rtl_hw *hw, u32 RegAddr, u32 value)
+{
+   if (RegAddr == 0x1F)
+   hw->cur_page = value;
+   rtl_mdio_write_phy_ocp(hw, hw->cur_page, RegAddr, value);
+}
+
+u32
+rtl_mdio_read(struct rtl_hw *hw, u32 RegAddr)
+{
+   return rtl_mdio_real_read(hw, RegAddr);
+}
+
+void
+rtl_mdio_write(struct rtl_hw *hw, u32 RegAddr, u32 value)
+{
+   rtl_mdio_real_write(hw, RegAddr, value);
+}
+
+void
+rtl_clear_and_set_eth_phy_ocp_bit(struct rtl_hw *hw, u16 addr, u16 clearmask,
+  u16 setmask)
+{
+   u16 phy_reg_value;
+
+   phy_reg_value = rtl_mdio_direct_read_phy_ocp(hw, addr);
+   phy_reg_value &= ~clearmask;
+   phy_reg_value |= setmask;
+   rtl_mdio_direct_write_phy_ocp(hw, addr, phy_reg_value);
+}
+
+void
+rtl_clear_eth_phy_ocp_bit(struct rtl_hw *hw, u16 addr, u16 mask)
+{
+   rtl_clear_and_set_eth_phy_ocp_bit(hw, addr, mask, 0);
+}
+
+void
+rtl_set_eth_phy_ocp_bit(struct rtl_hw *hw, u16 addr, u16 mask)
+{
+   rtl_clear_and_set_eth_phy_ocp_bit(hw, addr, 0, mask);
+}
+
+void
+rtl_ephy_write(struct rtl_hw *hw, int addr, int value)
+{
+   int i;
+
+   RTL_W32(hw, EPHYAR, EPHYAR_Write |
+   (addr & EPHYAR_Reg_Mask_v2) << EPHYAR_Reg_shift |
+   (value & EPHYAR_Data_Mask));
+
+   for (i = 0; i < 10; i++) {
+   udelay(100);
+
+   /* Check if the NIC has completed EPHY write */
+   if (!(RTL_R32(hw, EPHYAR) & EPHYAR_Flag))
+   break;

Re: [PATCH 2/2] timer/linux: override TSC freq if no tsc_known_freq

2024-09-24 Thread Isaac Boukris
Perhaps this was a bit hastily, I got an automated failure report on
"arm64-native-linuxapp-gcc":

https://lab.dpdk.org/results/dashboard/patchsets/31106/

Not sure which patch caused the problem, looking at the kernel code it
looks like 'tsc_known_freq' is only set for x86 arch, but it could be
the other patch and maybe lowering the rounding to 10KHz is too
aggressive, maybe better 100KHz or 1MHz.

As an alternative to 'tsc_known_freq' detection, maybe just provide an
init parameter to set the frequency manually, along with some
known-issue documentation.

Maybe we should just allow to specify the frequency as a parameter at init


Re: [RFC PATCH v7] mempool: fix mempool cache size

2024-09-24 Thread Patrick Robb
Recheck-request: iol-intel-Performance

On Tue, Sep 24, 2024 at 2:12 PM Morten Brørup 
wrote:

> This patch refactors the mempool cache to fix two bugs:
> 1. When a mempool is created with a cache size of N objects, the cache was
> actually created with a size of 1.5 * N objects.
> 2. The mempool cache field names did not reflect their purpose;
> the "flushthresh" field held the size, and the "size" field held the
> number of objects remaining in the cache when returning from a get
> operation refilling it from the backend.
>
> Especially the first item could be fatal:
> When more objects than a mempool's configured cache size is held in the
> mempool's caches associated with other lcores, a rightsized mempool may
> unexpectedly run out of objects, causing the application to fail.
>
> Furthermore, this patch introduces two optimizations:
> 1. The mempool caches are flushed to/filled from the backend in their
> entirety, so backend accesses are CPU cache line aligned. (Assuming the
> mempool cache size is a multiplum of a CPU cache line size divided by the
> size of a pointer.)
> 2. The unlikely paths in the get and put functions, where the cache is
> flushed to/filled from the backend, are moved from the inline functions to
> separate helper functions, thereby reducing the code size of the inline
> functions.
> Note: Accessing the backend for cacheless mempools remains inline.
>
> Various drivers accessing the mempool directly have been updated
> accordingly.
> These drivers did not update mempool statistics when accessing the mempool
> directly, so that is fixed too.
>
> Note: Performance not yet benchmarked.
>
> Signed-off-by: Morten Brørup 
> ---
> v7:
> * Increased max mempool cache size from 512 to 1024 objects.
>   Mainly for CI performance test purposes.
>   Originally, the max mempool cache size was 768 objects, and used a fixed
>   size array of 1024 objects in the mempool cache structure.
> v6:
> * Fix v5 incomplete implementation of passing large requests directly to
>   the backend.
> * Use memcpy instead of rte_memcpy where compiler complains about it.
> * Added const to some function parameters.
> v5:
> * Moved helper functions back into the header file, for improved
>   performance.
> * Pass large requests directly to the backend. This also simplifies the
>   code.
> v4:
> * Updated subject to reflect that misleading names are considered bugs.
> * Rewrote patch description to provide more details about the bugs fixed.
>   (Mattias Rönnblom)
> * Moved helper functions, not to be inlined, to mempool C file.
>   (Mattias Rönnblom)
> * Pass requests for n >= RTE_MEMPOOL_CACHE_MAX_SIZE objects known at build
>   time directly to backend driver, to avoid calling the helper functions.
>   This also fixes the compiler warnings about out of bounds array access.
> v3:
> * Removed __attribute__(assume).
> v2:
> * Removed mempool perf test; not part of patch set.
> ---
>  config/rte_config.h   |   2 +-
>  drivers/common/idpf/idpf_common_rxtx_avx512.c |  54 +---
>  drivers/mempool/dpaa/dpaa_mempool.c   |  16 +-
>  drivers/mempool/dpaa2/dpaa2_hw_mempool.c  |  14 -
>  drivers/net/i40e/i40e_rxtx_vec_avx512.c   |  17 +-
>  drivers/net/iavf/iavf_rxtx_vec_avx512.c   |  27 +-
>  drivers/net/ice/ice_rxtx_vec_avx512.c |  27 +-
>  lib/mempool/mempool_trace.h   |   1 -
>  lib/mempool/rte_mempool.c |  12 +-
>  lib/mempool/rte_mempool.h | 287 --
>  10 files changed, 232 insertions(+), 225 deletions(-)
>
> diff --git a/config/rte_config.h b/config/rte_config.h
> index dd7bb0d35b..2488ff167d 100644
> --- a/config/rte_config.h
> +++ b/config/rte_config.h
> @@ -56,7 +56,7 @@
>  #define RTE_CONTIGMEM_DEFAULT_BUF_SIZE (512*1024*1024)
>
>  /* mempool defines */
> -#define RTE_MEMPOOL_CACHE_MAX_SIZE 512
> +#define RTE_MEMPOOL_CACHE_MAX_SIZE 1024
>  /* RTE_LIBRTE_MEMPOOL_STATS is not set */
>  /* RTE_LIBRTE_MEMPOOL_DEBUG is not set */
>
> diff --git a/drivers/common/idpf/idpf_common_rxtx_avx512.c
> b/drivers/common/idpf/idpf_common_rxtx_avx512.c
> index 3b5e124ec8..98535a48f3 100644
> --- a/drivers/common/idpf/idpf_common_rxtx_avx512.c
> +++ b/drivers/common/idpf/idpf_common_rxtx_avx512.c
> @@ -1024,21 +1024,13 @@ idpf_tx_singleq_free_bufs_avx512(struct
> idpf_tx_queue *txq)
>
> rte_lcore_id());
> void **cache_objs;
>
> -   if (cache == NULL || cache->len == 0)
> -   goto normal;
> -
> -   cache_objs = &cache->objs[cache->len];
> -
> -   if (n > RTE_MEMPOOL_CACHE_MAX_SIZE) {
> -   rte_mempool_ops_enqueue_bulk(mp, (void *)txep, n);
> +   if (!cache || unlikely(n + cache->len > cache->size)) {
> +   rte_mempool_generic_put(mp, (void *)txep, n,
> cache);
> goto done;
> }
>
> -   /* The cache follows the following algorithm
> -

Re: [RFC] ethdev: introduce PTP device capability

2024-09-24 Thread lihuisong (C)

Hi Ferruh,


在 2024/9/23 6:23, Ferruh Yigit 写道:

On 1/30/2024 3:58 AM, Huisong Li wrote:

Currently, the PTP feature is a little messy and has some issue.
1> There is different implementation for PTP. This feature of some
hardware depends on the Rx HW timestamp offload, and use this
offload to detect if enable PTP feature. Others can enable PTP
feature with only ethdev ops.
2> For some drivers, enabling PTP feature also depends on the macro
RTE_LIBRTE_IEEE1588. Actually, whether device support, enable
or disable this feature should not be determined at compilation
time. This has been discussed in thread [1].
3> The user haven't a good way to distinguish which port supports
the PTP feature in the case that a couple of device belong to the
same driver. And PTP related API in ethdev layer doesn't do check
for PTP capability. This has been mentioned and discussed in
thread [2].

In the thread [1], there is a conclusion that remove the compiling
macro RTE_LIBRTE_IEEE1588. And in the thread [2], there is a common
opinion that the RTE_ETH_RX_OFFLOAD_TIMESTAMP cannot be used as the
PTP capability.

For the above issuse, this patch introduces a PTP capability in
rte_eth_dev_info.dev_capa to report PTP capability.

Welcome to jumping into discussion.

[1] 
https://patchwork.dpdk.org/project/dpdk/patch/20230203132810.14187-1-tho...@monjalon.net/
[2] https://inbox.dpdk.org/dev/20230817084226.55327-1-lihuis...@huawei.com/

Signed-off-by: Huisong Li 
---
  lib/ethdev/rte_ethdev.h | 6 ++
  1 file changed, 6 insertions(+)

diff --git a/lib/ethdev/rte_ethdev.h b/lib/ethdev/rte_ethdev.h
index efa4f12b2a..4c8bd691bd 100644
--- a/lib/ethdev/rte_ethdev.h
+++ b/lib/ethdev/rte_ethdev.h
@@ -1613,6 +1613,12 @@ struct rte_eth_conf {
  #define RTE_ETH_DEV_CAPA_FLOW_RULE_KEEP RTE_BIT64(3)
  /** Device supports keeping shared flow objects across restart. */
  #define RTE_ETH_DEV_CAPA_FLOW_SHARED_OBJECT_KEEP RTE_BIT64(4)
+/**
+ * Device supports PTP feature.
+ * For some hardware, this feature also need to set the offload
+ * RTE_ETH_RX_OFFLOAD_TIMESTAMP, please check rte_eth_dev_info.rx_offload_capa.
+ */
+#define RTE_ETH_DEV_CAPA_PTP RTE_BIT64(5)
  /**@}*/
  
  /*

Hi Huisong,

Thanks for the effort, as you mentioned PTP feature can be improved and
there is a target to remove RTE_LIBRTE_IEEE1588 build time macro.

As far as I remember, one of the main reasons of the RTE_LIBRTE_IEEE1588
macro is some HW has resource restrictions, when RTE_LIBRTE_IEEE1588 is
enabled some other features, like vector datapath, are not usable, that
is why this is a build time selection.
I think the main reason that driver don't support PTP feature in vector 
datapath is for performance.
This can be controled by releated dev_ops API or TIMESTAMP offload and 
no need to use RTE_LIBRTE_IEEE1588 macro, like hns3.


And related to the PTP capability, can you please give some more
information, what does device having PTP capability exactly means.

Just the device having PTP capability can be used to enable PTP feature.


PTP is protocol and it is implemented in userspace daemon. I guess even
NIC does not support timestamp offloading, by using time information
from SW it can still implement PTP, right?


AFAIS, PTP feature implement requires the collaboration of HW and SW, as 
describted by the releated dev_ops API.



Is PTP offload means, offloading some part of the protocol communication
withing the device?
Normally, a feature offload means this feature is done in hardware and 
the software doesn't need to something for this.
I reviewed our discussion in [1], we all think it's unreasonable to name 
it "PTP offload".




Btw, the relevant RTE_ETH_RX_OFFLOAD_TIMESTAMP offload is, a little more
generic HW capability that HW can add timestamp to Rx/Tx packets, which
can be used for custom diagnostics. HW supporting this offload means
that HW has some specific clock HW in it.

Yes.


Both having RTE_ETH_RX_OFFLOAD_TIMESTAMP and RTE_ETH_DEV_CAPA_PTP
capability can be confusing, lets clarify it more.


Let me try to clearify them:
1) RTE_ETH_DEV_CAPA_PTP just means the ethdev support PTP capability because of 
application no way to know if the device support PTP feature.
2) As you said above, setting RTE_ETH_RX_OFFLOAD_TIMESTAMP offload is not 
necessary for PTP feature, but also may be for custom diagnostics.
   Some NICs enable PTP feature using only the rte_eth_timesync_xxx API,
   and many NICs also need this offload to fill timestamp in mbuf to cooperate 
with the implementation of PTP feature.




[1] https://inbox.dpdk.org/dev/20230817084226.55327-1-lihuis...@huawei.com/



.


[PATCH] net/ice/base: update README

2024-09-24 Thread Ian Stokes
Update the README in base folder of ice driver to reflect
base code update release date and supported devices.

Signed-off-by: Ian Stokes 
---
 drivers/net/ice/base/README | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/drivers/net/ice/base/README b/drivers/net/ice/base/README
index da065f9be2..7d5a01d223 100644
--- a/drivers/net/ice/base/README
+++ b/drivers/net/ice/base/README
@@ -1,17 +1,18 @@
 /* SPDX-License-Identifier: BSD-3-Clause
- * Copyright(c) 2020-2023 Intel Corporation
+ * Copyright(c) 2020-2024 Intel Corporation
  */
 
 Intel® ICE driver
 ==
 
 This directory contains source code of FreeBSD ice driver of version
-2023.03.30 released by the team which develops
+2024.08.19 released by the team which develops
 basic drivers for any ice NIC. The directory of base/ contains the
 original source package.
 This driver is valid for the product(s) listed below
 
 * Intel® Ethernet Network Adapters E810
+* Intel® Ethernet Network Adapters E830
 
 Updating the driver
 ===
-- 
2.34.1



Re: [PATCH v3 1/1] dts: Remove XML-RPC server for Scapy TG and instead use PythonShell

2024-09-24 Thread Juraj Linkeš
I like how this looks. I have a number of minor comments (mainly wording 
and naming), but overall it looks very good.


On 19. 9. 2024 21:02, jspew...@iol.unh.edu wrote:

From: Jeremy Spewock 

Previously all scapy commands were handled using an XML-RPC server that
ran on the TGNode. This unnecessarily enforces a minimum Python version
of 3.10 on the server that is being used as a traffic generator and
complicates the implementation of scapy methods.


What is the TG's minimum Python version now? 
https://bugs.dpdk.org/show_bug.cgi?id=1388 says this will become a moot 
point, but we're still using Python on the remote node.




diff --git a/dts/framework/remote_session/single_active_interactive_shell.py 
b/dts/framework/remote_session/single_active_interactive_shell.py



@@ -93,9 +94,13 @@ def __init__(
  timeout: float = SETTINGS.timeout,
  app_params: Params = Params(),
  name: str | None = None,
+**kwargs,
  ) -> None:
  """Create an SSH channel during initialization.
  
+Additional key-word arguments can be passed through `kwargs` is needed to fulfill other


Typo: is -> if

And key-word should be just keyword.

Also, should we add super().__init__(), seeing as we also added it to 
TrafficGenerator?




diff --git a/dts/framework/testbed_model/traffic_generator/scapy.py 
b/dts/framework/testbed_model/traffic_generator/scapy.py



+class ScapyTrafficGenerator(PythonShell, CapturingTrafficGenerator):



+def __init__(self, tg_node: Node, config: ScapyTrafficGeneratorConfig, 
**kwargs):


The usage of privileged=True when creating the instance confused me for 
a bit, because it's not this class's argument, but rather 
SingleActiveInteractiveShell's. I thing we should document the arguments 
of SingleActiveInteractiveShell in the Keyword Args section. We probably 
need just a link to SingleActiveInteractiveShell.




+def _create_packet_list(self, packets: list[Packet]) -> None:


Maybe we could apply some of the ideas from the local/remote naming 
scheme I talked about in the tg devbind script patch here. Whatever 
happens on the TG could be prefixed with remote and whatever is 
happening locally would be without the prefix (or maybe whatever is 
happnening on the TG shouldn't be prefixed (or a different prefix - 
shell)? Makes sense, but then we'd need a good prefix for what's 
happening on the execution environment. Maybe this also needs to be in a 
different patch, after it's been though through with everything else in 
mind.). That would make this _create_remote_packet_list, but we're just 
setting a variable (the passed packets have already been built), so 
maybe _set_remote_packet_list?


Or maybe all of the remote methods could start with remote, making it 
_remote_set_packet_list (or _shell_set_packet_list? Doesn't sound bad.). 
Not sure which is better, maybe after renaming more of these it's going 
to be clearer.


Whatever we go with, the naming scheme should be explained in the 
class's dosctring.




+def _create_sniffer(
+self, packets_to_send: list[Packet], send_port: Port, recv_port: Port, 
filter_config: str
+) -> None:
+"""Create an asynchronous sniffer in the shell.
+
+A list of packets to send is added to the sniffer inside of a callback 
function so that
+they are immediately sent at the time sniffing is started.


This is still a bit confusing (where are the packets added and what is 
inside the function?); We may not need the Scapy implementation details. 
We could just say the packets are sent immediately at the time sniffing 
is started. Or maybe:
A list of packets is passed to the sniffer's callback function so that 
they are immediately sent at the time sniffing is started.


It's a private method, so maybe the implementation detail could be 
valuable, even though it's not fully clear why the implication is true - 
you still need some knowledge of how the sniffer works to put everything 
together.




+sniffer_commands = [
+f"{self._sniffer_name} = AsyncSniffer(",
+f"iface='{recv_port.logical_name}',",
+"store=True,",
+"started_callback=lambda *args: sendp(",


As far as I can tell, we're not using the args, so we can just use 
"lambda: sendp()"



+(
+f"{self._python_indentation}{self._send_packet_list_name},"


Is the indentation needed here?



@@ -335,32 +204,19 @@ def _send_packets_and_capture(


I think we can improve the order of methods in the class. I'd put 
_send_packets() and _send_packets_and_capture() after the public 
methods. These two methods are the most important ones (implementing the 
abstract methods). The other methods should come after that in the order 
they're used in _send_packets() and _send_packets_and_capture().



  self,
  packets: list[Packet],
  send_port: Port,
-receive_port: Port,
+recv_port: Port,
  filter_config:

[PATCH V3 2/3] net/mlx5/hws: add log for failing to create rule in HWS

2024-09-24 Thread Minggang Li(Gavin)
Add log messages about the reason why the flow was failed to be created.

Signed-off-by: Minggang Li(Gavin) 
Acked-by: Alex Vesker 
---
 drivers/net/mlx5/hws/mlx5dr_rule.c | 6 ++
 1 file changed, 6 insertions(+)

diff --git a/drivers/net/mlx5/hws/mlx5dr_rule.c 
b/drivers/net/mlx5/hws/mlx5dr_rule.c
index 1edb7eac74..5d66d81ea5 100644
--- a/drivers/net/mlx5/hws/mlx5dr_rule.c
+++ b/drivers/net/mlx5/hws/mlx5dr_rule.c
@@ -638,6 +638,7 @@ static int mlx5dr_rule_destroy_hws(struct mlx5dr_rule *rule,
 
/* Rule is not completed yet */
if (rule->status == MLX5DR_RULE_STATUS_CREATING) {
+   DR_LOG(NOTICE, "Cannot destroy, rule creation still in 
progress");
rte_errno = EBUSY;
return rte_errno;
}
@@ -806,12 +807,14 @@ static int mlx5dr_rule_enqueue_precheck(struct 
mlx5dr_rule *rule,
struct mlx5dr_context *ctx = rule->matcher->tbl->ctx;
 
if (unlikely(!attr->user_data)) {
+   DR_LOG(DEBUG, "User data must be provided for rule operations");
rte_errno = EINVAL;
return rte_errno;
}
 
/* Check if there is room in queue */
if 
(unlikely(mlx5dr_send_engine_full(&ctx->send_queue[attr->queue_id]))) {
+   DR_LOG(NOTICE, "No room in queue[%d]", attr->queue_id);
rte_errno = EBUSY;
return rte_errno;
}
@@ -823,6 +826,7 @@ static int mlx5dr_rule_enqueue_precheck_move(struct 
mlx5dr_rule *rule,
 struct mlx5dr_rule_attr *attr)
 {
if (unlikely(rule->status != MLX5DR_RULE_STATUS_CREATED)) {
+   DR_LOG(DEBUG, "Cannot move, rule status is invalid");
rte_errno = EINVAL;
return rte_errno;
}
@@ -835,6 +839,7 @@ static int mlx5dr_rule_enqueue_precheck_create(struct 
mlx5dr_rule *rule,
 {
if (unlikely(mlx5dr_matcher_is_in_resize(rule->matcher))) {
/* Matcher in resize - new rules are not allowed */
+   DR_LOG(NOTICE, "Resizing in progress, cannot create rule");
rte_errno = EAGAIN;
return rte_errno;
}
@@ -1068,6 +1073,7 @@ int mlx5dr_rule_hash_calculate(struct mlx5dr_matcher 
*matcher,
mlx5dr_table_is_root(matcher->tbl) ||
matcher->tbl->ctx->caps->access_index_mode == 
MLX5DR_MATCHER_INSERT_BY_HASH ||
matcher->tbl->ctx->caps->flow_table_hash_type != 
MLX5_FLOW_TABLE_HASH_TYPE_CRC32) {
+   DR_LOG(DEBUG, "Matcher is not supported");
rte_errno = ENOTSUP;
return -rte_errno;
}
-- 
2.34.1



[PATCH V3 0/3] Error report improvement and fix

2024-09-24 Thread Minggang Li(Gavin)
This patch set is to improve error handling in pmd and under layer.

Gavin Li (3):
  net/mlx5: set rte errno if malloc failed
---
changelog:
v0->v1
- Fix typo in commit message
v1->v2
- Fix checkpatch warning
---
  net/mlx5/hws: add log for failing to create rule in HWS
---
changelog:
v1->v2
- Fix checkpatch warning
v2->v3
- Fix checkpatch warning
---
  net/mlx5/hws: print CQE error syndrome and more information
---
changelog:
v1->v2
- Fix checkpatch warning
v2->v3
- Fix checkpatch warning
---

 drivers/net/mlx5/hws/mlx5dr_rule.c |  6 ++
 drivers/net/mlx5/hws/mlx5dr_send.c |  9 -
 drivers/net/mlx5/mlx5_flow_hw.c| 31 +++---
 3 files changed, 38 insertions(+), 8 deletions(-)

-- 
2.34.1



[PATCH V3 1/3] net/mlx5: set rte errno if malloc failed

2024-09-24 Thread Minggang Li(Gavin)
rte_errno should be set if anything wrong happened in under layer so that
user can figure out what's going on.

There were some cases that did not set it when ipool allocation failed. To
fix the issue, set rte_errno to ENOMEM if mlx5_ipool_malloc failed to
allocate ID.

Fixes: c40c061a022e ("net/mlx5: add basic flow queue operation")
Fixes: 48fbb0e93d06 ("net/mlx5: support flow meter mark indirect action with 
HWS")
cc: sta...@dpdk.org
Signed-off-by: Minggang Li(Gavin) 
Acked-by: Bing Zhao 
---
 drivers/net/mlx5/mlx5_flow_hw.c | 31 ---
 1 file changed, 24 insertions(+), 7 deletions(-)

diff --git a/drivers/net/mlx5/mlx5_flow_hw.c b/drivers/net/mlx5/mlx5_flow_hw.c
index a275154d4b..f34670b3ec 100644
--- a/drivers/net/mlx5/mlx5_flow_hw.c
+++ b/drivers/net/mlx5/mlx5_flow_hw.c
@@ -1905,7 +1905,7 @@ flow_hw_meter_mark_alloc(struct rte_eth_dev *dev, 
uint32_t queue,
const struct rte_flow_action_meter_mark *meter_mark = action->conf;
struct mlx5_aso_mtr *aso_mtr;
struct mlx5_flow_meter_info *fm;
-   uint32_t mtr_id;
+   uint32_t mtr_id = 0;
uintptr_t handle = (uintptr_t)MLX5_INDIRECT_ACTION_TYPE_METER_MARK <<
MLX5_INDIRECT_ACTION_TYPE_OFFSET;
 
@@ -1917,8 +1917,15 @@ flow_hw_meter_mark_alloc(struct rte_eth_dev *dev, 
uint32_t queue,
if (meter_mark->profile == NULL)
return NULL;
aso_mtr = mlx5_ipool_malloc(pool->idx_pool, &mtr_id);
-   if (!aso_mtr)
+   if (!aso_mtr) {
+   rte_flow_error_set(error, ENOMEM,
+  RTE_FLOW_ERROR_TYPE_UNSPECIFIED,
+  NULL,
+  "failed to allocate aso meter entry");
+   if (mtr_id)
+   mlx5_ipool_free(pool->idx_pool, mtr_id);
return NULL;
+   }
/* Fill the flow meter parameters. */
aso_mtr->type = ASO_METER_INDIRECT;
fm = &aso_mtr->fm;
@@ -3926,8 +3933,10 @@ flow_hw_async_flow_create(struct rte_eth_dev *dev,
return NULL;
}
flow = mlx5_ipool_malloc(table->flow, &flow_idx);
-   if (!flow)
+   if (!flow) {
+   rte_errno = ENOMEM;
goto error;
+   }
rule_acts = flow_hw_get_dr_action_buffer(priv, table, 
action_template_index, queue);
/*
 * Set the table here in order to know the destination table
@@ -3938,8 +3947,10 @@ flow_hw_async_flow_create(struct rte_eth_dev *dev,
flow->idx = flow_idx;
if (table->resource) {
mlx5_ipool_malloc(table->resource, &res_idx);
-   if (!res_idx)
+   if (!res_idx) {
+   rte_errno = ENOMEM;
goto error;
+   }
flow->res_idx = res_idx;
} else {
flow->res_idx = flow_idx;
@@ -4070,8 +4081,10 @@ flow_hw_async_flow_create_by_index(struct rte_eth_dev 
*dev,
return NULL;
}
flow = mlx5_ipool_malloc(table->flow, &flow_idx);
-   if (!flow)
+   if (!flow) {
+   rte_errno = ENOMEM;
goto error;
+   }
rule_acts = flow_hw_get_dr_action_buffer(priv, table, 
action_template_index, queue);
/*
 * Set the table here in order to know the destination table
@@ -4082,8 +4095,10 @@ flow_hw_async_flow_create_by_index(struct rte_eth_dev 
*dev,
flow->idx = flow_idx;
if (table->resource) {
mlx5_ipool_malloc(table->resource, &res_idx);
-   if (!res_idx)
+   if (!res_idx) {
+   rte_errno = ENOMEM;
goto error;
+   }
flow->res_idx = res_idx;
} else {
flow->res_idx = flow_idx;
@@ -4218,8 +4233,10 @@ flow_hw_async_flow_update(struct rte_eth_dev *dev,
nf->idx = of->idx;
if (table->resource) {
mlx5_ipool_malloc(table->resource, &res_idx);
-   if (!res_idx)
+   if (!res_idx) {
+   rte_errno = ENOMEM;
goto error;
+   }
nf->res_idx = res_idx;
} else {
nf->res_idx = of->res_idx;
-- 
2.34.1



[PATCH V3 3/3] net/mlx5/hws: print CQE error syndrome and more information

2024-09-24 Thread Minggang Li(Gavin)
Print CQE error syndrome and more information in case of queue error.

Signed-off-by: Minggang Li(Gavin) 
Acked-by: Alex Vesker 
---
 drivers/net/mlx5/hws/mlx5dr_send.c | 9 -
 1 file changed, 8 insertions(+), 1 deletion(-)

diff --git a/drivers/net/mlx5/hws/mlx5dr_send.c 
b/drivers/net/mlx5/hws/mlx5dr_send.c
index 3022c50260..e9abf3dddb 100644
--- a/drivers/net/mlx5/hws/mlx5dr_send.c
+++ b/drivers/net/mlx5/hws/mlx5dr_send.c
@@ -598,8 +598,15 @@ static void mlx5dr_send_engine_poll_cq(struct 
mlx5dr_send_engine *queue,
cqe_owner != sw_own)
return;
 
-   if (unlikely(cqe_opcode != MLX5_CQE_REQ))
+   if (unlikely(cqe_opcode != MLX5_CQE_REQ)) {
+   struct mlx5_err_cqe *err_cqe = (struct mlx5_err_cqe *)cqe;
+
+   DR_LOG(ERR, "CQE ERR:0x%x, Vendor_ERR:0x%x, OP:0x%x, QPN:0x%x, 
WQE_CNT:0x%x",
+   err_cqe->syndrome, err_cqe->vendor_err_synd, cqe_opcode,
+   (rte_be_to_cpu_32(err_cqe->s_wqe_opcode_qpn) & 
0xff),
+   rte_be_to_cpu_16(err_cqe->wqe_counter));
queue->err = true;
+   }
 
rte_io_rmb();
 
-- 
2.34.1



[RFC PATCH v6] mempool: fix mempool cache size

2024-09-24 Thread Morten Brørup
This patch refactors the mempool cache to fix two bugs:
1. When a mempool is created with a cache size of N objects, the cache was
actually created with a size of 1.5 * N objects.
2. The mempool cache field names did not reflect their purpose;
the "flushthresh" field held the size, and the "size" field held the
number of objects remaining in the cache when returning from a get
operation refilling it from the backend.

Especially the first item could be fatal:
When more objects than a mempool's configured cache size is held in the
mempool's caches associated with other lcores, a rightsized mempool may
unexpectedly run out of objects, causing the application to fail.

Furthermore, this patch introduces two optimizations:
1. The mempool caches are flushed to/filled from the backend in their
entirety, so backend accesses are CPU cache line aligned. (Assuming the
mempool cache size is a multiplum of a CPU cache line size divided by the
size of a pointer.)
2. The unlikely paths in the get and put functions, where the cache is
flushed to/filled from the backend, are moved from the inline functions to
separate helper functions, thereby reducing the code size of the inline
functions.
Note: Accessing the backend for cacheless mempools remains inline.

Various drivers accessing the mempool directly have been updated
accordingly.
These drivers did not update mempool statistics when accessing the mempool
directly, so that is fixed too.

Note: Performance not yet benchmarked.

Signed-off-by: Morten Brørup 
---
v6:
* Fix v5 incomplete implementation of passing large requests directly to
  the backend.
* Use memcpy instead of rte_memcpy where compiler complains about it.
* Added const to some function parameters.
v5:
* Moved helper functions back into the header file, for improved
  performance.
* Pass large requests directly to the backend. This also simplifies the
  code.
v4:
* Updated subject to reflect that misleading names are considered bugs.
* Rewrote patch description to provide more details about the bugs fixed.
  (Mattias Rönnblom)
* Moved helper functions, not to be inlined, to mempool C file.
  (Mattias Rönnblom)
* Pass requests for n >= RTE_MEMPOOL_CACHE_MAX_SIZE objects known at build
  time directly to backend driver, to avoid calling the helper functions.
  This also fixes the compiler warnings about out of bounds array access.
v3:
* Removed __attribute__(assume).
v2:
* Removed mempool perf test; not part of patch set.
---
 drivers/common/idpf/idpf_common_rxtx_avx512.c |  54 +---
 drivers/mempool/dpaa/dpaa_mempool.c   |  16 +-
 drivers/mempool/dpaa2/dpaa2_hw_mempool.c  |  14 -
 drivers/net/i40e/i40e_rxtx_vec_avx512.c   |  17 +-
 drivers/net/iavf/iavf_rxtx_vec_avx512.c   |  27 +-
 drivers/net/ice/ice_rxtx_vec_avx512.c |  27 +-
 lib/mempool/mempool_trace.h   |   1 -
 lib/mempool/rte_mempool.c |  12 +-
 lib/mempool/rte_mempool.h | 287 --
 9 files changed, 231 insertions(+), 224 deletions(-)

diff --git a/drivers/common/idpf/idpf_common_rxtx_avx512.c 
b/drivers/common/idpf/idpf_common_rxtx_avx512.c
index 3b5e124ec8..98535a48f3 100644
--- a/drivers/common/idpf/idpf_common_rxtx_avx512.c
+++ b/drivers/common/idpf/idpf_common_rxtx_avx512.c
@@ -1024,21 +1024,13 @@ idpf_tx_singleq_free_bufs_avx512(struct idpf_tx_queue 
*txq)
rte_lcore_id());
void **cache_objs;
 
-   if (cache == NULL || cache->len == 0)
-   goto normal;
-
-   cache_objs = &cache->objs[cache->len];
-
-   if (n > RTE_MEMPOOL_CACHE_MAX_SIZE) {
-   rte_mempool_ops_enqueue_bulk(mp, (void *)txep, n);
+   if (!cache || unlikely(n + cache->len > cache->size)) {
+   rte_mempool_generic_put(mp, (void *)txep, n, cache);
goto done;
}
 
-   /* The cache follows the following algorithm
-*   1. Add the objects to the cache
-*   2. Anything greater than the cache min value (if it 
crosses the
-*   cache flush threshold) is flushed to the ring.
-*/
+   cache_objs = &cache->objs[cache->len];
+
/* Add elements back into the cache */
uint32_t copied = 0;
/* n is multiple of 32 */
@@ -1056,16 +1048,13 @@ idpf_tx_singleq_free_bufs_avx512(struct idpf_tx_queue 
*txq)
}
cache->len += n;
 
-   if (cache->len >= cache->flushthresh) {
-   rte_mempool_ops_enqueue_bulk(mp,
-&cache->objs[cache->size],
-cache->len - cache->size);
-   cache->len = cache->size;
-   }
+   /* Increment stat. */
+   RTE_MEMPOOL

Re: [PATCH v4 02/11] dts: add test case decorators

2024-09-24 Thread Juraj Linkeš




On 23. 9. 2024 21:26, Jeremy Spewock wrote:

One super nit-pick comment below,  even without that though I still
think this looks good.

Reviewed-by: Jeremy Spewock 

On Mon, Sep 23, 2024 at 11:02 AM Juraj Linkeš
 wrote:


+def is_test_case(function: Callable) -> bool:
+if inspect.isfunction(function):
+# TestCase is not used at runtime, so we can't use 
isinstance() with `function`.
+# But function.test_type exists.
+if hasattr(function, "test_type"):
+return isinstance(function.test_type, TestCaseType)
+return False
+
+if test_case_sublist is None:
+test_case_sublist = []
+
+# the copy is needed so that the condition "elif test_case_sublist" 
doesn't
+# change mid-cycle
+test_case_sublist_copy = list(test_case_sublist)
+func_test_cases = set()
+perf_test_cases = set()
+
+for test_case_name, test_case_function in inspect.getmembers(cls, 
is_test_case):
+if test_case_name in test_case_sublist_copy:
+# if test_case_sublist_copy is non-empty, remove the found 
test case
+# so that we can look at the remainder at the end
+test_case_sublist_copy.remove(test_case_name)
+elif test_case_sublist:
+# the original list not being empty means we're filtering test 
cases


This might read a little better if there was a period at the end, but
I still think this gets the point across as is.



Ack, I'll capitalize and add periods to all comments in this method.


+# since we didn't remove test_case_name in the previous branch,
+# it doesn't match the filter and we don't want to remove it
+continue
+
+match test_case_function.test_type:
+case TestCaseType.PERFORMANCE:
+perf_test_cases.add(test_case_function)
+case TestCaseType.FUNCTIONAL:
+func_test_cases.add(test_case_function)
+



2.43.0





Re: [PATCH v4 06/11] dts: add NIC capability support

2024-09-24 Thread Juraj Linkeš




On 23. 9. 2024 21:26, Jeremy Spewock wrote:

On Mon, Sep 23, 2024 at 11:02 AM Juraj Linkeš
 wrote:


Some test cases or suites may be testing a NIC feature that is not
supported on all NICs, so add support for marking test cases or suites
as requiring NIC capabilities.

The marking is done with a decorator, which populates the internal
required_capabilities attribute of TestProtocol. The NIC capability
itself is a wrapper around the NicCapability defined in testpmd_shell.
The reason is Enums cannot be extended and the class implements the
methods of its abstract base superclass.

The decorator API is designed to be simple to use. The arguments passed
to it are all from the testpmd shell. Everything else (even the actual
capability object creation) is done internally.

Signed-off-by: Juraj Linkeš 
Reviewed-by: Dean Marx 


Thank you for addressing all my comments!


Thanks you for giving them. Very helpful.


Reviewed-by: Jeremy Spewock 




[PATCH] net/r8169: add support for hw config

2024-09-24 Thread Howard Wang
Implement the rtl_hw_config function to configure the hardware.

Signed-off-by: Howard Wang 
---
 drivers/net/r8169/meson.build|   1 +
 drivers/net/r8169/r8169_base.h   | 125 ++
 drivers/net/r8169/r8169_ethdev.c |   2 +
 drivers/net/r8169/r8169_ethdev.h |  15 +-
 drivers/net/r8169/r8169_hw.c | 701 +++
 drivers/net/r8169/r8169_hw.h |  17 +
 drivers/net/r8169/r8169_phy.c|  41 ++
 drivers/net/r8169/r8169_phy.h|  21 +
 8 files changed, 921 insertions(+), 2 deletions(-)
 create mode 100644 drivers/net/r8169/r8169_phy.c
 create mode 100644 drivers/net/r8169/r8169_phy.h

diff --git a/drivers/net/r8169/meson.build b/drivers/net/r8169/meson.build
index ff7d6ca4b8..56f857ac8c 100644
--- a/drivers/net/r8169/meson.build
+++ b/drivers/net/r8169/meson.build
@@ -5,5 +5,6 @@ sources = files(
'r8169_ethdev.c',
'r8169_hw.c',
'r8169_rxtx.c',
+   'r8169_phy.c',
 )
 
diff --git a/drivers/net/r8169/r8169_base.h b/drivers/net/r8169/r8169_base.h
index 0e79d8d22a..2e72faeb2c 100644
--- a/drivers/net/r8169/r8169_base.h
+++ b/drivers/net/r8169/r8169_base.h
@@ -23,6 +23,117 @@ typedef uint16_t  u16;
 typedef uint32_t  u32;
 typedef uint64_t  u64;
 
+enum mcfg {
+   CFG_METHOD_1 = 1,
+   CFG_METHOD_2,
+   CFG_METHOD_3,
+   CFG_METHOD_4,
+   CFG_METHOD_5,
+   CFG_METHOD_6,
+   CFG_METHOD_7,
+   CFG_METHOD_8,
+   CFG_METHOD_9,
+   CFG_METHOD_10,
+   CFG_METHOD_11,
+   CFG_METHOD_12,
+   CFG_METHOD_13,
+   CFG_METHOD_14,
+   CFG_METHOD_15,
+   CFG_METHOD_16,
+   CFG_METHOD_17,
+   CFG_METHOD_18,
+   CFG_METHOD_19,
+   CFG_METHOD_20,
+   CFG_METHOD_21,
+   CFG_METHOD_22,
+   CFG_METHOD_23,
+   CFG_METHOD_24,
+   CFG_METHOD_25,
+   CFG_METHOD_26,
+   CFG_METHOD_27,
+   CFG_METHOD_28,
+   CFG_METHOD_29,
+   CFG_METHOD_30,
+   CFG_METHOD_31,
+   CFG_METHOD_32,
+   CFG_METHOD_33,
+   CFG_METHOD_34,
+   CFG_METHOD_35,
+   CFG_METHOD_36,
+   CFG_METHOD_37,
+   CFG_METHOD_38,
+   CFG_METHOD_39,
+   CFG_METHOD_40,
+   CFG_METHOD_41,
+   CFG_METHOD_42,
+   CFG_METHOD_43,
+   CFG_METHOD_44,
+   CFG_METHOD_45,
+   CFG_METHOD_46,
+   CFG_METHOD_47,
+   CFG_METHOD_48,
+   CFG_METHOD_49,
+   CFG_METHOD_50,
+   CFG_METHOD_51,
+   CFG_METHOD_52,
+   CFG_METHOD_53,
+   CFG_METHOD_54,
+   CFG_METHOD_55,
+   CFG_METHOD_56,
+   CFG_METHOD_57,
+   CFG_METHOD_58,
+   CFG_METHOD_59,
+   CFG_METHOD_60,
+   CFG_METHOD_61,
+   CFG_METHOD_62,
+   CFG_METHOD_63,
+   CFG_METHOD_64,
+   CFG_METHOD_65,
+   CFG_METHOD_66,
+   CFG_METHOD_67,
+   CFG_METHOD_68,
+   CFG_METHOD_69,
+   CFG_METHOD_70,
+   CFG_METHOD_71,
+   CFG_METHOD_MAX,
+   CFG_METHOD_DEFAULT = 0xFF
+};
+
+enum bits {
+   BIT_0 = (1UL << 0),
+   BIT_1 = (1UL << 1),
+   BIT_2 = (1UL << 2),
+   BIT_3 = (1UL << 3),
+   BIT_4 = (1UL << 4),
+   BIT_5 = (1UL << 5),
+   BIT_6 = (1UL << 6),
+   BIT_7 = (1UL << 7),
+   BIT_8 = (1UL << 8),
+   BIT_9 = (1UL << 9),
+   BIT_10 = (1UL << 10),
+   BIT_11 = (1UL << 11),
+   BIT_12 = (1UL << 12),
+   BIT_13 = (1UL << 13),
+   BIT_14 = (1UL << 14),
+   BIT_15 = (1UL << 15),
+   BIT_16 = (1UL << 16),
+   BIT_17 = (1UL << 17),
+   BIT_18 = (1UL << 18),
+   BIT_19 = (1UL << 19),
+   BIT_20 = (1UL << 20),
+   BIT_21 = (1UL << 21),
+   BIT_22 = (1UL << 22),
+   BIT_23 = (1UL << 23),
+   BIT_24 = (1UL << 24),
+   BIT_25 = (1UL << 25),
+   BIT_26 = (1UL << 26),
+   BIT_27 = (1UL << 27),
+   BIT_28 = (1UL << 28),
+   BIT_29 = (1UL << 29),
+   BIT_30 = (1UL << 30),
+   BIT_31 = (1UL << 31)
+};
+
 enum RTL_registers {
MAC0= 0x00, /* Ethernet hardware address */
MAC4= 0x04,
@@ -363,6 +474,8 @@ enum RTL_register_content {
INT_CFG0_ENABLE_8125= (1 << 0),
INT_CFG0_TIMEOUT0_BYPASS_8125   = (1 << 1),
INT_CFG0_MITIGATION_BYPASS_8125 = (1 << 2),
+   INT_CFG0_RDU_BYPASS_8126= (1 << 4),
+   INT_CFG0_MSIX_ENTRY_NUM_MODE= (1 << 5),
ISRIMR_V2_ROK_Q0 = (1 << 0),
ISRIMR_TOK_Q0= (1 << 16),
ISRIMR_TOK_Q1= (1 << 18),
@@ -389,6 +502,18 @@ enum RTL_register_content {
 #define msleep rte_delay_ms
 #define usleep rte_delay_us
 
+#define RX_DMA_BURST_unlimited  7   /* Maximum PCI burst, '7' is unlimited */
+#define RX_DMA_BURST_5125
+#define TX_DMA_BURST_unlimited  7
+#define TX_DMA_BURST_1024   6
+#define TX_DMA_BURST_5125
+#define TX_DMA_BURST_2564
+#define TX_DMA_BURST_1283
+#define TX_DMA_BURST_64 2
+#define TX_DMA_BURST_32 1
+#define TX_DMA_BURST_16 0
+#define InterFrameGap   0x03/* 3 means InterFrameG

grout: DPDK router project

2024-09-24 Thread Thomas Monjalon
Hello,

dpdk.org is welcoming a new project: grout

grout is a DPDK-based network processing application.
It uses the graph library for data path processing.

Its main purpose is to simulate a network function or a physical router 
for testing/replicating real (usually closed source) VNF/CNF behavior 
with an Open Source tool.

The project is still in its early stages and it is very promising.
Please check it by yourself and don't hesitate to participate in its growing.
It could groute a lot more new protocol layers!

The project has been presented at the DPDK summit 2024 in Montréal
this Tuesday, September 24th:

https://sched.co/1iAtk

The source code is available here:

https://git.dpdk.org/apps/grout/about/
https://github.com/DPDK/grout/

For any questions, you can send a message to

gr...@dpdk.org
or
DPDK Slack #grout

Happy grouting




Re: [PATCH v2 1/1] dts: add send_packets to test suites and rework packet addressing

2024-09-24 Thread Juraj Linkeš




On 20. 9. 2024 20:08, jspew...@iol.unh.edu wrote:

From: Jeremy Spewock 

Currently the only method provided in the test suite class for sending
packets sends a single packet and then captures the results. There is,
in some cases, a need to send multiple packets at once while not really
needing to capture any traffic received back. The method to do this
exists in the traffic generator already, but this patch exposes the
method to test suites.



Patrick mentioned there is now a method that does it 
(send_packets_and_capture()), but that one captures the packets. I think 
we should add this method though, as it does something different, but 
maybe the most important point is that non-capturing TGs are going to 
support send_packets_and_capture(), so we need it either way.



This patch also updates the _adjust_addresses method of test suites so
that addresses of packets are only modified if the developer did not
configure them beforehand. This allows for developers to have more
control over the content of their packets when sending them through the
framework.



I think this could and should be split into two patches. They aren't 
really connected.




diff --git a/dts/framework/test_suite.py b/dts/framework/test_suite.py



@@ -243,41 +255,74 @@ def get_expected_packet(self, packet: Packet) -> Packet:
  Returns:
  `packet` with injected L2/L3 addresses.
  """
-return self._adjust_addresses(packet, expected=True)
+return self._adjust_addresses([packet], expected=True)[0]
  
-def _adjust_addresses(self, packet: Packet, expected: bool = False) -> Packet:

+def _adjust_addresses(self, packets: list[Packet], expected: bool = False) 
-> list[Packet]:
  """L2 and L3 address additions in both directions.
  
+Packets in `packets` will be directly modified in this method. The returned list of packets

+however will be copies of the modified packets in order to keep the 
two lists distinct.
+


Do we need to do this? I guess you needed this in a test suite - what is 
the reason?



+Only missing addresses are added to packets, existing addresses will 
not be overridden. If
+any packet in `packets` has multiple IP layers (using GRE, for 
example) only the inner-most
+IP layer will have its addresses adjusted.
+
  Assumptions:
  Two links between SUT and TG, one link is TG -> SUT, the other SUT 
-> TG.
  
  Args:

-packet: The packet to modify.
+packets: The packets to modify.
  expected: If :data:`True`, the direction is SUT -> TG,
  otherwise the direction is TG -> SUT.
+
+Returns:
+A list containing copies of all packets in `packets` after 
modification.
  """
-if expected:
-# The packet enters the TG from SUT
-# update l2 addresses
-packet.src = self._sut_port_egress.mac_address
-packet.dst = self._tg_port_ingress.mac_address
+ret_packets = []
+for packet in packets:
+# The fields parameter of a packet does not include fields of the 
payload, so this can
+# only be the Ether src/dst.
+pkt_src_is_unset = "src" not in packet.fields
+pkt_dst_is_unset = "dst" not in packet.fields
+num_ip_layers = packet.layers().count(IP)
+
+# Update the last IP layer if there are multiple to account for 
GRE addressing (the


This comment should be more generic as it's going to apply to other 
things than just GRE.



+# framework should be modifying the packet address instead of the 
tunnel).
+l3_to_use = packet.getlayer(IP, num_ip_layers)


Would this make more sense inside the positive if branch right below?


+if num_ip_layers > 0:
+ip_src_is_unset = "src" not in l3_to_use.fields
+ip_dst_is_unset = "dst" not in l3_to_use.fields
+else:
+ip_src_is_unset = None
+ip_dst_is_unset = None
  
-# The packet is routed from TG egress to TG ingress

-# update l3 addresses
-packet.payload.src = self._tg_ip_address_egress.ip.exploded
-packet.payload.dst = self._tg_ip_address_ingress.ip.exploded
-else:
-# The packet leaves TG towards SUT
  # update l2 addresses
-packet.src = self._tg_port_egress.mac_address
-packet.dst = self._sut_port_ingress.mac_address
+# If `expected` is :data:`True`, the packet enters the TG from 
SUT, otherwise the
+# packet leaves the TG towards the SUT
+if pkt_src_is_unset:
+packet.src = (
+self._sut_port_egress.mac_address
+if expected
+else self._tg_port_egress.mac_address
+)
+if pkt_dst_is_unset:
+packet

Re: [PATCH v2 1/1] dts: add binding to different drivers to TG node

2024-09-24 Thread Jeremy Spewock
On Tue, Sep 24, 2024 at 5:12 AM Juraj Linkeš  wrote:
>
> I have some thoughts for the future:
> 1a. The traffic generator is specified per-node, so maybe we could also
> change the binding to be for the whole lifetime of the TG node,
> 1b. But the same is true for the SUT node as well, right? After we do
> the port update (with kernel driver), we can just bind to DPDK driver.
> With SUT in the mix, this looks like a change for a different patch,

Right, these are good points. A good observation too that we only
really need the kernel driver at the start in both cases. You had
mentioned in your previous comments as well that we should only be
binding on the TG once per lifetime, but I ended up not adding it for
that very reason of I still wanted the binding to be in Node, but I
didn't want to change the process for the SUT.

> 2. We could add a symlink to the devbind script with the target being in
> the dts directory. This way we don't have to go outside the dts
> directory and if DTS ever become a python package, we could just copy
> the script to the appropriate place. This is also something we don't
> really need to do.

I like this idea a lot actually. It feels very weird to me having to
step out of the DTS directory and I like the idea of keeping it
together like it were a package (even if it isn't yet).

>
> And also two minor comments. A lot of suggestions for separate patches
> overall. :-)
>
> On 19. 9. 2024 20:16, jspew...@iol.unh.edu wrote:
> > From: Jeremy Spewock 
> >
> > The DTS framework in its current state supports binding ports to
> > different drivers on the SUT node but not the TG node. The TG node
> > already has the information that it needs about the different drivers
> > that it has available in the configuration file, but it did not
> > previously have access to the devbind script, so it did not use that
> > information for anything.
> >
> > This patch moves the location of the tmp directory as well as the method
> > for binding ports into the node class rather than the SUT node class and
> > adds an abstract method for getting the path to the devbind script into
> > the node class. Then, binding ports to the correct drivers is moved into
> > the build target setup and run on both nodes.
> >
> > Bugzilla ID: 1420
> >
> > Signed-off-by: Jeremy Spewock 
> > ---
>
> With the two minor comments,
> Reviewed-by: Juraj Linkeš 
>
> > diff --git a/dts/framework/testbed_model/node.py 
> > b/dts/framework/testbed_model/node.py
>
> > @@ -58,8 +65,10 @@ class Node(ABC):
> >   lcores: list[LogicalCore]
> >   ports: list[Port]
> >   _logger: DTSLogger
> > +_remote_tmp_dir: PurePath
> >   _other_sessions: list[OSSession]
> >   _test_run_config: TestRunConfiguration
> > +_path_to_devbind_script: PurePath | None
>
> A note on the naming. We have _remote_tmp_dir and
> _path_to_devbind_script. Both are pointing to a remote file/dir, but
> only one has the _remote prefix. They should probably be unified.

I didn't think of this but you're right, the two are very similar but
named differently.

>
> I've thought a bit about what the right name is. Dropping the prefix
> makes sense; sut_node.tmp_dir should mean the tmp dir on the SUT node
> (which would make it remote from the execution host's point of view, but
> not the node's view; the file is local to SUT node). This could be a
> good separate patch (improving the remote/local naming scheme to make it
> consistent and as sensible as possible).

I also like the sound of it without the prefix and how it actually has
a more fitting meaning from the two perspectives. I agree that there
is probably some other work to be done on this in another patch, but
since I am moving the _remote_tmp_dir variable around anyway I think
it wouldn't hurt for me to rename it.

>
>
> > diff --git a/dts/framework/utils.py b/dts/framework/utils.py
>
> > @@ -29,6 +29,8 @@
> >   from .exception import ConfigurationError, InternalError
> >
> >   REGEX_FOR_PCI_ADDRESS: str = 
> > "/[0-9a-fA-F]{4}:[0-9a-fA-F]{2}:[0-9a-fA-F]{2}.[0-9]{1}/"
> > +#: Path to DPDK directory on the host where DTS is being run.
> > +LOCAL_DPDK_DIR: PurePath = PurePath(__file__).parents[2]
>
> Local paths can be just pathlib.Path. PurePaths are for path
> manipulations only (useful for remote paths in RemoteSessions,
> OSSessions and Nodes), but for local existing paths we should use Path.
>
> The OSSession and subclasses need a bit of an update in this regard -
> use Path for local paths and PurePaths for remote ones. We added this
> into our pre-built DPDK patch.

Ack. I tried to search for the differences between the two actually
and saw a similar answer that PurePaths are used for manipulation, but
I didn't fully understand as I am still manipulated this path and it
isn't referencing a real file on host where I am defining the variable
(what ever host is running DTS that is). This is a good distinction
though to use Paths when it is used as a local path somewhere.

>
> >
> >

RE: [PATCH v3] eal: add build-time option to omit trace

2024-09-24 Thread Morten Brørup
> > +++ b/app/test/test_trace.c
> > @@ -12,7 +12,16 @@
> >
> >  int app_dpdk_test_tp_count;
> >
> > -#ifdef RTE_EXEC_ENV_WINDOWS
> > +#if !defined(RTE_TRACE)
> 
> Now that, we are is not disabling any symbols,
> This conditional compilation can be removed. Use
> __rte_trace_point_generic_is_enabled() in another leg.

Done in v4.
Also added to perf test.


> > +/**
> > + * @internal
> > + *
> > + * Test if the tracepoint compile-time option is enabled.
> > + *
> > + * @return
> > + *   true if tracepoint enabled, false otherwise.
> > + */
> > +__rte_experimental
> > +static __rte_always_inline bool
> > +__rte_trace_point_generic_is_enabled(void)
> 
> Add to version.map file

It's static inline, so not an exposed symbol.
__rte_trace_point_fp_is_enabled is also not in the version.map file.


> >  #define __rte_trace_point_emit_header_generic(t) \
> >  void *mem; \
> >  do { \
> > +   if (!__rte_trace_point_generic_is_enabled()) \
> > +   return; \
> 
> 
> To be super safe, Add similar check in
> RTE_INIT(trace##_init)

Done in v4.



Re: [PATCH v4] eal: add build-time option to omit trace

2024-09-24 Thread Jerin Jacob
On Tue, Sep 24, 2024 at 7:10 PM Morten Brørup  
wrote:
>
> Some applications want to omit the trace feature.
> Either to reduce the memory footprint, to reduce the exposed attack
> surface, or for other reasons.
>
> This patch adds an option in rte_config.h to include or omit trace in the
> build. Trace is included by default.
>
> Omitting trace works by omitting all trace points.
> For API and ABI compatibility, the trace feature itself remains.
>
> Signed-off-by: Morten Brørup 
> ---
> v4:
> * Added check for generic trace enabled when registering trace points, in
>   RTE_INIT. (Jerin Jacob)
> * Test application uses function instead of macro to check if generic
>   trace is enabled. (Jerin Jacob)
> * Performance test application uses function to check if generic trace is
>   enabled.
> v3:
> * Simpler version with much fewer ifdefs. (Jerin Jacob)
> v2:
> * Added/modified macros required for building applications with trace
>   omitted.

>
> +/**
> + * @internal

Since it is used in app/test. Can we remove it as internal and make
symbol as rte_trace_point_is_enabled

> + *
> + * Test if the tracepoint compile-time option is enabled.
> + *
> + * @return
> + *   true if tracepoint enabled, false otherwise.
> + */
> +__rte_experimental
> +static __rte_always_inline bool
> +__rte_trace_point_generic_is_enabled(void)

Do we need to keep _generic_

Rest looks good to me.


[RFC PATCH v7] mempool: fix mempool cache size

2024-09-24 Thread Morten Brørup
This patch refactors the mempool cache to fix two bugs:
1. When a mempool is created with a cache size of N objects, the cache was
actually created with a size of 1.5 * N objects.
2. The mempool cache field names did not reflect their purpose;
the "flushthresh" field held the size, and the "size" field held the
number of objects remaining in the cache when returning from a get
operation refilling it from the backend.

Especially the first item could be fatal:
When more objects than a mempool's configured cache size is held in the
mempool's caches associated with other lcores, a rightsized mempool may
unexpectedly run out of objects, causing the application to fail.

Furthermore, this patch introduces two optimizations:
1. The mempool caches are flushed to/filled from the backend in their
entirety, so backend accesses are CPU cache line aligned. (Assuming the
mempool cache size is a multiplum of a CPU cache line size divided by the
size of a pointer.)
2. The unlikely paths in the get and put functions, where the cache is
flushed to/filled from the backend, are moved from the inline functions to
separate helper functions, thereby reducing the code size of the inline
functions.
Note: Accessing the backend for cacheless mempools remains inline.

Various drivers accessing the mempool directly have been updated
accordingly.
These drivers did not update mempool statistics when accessing the mempool
directly, so that is fixed too.

Note: Performance not yet benchmarked.

Signed-off-by: Morten Brørup 
---
v7:
* Increased max mempool cache size from 512 to 1024 objects.
  Mainly for CI performance test purposes.
  Originally, the max mempool cache size was 768 objects, and used a fixed
  size array of 1024 objects in the mempool cache structure.
v6:
* Fix v5 incomplete implementation of passing large requests directly to
  the backend.
* Use memcpy instead of rte_memcpy where compiler complains about it.
* Added const to some function parameters.
v5:
* Moved helper functions back into the header file, for improved
  performance.
* Pass large requests directly to the backend. This also simplifies the
  code.
v4:
* Updated subject to reflect that misleading names are considered bugs.
* Rewrote patch description to provide more details about the bugs fixed.
  (Mattias Rönnblom)
* Moved helper functions, not to be inlined, to mempool C file.
  (Mattias Rönnblom)
* Pass requests for n >= RTE_MEMPOOL_CACHE_MAX_SIZE objects known at build
  time directly to backend driver, to avoid calling the helper functions.
  This also fixes the compiler warnings about out of bounds array access.
v3:
* Removed __attribute__(assume).
v2:
* Removed mempool perf test; not part of patch set.
---
 config/rte_config.h   |   2 +-
 drivers/common/idpf/idpf_common_rxtx_avx512.c |  54 +---
 drivers/mempool/dpaa/dpaa_mempool.c   |  16 +-
 drivers/mempool/dpaa2/dpaa2_hw_mempool.c  |  14 -
 drivers/net/i40e/i40e_rxtx_vec_avx512.c   |  17 +-
 drivers/net/iavf/iavf_rxtx_vec_avx512.c   |  27 +-
 drivers/net/ice/ice_rxtx_vec_avx512.c |  27 +-
 lib/mempool/mempool_trace.h   |   1 -
 lib/mempool/rte_mempool.c |  12 +-
 lib/mempool/rte_mempool.h | 287 --
 10 files changed, 232 insertions(+), 225 deletions(-)

diff --git a/config/rte_config.h b/config/rte_config.h
index dd7bb0d35b..2488ff167d 100644
--- a/config/rte_config.h
+++ b/config/rte_config.h
@@ -56,7 +56,7 @@
 #define RTE_CONTIGMEM_DEFAULT_BUF_SIZE (512*1024*1024)
 
 /* mempool defines */
-#define RTE_MEMPOOL_CACHE_MAX_SIZE 512
+#define RTE_MEMPOOL_CACHE_MAX_SIZE 1024
 /* RTE_LIBRTE_MEMPOOL_STATS is not set */
 /* RTE_LIBRTE_MEMPOOL_DEBUG is not set */
 
diff --git a/drivers/common/idpf/idpf_common_rxtx_avx512.c 
b/drivers/common/idpf/idpf_common_rxtx_avx512.c
index 3b5e124ec8..98535a48f3 100644
--- a/drivers/common/idpf/idpf_common_rxtx_avx512.c
+++ b/drivers/common/idpf/idpf_common_rxtx_avx512.c
@@ -1024,21 +1024,13 @@ idpf_tx_singleq_free_bufs_avx512(struct idpf_tx_queue 
*txq)
rte_lcore_id());
void **cache_objs;
 
-   if (cache == NULL || cache->len == 0)
-   goto normal;
-
-   cache_objs = &cache->objs[cache->len];
-
-   if (n > RTE_MEMPOOL_CACHE_MAX_SIZE) {
-   rte_mempool_ops_enqueue_bulk(mp, (void *)txep, n);
+   if (!cache || unlikely(n + cache->len > cache->size)) {
+   rte_mempool_generic_put(mp, (void *)txep, n, cache);
goto done;
}
 
-   /* The cache follows the following algorithm
-*   1. Add the objects to the cache
-*   2. Anything greater than the cache min value (if it 
crosses the
-*   cache flush threshold) is flushed to the ring.
-*/
+   cache_o

Re: [PATCH v4] eal: add build-time option to omit trace

2024-09-24 Thread Jerin Jacob
On Tue, Sep 24, 2024 at 9:23 PM Morten Brørup  
wrote:
>
> > From: Jerin Jacob [mailto:jerinjac...@gmail.com]
> > Sent: Tuesday, 24 September 2024 11.42
> >
> > On Tue, Sep 24, 2024 at 7:10 PM Morten Brørup 
> > wrote:
> > >
> > > Some applications want to omit the trace feature.
> > > Either to reduce the memory footprint, to reduce the exposed attack
> > > surface, or for other reasons.
> > >
> > > This patch adds an option in rte_config.h to include or omit trace in
> > the
> > > build. Trace is included by default.
> > >
> > > Omitting trace works by omitting all trace points.
> > > For API and ABI compatibility, the trace feature itself remains.
> > >
> > > Signed-off-by: Morten Brørup 
> > > ---
> > > v4:
> > > * Added check for generic trace enabled when registering trace points,
> > in
> > >   RTE_INIT. (Jerin Jacob)
> > > * Test application uses function instead of macro to check if generic
> > >   trace is enabled. (Jerin Jacob)
> > > * Performance test application uses function to check if generic trace
> > is
> > >   enabled.
> > > v3:
> > > * Simpler version with much fewer ifdefs. (Jerin Jacob)
> > > v2:
> > > * Added/modified macros required for building applications with trace
> > >   omitted.
> >
> > >
> > > +/**
> > > + * @internal
> >
> > Since it is used in app/test. Can we remove it as internal and make
> > symbol as rte_trace_point_is_enabled
>
> I don't think we should make functions public if only used for test purposes.
>
> Do you think it is useful for normal usage too? Does rte_trace_is_enabled() 
> not suffice?

It will be good for an app to know, Is trace feature disabled if the
application cares about.
Yes. rte_trace_is_enabled() suffice.


>
> >
> > > + *
> > > + * Test if the tracepoint compile-time option is enabled.
> > > + *
> > > + * @return
> > > + *   true if tracepoint enabled, false otherwise.
> > > + */
> > > +__rte_experimental
> > > +static __rte_always_inline bool
> > > +__rte_trace_point_generic_is_enabled(void)
> >
> > Do we need to keep _generic_
>
> Other internal functions have _generic_ too, so I added it.
> If we decide to make it public, I agree _generic_ should be removed.
>
> >
> > Rest looks good to me.


RE: [EXT] [PATCH 1/3] cryptodev: add ec points to sm2 op

2024-09-24 Thread Akhil Goyal
Hi Arek,

Can you resend this series if you still want to pursue so that CI can run?

> > > In the case when PMD cannot support full process of the SM2, but
> > > elliptic curve computation only, additional fields are needed to
> > > handle such a case.
> > >
> >
> > Asym crypto APIs are no longer experimental.
> > Hence adding new fields would lead to ABI break.
> 
> It seems that
> `__rte_crypto_op_reset`
> `rte_crypto_op_pool_create`
> functions do not need versioning, and we could easily do it if needed.
> But the field `flags` changes an offset, and this is actually problematic.
> Which means that we cannot do this change before 24.11.
> 
> >
> > > Points C1, kP therefore were added to the SM2 crypto operation struct.
> > >
> > > Signed-off-by: Arkadiusz Kusztal 


Re: [PATCH v3 1/1] dts: Remove XML-RPC server for Scapy TG and instead use PythonShell

2024-09-24 Thread Jeremy Spewock
On Tue, Sep 24, 2024 at 6:55 AM Juraj Linkeš  wrote:
>
> I like how this looks. I have a number of minor comments (mainly wording
> and naming), but overall it looks very good.
>
> On 19. 9. 2024 21:02, jspew...@iol.unh.edu wrote:
> > From: Jeremy Spewock 
> >
> > Previously all scapy commands were handled using an XML-RPC server that
> > ran on the TGNode. This unnecessarily enforces a minimum Python version
> > of 3.10 on the server that is being used as a traffic generator and
> > complicates the implementation of scapy methods.
>
> What is the TG's minimum Python version now?
> https://bugs.dpdk.org/show_bug.cgi?id=1388 says this will become a moot
> point, but we're still using Python on the remote node.

Right, there is still some dependency there. I'm not sure the exact
versions, but doing some looking around I believe one of the newest
things scapy tools we use in the framework is the AsyncSniffer and the
scapy documentation [1] says that it has been available since version
2.4.3, and then when I looked at that version of the scapy package [2]
it looks like it claims to support python 2.7 and python 3.4-7. Poking
around in the documentation/code from scapy version 2.4.3 it also
looks like the syntax is very similar, so I believe it would work, but
I'm not sure I have any hosts that I could run on which have python
3.4. Maybe that does make the dependency essentially a moot point
considering these are fairly old python versions.

[1] https://github.com/secdev/scapy/blob/master/doc/scapy/usage.rst
[2] https://pypi.org/project/scapy/2.4.3/

>
>
> > diff --git 
> > a/dts/framework/remote_session/single_active_interactive_shell.py 
> > b/dts/framework/remote_session/single_active_interactive_shell.py
>
> > @@ -93,9 +94,13 @@ def __init__(
> >   timeout: float = SETTINGS.timeout,
> >   app_params: Params = Params(),
> >   name: str | None = None,
> > +**kwargs,
> >   ) -> None:
> >   """Create an SSH channel during initialization.
> >
> > +Additional key-word arguments can be passed through `kwargs` is 
> > needed to fulfill other
>
> Typo: is -> if
>
> And key-word should be just keyword.
>

Ack.

> Also, should we add super().__init__(), seeing as we also added it to
> TrafficGenerator?
>

We probably should so that it doesn't matter which order you specify
the two in and they still work, this is probably something that I
missed in the rebase.

>
> > diff --git a/dts/framework/testbed_model/traffic_generator/scapy.py 
> > b/dts/framework/testbed_model/traffic_generator/scapy.py
>
> > +class ScapyTrafficGenerator(PythonShell, CapturingTrafficGenerator):
>
> > +def __init__(self, tg_node: Node, config: ScapyTrafficGeneratorConfig, 
> > **kwargs):
>
> The usage of privileged=True when creating the instance confused me for
> a bit, because it's not this class's argument, but rather
> SingleActiveInteractiveShell's. I thing we should document the arguments
> of SingleActiveInteractiveShell in the Keyword Args section. We probably
> need just a link to SingleActiveInteractiveShell.

Sure, I can add some kind of reference. I agree that it's confusing, I
wish there was a simple way we could unpack the parameters of the
interactive shell into kwargs, but when I was searching I couldn't
really find anything great. Other than doing something like making
TypedDict classes for interactive shell parameters, but I'm not sure
that's super sleek either.

>
>
> > +def _create_packet_list(self, packets: list[Packet]) -> None:
>
> Maybe we could apply some of the ideas from the local/remote naming
> scheme I talked about in the tg devbind script patch here. Whatever
> happens on the TG could be prefixed with remote and whatever is
> happening locally would be without the prefix (or maybe whatever is
> happnening on the TG shouldn't be prefixed (or a different prefix -
> shell)? Makes sense, but then we'd need a good prefix for what's
> happening on the execution environment. Maybe this also needs to be in a
> different patch, after it's been though through with everything else in

I'll still write something here that makes the distinction and that
other patch could reformat if the author thought something else was
better.

> mind.). That would make this _create_remote_packet_list, but we're just
> setting a variable (the passed packets have already been built), so
> maybe _set_remote_packet_list?
>
> Or maybe all of the remote methods could start with remote, making it
> _remote_set_packet_list (or _shell_set_packet_list? Doesn't sound bad.).
> Not sure which is better, maybe after renaming more of these it's going
> to be clearer.

This is a good idea. I like remote more initially, but I'll try it out
on some of the methods and see if shell fits better. Regardless, I
think one of those two would be a good option as well.

>
> Whatever we go with, the naming scheme should be explained in the
> class's dosctring.

Ack.

>
>
> > +def _create_sniffer(
> > + 

[PATCH v2 0/2] cryptodev: add queue pair priority

2024-09-24 Thread Akhil Goyal
Changes in v2:
- added release notes and removed deprecation notice.
- removed gerrit-id

Akhil Goyal (2):
  cryptodev: add queue pair priority
  app/crypto-perf: test queue pair priority

 app/test-crypto-perf/cperf_options.h |  3 +++
 app/test-crypto-perf/cperf_options_parsing.c | 20 ++
 app/test-crypto-perf/main.c  |  6 +-
 doc/guides/rel_notes/deprecation.rst |  3 ---
 doc/guides/rel_notes/release_24_11.rst   |  3 +++
 doc/guides/tools/cryptoperf.rst  |  5 +
 lib/cryptodev/rte_cryptodev.h| 22 
 7 files changed, 58 insertions(+), 4 deletions(-)

-- 
2.25.1



[PATCH v2 2/2] app/crypto-perf: test queue pair priority

2024-09-24 Thread Akhil Goyal
Updated the application to test variable queue pair priority.
A mask using `--low-prio-qp-mask` can be set to lower the priority
of queues which are set in the mask.
This would result in lower performance for those queues.
By default the priority is set as highest.
This option is added just to verify the performance drop
of queues which have lower priority set.

Signed-off-by: Akhil Goyal 
---
 app/test-crypto-perf/cperf_options.h |  3 +++
 app/test-crypto-perf/cperf_options_parsing.c | 20 
 app/test-crypto-perf/main.c  |  6 +-
 doc/guides/tools/cryptoperf.rst  |  5 +
 4 files changed, 33 insertions(+), 1 deletion(-)

diff --git a/app/test-crypto-perf/cperf_options.h 
b/app/test-crypto-perf/cperf_options.h
index 131ecfdffb..48f409c3ba 100644
--- a/app/test-crypto-perf/cperf_options.h
+++ b/app/test-crypto-perf/cperf_options.h
@@ -32,6 +32,8 @@
 #define CPERF_TEST_FILE("test-file")
 #define CPERF_TEST_NAME("test-name")
 
+#define CPERF_LOW_PRIO_QP_MASK ("low-prio-qp-mask")
+
 #define CPERF_CIPHER_ALGO  ("cipher-algo")
 #define CPERF_CIPHER_OP("cipher-op")
 #define CPERF_CIPHER_KEY_SZ("cipher-key-sz")
@@ -107,6 +109,7 @@ struct cperf_options {
uint32_t *imix_buffer_sizes;
uint32_t nb_descriptors;
uint16_t nb_qps;
+   uint64_t low_prio_qp_mask;
 
uint32_t sessionless:1;
uint32_t shared_session:1;
diff --git a/app/test-crypto-perf/cperf_options_parsing.c 
b/app/test-crypto-perf/cperf_options_parsing.c
index c91fcf0479..8c15cd813f 100644
--- a/app/test-crypto-perf/cperf_options_parsing.c
+++ b/app/test-crypto-perf/cperf_options_parsing.c
@@ -37,6 +37,7 @@ usage(char *progname)
" --segment-sz N: set the size of the segment to use\n"
" --desc-nb N: set number of descriptors for each crypto 
device\n"
" --devtype TYPE: set crypto device type to use\n"
+   " --low-prio-qp-mask mask: set low priority for queues set in 
mask(hex)\n"
" --optype cipher-only / auth-only / cipher-then-auth / 
auth-then-cipher /\n"
"aead / pdcp / docsis / ipsec / modex / secp256r1 / sm2 
/ tls-record : set operation type\n"
" --sessionless: enable session-less crypto operations\n"
@@ -941,6 +942,22 @@ parse_pmd_cyclecount_delay_ms(struct cperf_options *opts,
return 0;
 }
 
+static int
+parse_low_prio_qp_mask(struct cperf_options *opts, const char *arg)
+{
+   char *end = NULL;
+   unsigned long n;
+
+   /* parse hexadecimal string */
+   n = strtoul(arg, &end, 16);
+   if ((optarg[0] == '\0') || (end == NULL) || (*end != '\0'))
+   return -1;
+
+   opts->low_prio_qp_mask = n;
+
+   return 0;
+}
+
 typedef int (*option_parser_t)(struct cperf_options *opts,
const char *arg);
 
@@ -962,6 +979,8 @@ static struct option lgopts[] = {
{ CPERF_SEGMENT_SIZE, required_argument, 0, 0 },
{ CPERF_DESC_NB, required_argument, 0, 0 },
 
+   { CPERF_LOW_PRIO_QP_MASK, required_argument, 0, 0 },
+
{ CPERF_IMIX, required_argument, 0, 0 },
{ CPERF_DEVTYPE, required_argument, 0, 0 },
{ CPERF_OPTYPE, required_argument, 0, 0 },
@@ -1097,6 +1116,7 @@ cperf_opts_parse_long(int opt_idx, struct cperf_options 
*opts)
{ CPERF_BUFFER_SIZE,parse_buffer_sz },
{ CPERF_SEGMENT_SIZE,   parse_segment_sz },
{ CPERF_DESC_NB,parse_desc_nb },
+   { CPERF_LOW_PRIO_QP_MASK,   parse_low_prio_qp_mask },
{ CPERF_DEVTYPE,parse_device_type },
{ CPERF_OPTYPE, parse_op_type },
{ CPERF_SESSIONLESS,parse_sessionless },
diff --git a/app/test-crypto-perf/main.c b/app/test-crypto-perf/main.c
index 75810dbf0b..eac87091a1 100644
--- a/app/test-crypto-perf/main.c
+++ b/app/test-crypto-perf/main.c
@@ -249,7 +249,8 @@ cperf_initialize_cryptodev(struct cperf_options *opts, 
uint8_t *enabled_cdevs)
}
 
struct rte_cryptodev_qp_conf qp_conf = {
-   .nb_descriptors = opts->nb_descriptors
+   .nb_descriptors = opts->nb_descriptors,
+   .priority = RTE_CRYPTODEV_QP_PRIORITY_HIGHEST
};
 
/**
@@ -315,6 +316,9 @@ cperf_initialize_cryptodev(struct cperf_options *opts, 
uint8_t *enabled_cdevs)
}
 
for (j = 0; j < opts->nb_qps; j++) {
+   if ((1 << j) & opts->low_prio_qp_mask)
+   qp_conf.priority = 
RTE_CRYPTODEV_QP_PRIORITY_LOWEST;
+
ret = rte_cryptodev_queue_pair_setup(cdev_id, j,
&qp_conf, socket_id);
if (ret < 0) {
diff --git a/doc/guides/tools/cryptoperf.rst b/doc/guides/tools/cryptoperf.rst
index 0510a3

[PATCH v2 1/2] cryptodev: add queue pair priority

2024-09-24 Thread Akhil Goyal
Added a new field priority in `rte_cryptodev_qp_conf`,
to set the queue pair priority while setting up.
The priorities can be set between
`RTE_CRYPTODEV_QP_PRIORITY_HIGHEST` and
`RTE_CRYPTODEV_QP_PRIORITY_LOWEST`.
The underlying implementation may normalize the value
as per the supported priority levels.
If the implementation does not support setting up
priority, all queue pairs will be on same priority level
and this field will be ignored.

Signed-off-by: Akhil Goyal 
---
 doc/guides/rel_notes/deprecation.rst   |  3 ---
 doc/guides/rel_notes/release_24_11.rst |  3 +++
 lib/cryptodev/rte_cryptodev.h  | 22 ++
 3 files changed, 25 insertions(+), 3 deletions(-)

diff --git a/doc/guides/rel_notes/deprecation.rst 
b/doc/guides/rel_notes/deprecation.rst
index 211f59fdc9..3605e9cddc 100644
--- a/doc/guides/rel_notes/deprecation.rst
+++ b/doc/guides/rel_notes/deprecation.rst
@@ -174,9 +174,6 @@ Deprecation Notices
   which got error interrupt to the application,
   so that application can reset that particular queue pair.
 
-* cryptodev: The structure ``rte_cryptodev_qp_conf`` will be updated
-  to have a new parameter to set priority of that particular queue pair.
-
 * cryptodev: The enum ``rte_crypto_asym_xform_type`` and struct 
``rte_crypto_asym_op``
   will be extended to include new values to support EDDSA.
   This will break ABI compatibility with existing applications that use these 
data types.
diff --git a/doc/guides/rel_notes/release_24_11.rst 
b/doc/guides/rel_notes/release_24_11.rst
index 0ff70d9057..fcdc738cf6 100644
--- a/doc/guides/rel_notes/release_24_11.rst
+++ b/doc/guides/rel_notes/release_24_11.rst
@@ -100,6 +100,9 @@ ABI Changes
Also, make sure to start the actual text at the margin.
===
 
+* cryptodev: The queue pair configuration structure ``rte_cryptodev_qp_conf``
+  is updated to have a new parameter to set priority of that particular queue 
pair.
+
 
 Known Issues
 
diff --git a/lib/cryptodev/rte_cryptodev.h b/lib/cryptodev/rte_cryptodev.h
index bec947f6d5..0a9cd718ea 100644
--- a/lib/cryptodev/rte_cryptodev.h
+++ b/lib/cryptodev/rte_cryptodev.h
@@ -608,12 +608,34 @@ enum rte_cryptodev_event_type {
RTE_CRYPTODEV_EVENT_MAX /**< max value of this enum */
 };
 
+/* Crypto queue pair priority levels */
+#define RTE_CRYPTODEV_QP_PRIORITY_HIGHEST   0
+/**< Highest priority of a cryptodev queue pair
+ * @see rte_cryptodev_queue_pair_setup(), rte_cryptodev_enqueue_burst()
+ */
+#define RTE_CRYPTODEV_QP_PRIORITY_NORMAL128
+/**< Normal priority of a cryptodev queue pair
+ * @see rte_cryptodev_queue_pair_setup(), rte_cryptodev_enqueue_burst()
+ */
+#define RTE_CRYPTODEV_QP_PRIORITY_LOWEST255
+/**< Lowest priority of a cryptodev queue pair
+ * @see rte_cryptodev_queue_pair_setup(), rte_cryptodev_enqueue_burst()
+ */
+
 /** Crypto device queue pair configuration structure. */
 /* Structure rte_cryptodev_qp_conf 8<*/
 struct rte_cryptodev_qp_conf {
uint32_t nb_descriptors; /**< Number of descriptors per queue pair */
struct rte_mempool *mp_session;
/**< The mempool for creating session in sessionless mode */
+   uint8_t priority;
+   /**< Priority for this queue pair relative to other queue pairs.
+*
+* The requested priority should in the range of
+* [@ref RTE_CRYPTODEV_QP_PRIORITY_HIGHEST, @ref 
RTE_CRYPTODEV_QP_PRIORITY_LOWEST].
+* The implementation may normalize the requested priority to
+* device supported priority value.
+*/
 };
 /* >8 End of structure rte_cryptodev_qp_conf. */
 
-- 
2.25.1



[PATCH v4] eal: add build-time option to omit trace

2024-09-24 Thread Morten Brørup
Some applications want to omit the trace feature.
Either to reduce the memory footprint, to reduce the exposed attack
surface, or for other reasons.

This patch adds an option in rte_config.h to include or omit trace in the
build. Trace is included by default.

Omitting trace works by omitting all trace points.
For API and ABI compatibility, the trace feature itself remains.

Signed-off-by: Morten Brørup 
---
v4:
* Added check for generic trace enabled when registering trace points, in
  RTE_INIT. (Jerin Jacob)
* Test application uses function instead of macro to check if generic
  trace is enabled. (Jerin Jacob)
* Performance test application uses function to check if generic trace is
  enabled.
v3:
* Simpler version with much fewer ifdefs. (Jerin Jacob)
v2:
* Added/modified macros required for building applications with trace
  omitted.
---
 app/test/test_trace.c  |  4 
 app/test/test_trace_perf.c |  5 +
 config/rte_config.h|  1 +
 lib/eal/include/rte_trace_point.h  | 21 +
 lib/eal/include/rte_trace_point_register.h |  2 ++
 5 files changed, 33 insertions(+)

diff --git a/app/test/test_trace.c b/app/test/test_trace.c
index 00809f433b..26656fe024 100644
--- a/app/test/test_trace.c
+++ b/app/test/test_trace.c
@@ -245,6 +245,10 @@ static struct unit_test_suite trace_tests = {
 static int
 test_trace(void)
 {
+   if (!__rte_trace_point_generic_is_enabled()) {
+   printf("Trace omitted at build-time, skipping test\n");
+   return TEST_SKIPPED;
+   }
return unit_test_suite_runner(&trace_tests);
 }
 
diff --git a/app/test/test_trace_perf.c b/app/test/test_trace_perf.c
index 8257cc02be..504574ef8a 100644
--- a/app/test/test_trace_perf.c
+++ b/app/test/test_trace_perf.c
@@ -150,6 +150,11 @@ test_trace_perf(void)
struct test_data *data;
size_t sz;
 
+   if (!__rte_trace_point_generic_is_enabled()) {
+   printf("Trace omitted at build-time, skipping test\n");
+   return TEST_SKIPPED;
+   }
+
nb_cores = rte_lcore_count();
nb_workers = nb_cores - 1;
if (nb_cores < 2) {
diff --git a/config/rte_config.h b/config/rte_config.h
index dd7bb0d35b..fd6f8a2f1a 100644
--- a/config/rte_config.h
+++ b/config/rte_config.h
@@ -49,6 +49,7 @@
 #define RTE_MAX_TAILQ 32
 #define RTE_LOG_DP_LEVEL RTE_LOG_INFO
 #define RTE_MAX_VFIO_CONTAINERS 64
+#define RTE_TRACE 1
 
 /* bsd module defines */
 #define RTE_CONTIGMEM_MAX_NUM_BUFS 64
diff --git a/lib/eal/include/rte_trace_point.h 
b/lib/eal/include/rte_trace_point.h
index 41e2a7f99e..b80688ce89 100644
--- a/lib/eal/include/rte_trace_point.h
+++ b/lib/eal/include/rte_trace_point.h
@@ -212,6 +212,25 @@ bool rte_trace_point_is_enabled(rte_trace_point_t *tp);
 __rte_experimental
 rte_trace_point_t *rte_trace_point_lookup(const char *name);
 
+/**
+ * @internal
+ *
+ * Test if the tracepoint compile-time option is enabled.
+ *
+ * @return
+ *   true if tracepoint enabled, false otherwise.
+ */
+__rte_experimental
+static __rte_always_inline bool
+__rte_trace_point_generic_is_enabled(void)
+{
+#ifdef RTE_TRACE
+   return true;
+#else
+   return false;
+#endif
+}
+
 /**
  * @internal
  *
@@ -359,6 +378,8 @@ __rte_trace_point_emit_ev_header(void *mem, uint64_t in)
 #define __rte_trace_point_emit_header_generic(t) \
 void *mem; \
 do { \
+   if (!__rte_trace_point_generic_is_enabled()) \
+   return; \
const uint64_t val = rte_atomic_load_explicit(t, 
rte_memory_order_acquire); \
if (likely(!(val & __RTE_TRACE_FIELD_ENABLE_MASK))) \
return; \
diff --git a/lib/eal/include/rte_trace_point_register.h 
b/lib/eal/include/rte_trace_point_register.h
index 41260e5964..429b993fc2 100644
--- a/lib/eal/include/rte_trace_point_register.h
+++ b/lib/eal/include/rte_trace_point_register.h
@@ -23,6 +23,8 @@ rte_trace_point_t __rte_section("__rte_trace_point") 
__##trace; \
 static const char __##trace##_name[] = RTE_STR(name); \
 RTE_INIT(trace##_init) \
 { \
+   if (!__rte_trace_point_generic_is_enabled()) \
+   return; \
__rte_trace_point_register(&__##trace, __##trace##_name, \
(void (*)(void)) trace); \
 }
-- 
2.43.0



Re: [PATCH v3 2/2] dts: add binding to different drivers to TG node

2024-09-24 Thread Juraj Linkeš




On 24. 9. 2024 18:28, jspew...@iol.unh.edu wrote:

From: Jeremy Spewock 

The DTS framework in its current state supports binding ports to
different drivers on the SUT node but not the TG node. The TG node
already has the information that it needs about the different drivers
that it has available in the configuration file, but it did not
previously have access to the devbind script, so it did not use that
information for anything.

This patch moves the location of the tmp directory as well as the method
for binding ports into the node class rather than the SUT node class and
adds an abstract method for getting the path to the devbind script into
the node class. Then, binding ports to the correct drivers is moved into
the build target setup and run on both nodes.

Bugzilla ID: 1420

Signed-off-by: Jeremy Spewock 
---


Reviewed-by: Juraj Linkeš 



[PATCH v8 0/8] support dump register names and filter

2024-09-24 Thread Jie Hai
The registers can be dumped through the API rte_eth_dev_get_reg_info.
However, only register values are exported, which is inconvenient for
users to interpret. Therefore, an extension of the structure
"rte_dev_reg_info" and a new API rte_eth_dev_get_reg_info_ext is added
to support the capability of exporting the name of the corresponding
register and filtering by module names.

--
v8:
1. fix typo on patch[0/8].
2. add Reviewed-bys and Acked-bys.
3. fix potentital memory leak and error information on telemetry.
4. fix on reviews about hns3 driver.

v7: fix doc indention.
--

Jie Hai (8):
  ethdev: support report register names and filter
  ethdev: add telemetry cmd for registers
  net/hns3: remove some basic address dump
  net/hns3: fix dump counter of registers
  net/hns3: remove separators between register module
  net/hns3: refactor register dump
  net/hns3: support report names of registers
  net/hns3: support filter registers by module names

 doc/guides/nics/hns3.rst   |7 +
 doc/guides/rel_notes/release_24_11.rst |8 +
 drivers/net/hns3/hns3_regs.c   | 1384 +++-
 lib/ethdev/ethdev_trace.h  |2 +
 lib/ethdev/rte_dev_info.h  |   11 +
 lib/ethdev/rte_ethdev.c|   38 +
 lib/ethdev/rte_ethdev.h|   29 +
 lib/ethdev/rte_ethdev_telemetry.c  |  130 +++
 lib/ethdev/version.map |1 +
 9 files changed, 1344 insertions(+), 266 deletions(-)

-- 
2.33.0



[PATCH v8 3/8] net/hns3: remove some basic address dump

2024-09-24 Thread Jie Hai
For security reasons, some address registers are not suitable
to be exposed, remove them.

Cc: sta...@dpdk.org

Signed-off-by: Jie Hai 
Acked-by: Huisong Li
Acked-by: Chengwen Feng 
---
 drivers/net/hns3/hns3_regs.c | 12 ++--
 1 file changed, 2 insertions(+), 10 deletions(-)

diff --git a/drivers/net/hns3/hns3_regs.c b/drivers/net/hns3/hns3_regs.c
index be1be6a89c94..53d829a4fc68 100644
--- a/drivers/net/hns3/hns3_regs.c
+++ b/drivers/net/hns3/hns3_regs.c
@@ -17,13 +17,9 @@
 
 static int hns3_get_dfx_reg_line(struct hns3_hw *hw, uint32_t *lines);
 
-static const uint32_t cmdq_reg_addrs[] = {HNS3_CMDQ_TX_ADDR_L_REG,
- HNS3_CMDQ_TX_ADDR_H_REG,
- HNS3_CMDQ_TX_DEPTH_REG,
+static const uint32_t cmdq_reg_addrs[] = {HNS3_CMDQ_TX_DEPTH_REG,
  HNS3_CMDQ_TX_TAIL_REG,
  HNS3_CMDQ_TX_HEAD_REG,
- HNS3_CMDQ_RX_ADDR_L_REG,
- HNS3_CMDQ_RX_ADDR_H_REG,
  HNS3_CMDQ_RX_DEPTH_REG,
  HNS3_CMDQ_RX_TAIL_REG,
  HNS3_CMDQ_RX_HEAD_REG,
@@ -44,9 +40,7 @@ static const uint32_t common_vf_reg_addrs[] = 
{HNS3_MISC_VECTOR_REG_BASE,
   HNS3_FUN_RST_ING,
   HNS3_GRO_EN_REG};
 
-static const uint32_t ring_reg_addrs[] = {HNS3_RING_RX_BASEADDR_L_REG,
- HNS3_RING_RX_BASEADDR_H_REG,
- HNS3_RING_RX_BD_NUM_REG,
+static const uint32_t ring_reg_addrs[] = {HNS3_RING_RX_BD_NUM_REG,
  HNS3_RING_RX_BD_LEN_REG,
  HNS3_RING_RX_EN_REG,
  HNS3_RING_RX_MERGE_EN_REG,
@@ -57,8 +51,6 @@ static const uint32_t ring_reg_addrs[] = 
{HNS3_RING_RX_BASEADDR_L_REG,
  HNS3_RING_RX_FBD_OFFSET_REG,
  HNS3_RING_RX_STASH_REG,
  HNS3_RING_RX_BD_ERR_REG,
- HNS3_RING_TX_BASEADDR_L_REG,
- HNS3_RING_TX_BASEADDR_H_REG,
  HNS3_RING_TX_BD_NUM_REG,
  HNS3_RING_TX_EN_REG,
  HNS3_RING_TX_PRIORITY_REG,
-- 
2.33.0



[PATCH v8 7/8] net/hns3: support report names of registers

2024-09-24 Thread Jie Hai
This patch adds names for register lists, and support report
names of registers. Some registers has different names on
different platform, use names of HIP08 as default names.

Signed-off-by: Jie Hai 
Acked-by: Chengwen Feng 
---
 drivers/net/hns3/hns3_regs.c | 1088 +-
 1 file changed, 955 insertions(+), 133 deletions(-)

diff --git a/drivers/net/hns3/hns3_regs.c b/drivers/net/hns3/hns3_regs.c
index 89858c2b1c09..a0d130302839 100644
--- a/drivers/net/hns3/hns3_regs.c
+++ b/drivers/net/hns3/hns3_regs.c
@@ -14,73 +14,827 @@
 
 static int hns3_get_dfx_reg_cnt(struct hns3_hw *hw, uint32_t *count);
 
-static const uint32_t cmdq_reg_addrs[] = {HNS3_CMDQ_TX_DEPTH_REG,
- HNS3_CMDQ_TX_TAIL_REG,
- HNS3_CMDQ_TX_HEAD_REG,
- HNS3_CMDQ_RX_DEPTH_REG,
- HNS3_CMDQ_RX_TAIL_REG,
- HNS3_CMDQ_RX_HEAD_REG,
- HNS3_VECTOR0_CMDQ_SRC_REG,
- HNS3_CMDQ_INTR_STS_REG,
- HNS3_CMDQ_INTR_EN_REG,
- HNS3_CMDQ_INTR_GEN_REG};
-
-static const uint32_t common_reg_addrs[] = {HNS3_MISC_VECTOR_REG_BASE,
-   HNS3_VECTOR0_OTER_EN_REG,
-   HNS3_MISC_RESET_STS_REG,
-   HNS3_VECTOR0_OTHER_INT_STS_REG,
-   HNS3_GLOBAL_RESET_REG,
-   HNS3_FUN_RST_ING,
-   HNS3_GRO_EN_REG};
-
-static const uint32_t common_vf_reg_addrs[] = {HNS3_MISC_VECTOR_REG_BASE,
-  HNS3_FUN_RST_ING,
-  HNS3_GRO_EN_REG};
-
-static const uint32_t ring_reg_addrs[] = {HNS3_RING_RX_BD_NUM_REG,
- HNS3_RING_RX_BD_LEN_REG,
- HNS3_RING_RX_EN_REG,
- HNS3_RING_RX_MERGE_EN_REG,
- HNS3_RING_RX_TAIL_REG,
- HNS3_RING_RX_HEAD_REG,
- HNS3_RING_RX_FBDNUM_REG,
- HNS3_RING_RX_OFFSET_REG,
- HNS3_RING_RX_FBD_OFFSET_REG,
- HNS3_RING_RX_STASH_REG,
- HNS3_RING_RX_BD_ERR_REG,
- HNS3_RING_TX_BD_NUM_REG,
- HNS3_RING_TX_EN_REG,
- HNS3_RING_TX_PRIORITY_REG,
- HNS3_RING_TX_TC_REG,
- HNS3_RING_TX_MERGE_EN_REG,
- HNS3_RING_TX_TAIL_REG,
- HNS3_RING_TX_HEAD_REG,
- HNS3_RING_TX_FBDNUM_REG,
- HNS3_RING_TX_OFFSET_REG,
- HNS3_RING_TX_EBD_NUM_REG,
- HNS3_RING_TX_EBD_OFFSET_REG,
- HNS3_RING_TX_BD_ERR_REG,
- HNS3_RING_EN_REG};
-
-static const uint32_t tqp_intr_reg_addrs[] = {HNS3_TQP_INTR_CTRL_REG,
- HNS3_TQP_INTR_GL0_REG,
- HNS3_TQP_INTR_GL1_REG,
- HNS3_TQP_INTR_GL2_REG,
- HNS3_TQP_INTR_RL_REG};
+struct hns3_dirt_reg_entry {
+   const char *name;
+   uint32_t addr;
+};
+
+static const struct hns3_dirt_reg_entry cmdq_reg_list[] = {
+   {"cmdq_tx_depth",   HNS3_CMDQ_TX_DEPTH_REG},
+   {"cmdq_tx_tail",HNS3_CMDQ_TX_TAIL_REG},
+   {"cmdq_tx_head",HNS3_CMDQ_TX_HEAD_REG},
+   {"cmdq_rx_depth",   HNS3_CMDQ_RX_DEPTH_REG},
+   {"cmdq_rx_tail",HNS3_CMDQ_RX_TAIL_REG},
+   {"cmdq_rx_head",HNS3_CMDQ_RX_HEAD_REG},
+   {"vector0_cmdq_src",HNS3_VECTOR0_CMDQ_SRC_REG},
+   {"cmdq_intr_sts",   HNS3_CMDQ_INTR_STS_REG},
+   {"cmdq_intr_en",HNS3_CMDQ_INTR_EN_REG},
+   {"cmdq_intr_gen",   HNS3_CMDQ_INTR_GEN_REG},
+};
+
+static const struct hns3_dirt_reg_entry common_reg_list[] = {
+   {"misc_vector_reg_base",HNS3_MISC_VECTOR_REG_BASE},
+   {"vector0_oter_en", HNS3_VECTOR0_OTER_EN_REG},
+   {"misc_reset_sts",  HNS3_MISC_RESET_STS_REG},
+

[PATCH v8 6/8] net/hns3: refactor register dump

2024-09-24 Thread Jie Hai
This patch refactors codes dumping registers from firmware.

Signed-off-by: Jie Hai 
Acked-by: Chengwen Feng 
---
 drivers/net/hns3/hns3_regs.c | 203 ---
 1 file changed, 115 insertions(+), 88 deletions(-)

diff --git a/drivers/net/hns3/hns3_regs.c b/drivers/net/hns3/hns3_regs.c
index c8e3fb118e4b..89858c2b1c09 100644
--- a/drivers/net/hns3/hns3_regs.c
+++ b/drivers/net/hns3/hns3_regs.c
@@ -104,12 +104,93 @@ hns3_get_regs_num(struct hns3_hw *hw, uint32_t 
*regs_num_32_bit,
return 0;
 }
 
+static int
+hns3_get_32_64_regs_cnt(struct hns3_hw *hw, uint32_t *count)
+{
+   uint32_t regs_num_32_bit, regs_num_64_bit;
+   int ret;
+
+   ret = hns3_get_regs_num(hw, ®s_num_32_bit, ®s_num_64_bit);
+   if (ret) {
+   hns3_err(hw, "fail to get the number of registers, "
+"ret = %d.", ret);
+   return ret;
+   }
+
+   *count += regs_num_32_bit + regs_num_64_bit * 
HNS3_64_BIT_REG_OUTPUT_SIZE;
+   return 0;
+}
+
+static int
+hns3_get_dfx_reg_bd_num(struct hns3_hw *hw, uint32_t *bd_num_list,
+   uint32_t list_size)
+{
+#define HNS3_GET_DFX_REG_BD_NUM_SIZE   4
+   struct hns3_cmd_desc desc[HNS3_GET_DFX_REG_BD_NUM_SIZE];
+   uint32_t index, desc_index;
+   uint32_t bd_num;
+   uint32_t i;
+   int ret;
+
+   for (i = 0; i < HNS3_GET_DFX_REG_BD_NUM_SIZE - 1; i++) {
+   hns3_cmd_setup_basic_desc(&desc[i], HNS3_OPC_DFX_BD_NUM, true);
+   desc[i].flag |= rte_cpu_to_le_16(HNS3_CMD_FLAG_NEXT);
+   }
+   /* The last BD does not need a next flag */
+   hns3_cmd_setup_basic_desc(&desc[i], HNS3_OPC_DFX_BD_NUM, true);
+
+   ret = hns3_cmd_send(hw, desc, HNS3_GET_DFX_REG_BD_NUM_SIZE);
+   if (ret) {
+   hns3_err(hw, "fail to get dfx bd num, ret = %d.\n", ret);
+   return ret;
+   }
+
+   /* The first data in the first BD is a reserved field */
+   for (i = 1; i <= list_size; i++) {
+   desc_index = i / HNS3_CMD_DESC_DATA_NUM;
+   index = i % HNS3_CMD_DESC_DATA_NUM;
+   bd_num = rte_le_to_cpu_32(desc[desc_index].data[index]);
+   bd_num_list[i - 1] = bd_num;
+   }
+
+   return 0;
+}
+
+static int
+hns3_get_dfx_reg_cnt(struct hns3_hw *hw, uint32_t *count)
+{
+   int opcode_num = RTE_DIM(hns3_dfx_reg_opcode_list);
+   uint32_t bd_num_list[opcode_num];
+   int ret;
+   int i;
+
+   ret = hns3_get_dfx_reg_bd_num(hw, bd_num_list, opcode_num);
+   if (ret)
+   return ret;
+
+   for (i = 0; i < opcode_num; i++)
+   *count += bd_num_list[i] * HNS3_CMD_DESC_DATA_NUM;
+
+   return 0;
+}
+
+static int
+hns3_get_firmware_reg_cnt(struct hns3_hw *hw, uint32_t *count)
+{
+   int ret;
+
+   ret = hns3_get_32_64_regs_cnt(hw, count);
+   if (ret < 0)
+   return ret;
+
+   return hns3_get_dfx_reg_cnt(hw, count);
+}
+
 static int
 hns3_get_regs_length(struct hns3_hw *hw, uint32_t *length)
 {
struct hns3_adapter *hns = HNS3_DEV_HW_TO_ADAPTER(hw);
-   uint32_t regs_num_32_bit, regs_num_64_bit;
-   uint32_t dfx_reg_cnt;
+   uint32_t dfx_reg_cnt = 0;
uint32_t common_cnt;
uint32_t len;
int ret;
@@ -125,16 +206,7 @@ hns3_get_regs_length(struct hns3_hw *hw, uint32_t *length)
len /= sizeof(uint32_t);
 
if (!hns->is_vf) {
-   ret = hns3_get_regs_num(hw, ®s_num_32_bit, ®s_num_64_bit);
-   if (ret) {
-   hns3_err(hw, "fail to get the number of registers, "
-"ret = %d.", ret);
-   return ret;
-   }
-   dfx_reg_cnt = regs_num_32_bit +
- regs_num_64_bit * HNS3_64_BIT_REG_OUTPUT_SIZE;
-
-   ret = hns3_get_dfx_reg_cnt(hw, &dfx_reg_cnt);
+   ret = hns3_get_firmware_reg_cnt(hw, &dfx_reg_cnt);
if (ret) {
hns3_err(hw, "fail to get the number of dfx registers, "
 "ret = %d.", ret);
@@ -304,41 +376,6 @@ hns3_direct_access_regs(struct hns3_hw *hw, uint32_t *data)
return data - origin_data_ptr;
 }
 
-static int
-hns3_get_dfx_reg_bd_num(struct hns3_hw *hw, uint32_t *bd_num_list,
-   uint32_t list_size)
-{
-#define HNS3_GET_DFX_REG_BD_NUM_SIZE   4
-   struct hns3_cmd_desc desc[HNS3_GET_DFX_REG_BD_NUM_SIZE];
-   uint32_t index, desc_index;
-   uint32_t bd_num;
-   uint32_t i;
-   int ret;
-
-   for (i = 0; i < HNS3_GET_DFX_REG_BD_NUM_SIZE - 1; i++) {
-   hns3_cmd_setup_basic_desc(&desc[i], HNS3_OPC_DFX_BD_NUM, true);
-   desc[i].flag |= rte_cpu_to_le_16(HNS3_CMD_FLAG_NEXT);
-   }
-   /* The last BD does not need a next flag */
-   hns3_cmd_setup_basic_desc(&desc[i], HNS3_OPC_DFX_BD_NUM, true);
-
-   ret = hns3_cmd_send(hw, 

[PATCH v8 8/8] net/hns3: support filter registers by module names

2024-09-24 Thread Jie Hai
This patch support dumping registers which module name is the
`filter` string. The module names are in lower case and so is
the `filter`. Available module names are cmdq, common_pf,
common_vf, ring, tqp_intr, 32_bit_dfx, 64_bit_dfx, bios, igu_egu,
ssu, ppp, rpu, ncsi, rtc, rcb, etc.

Signed-off-by: Jie Hai 
---
 doc/guides/nics/hns3.rst |   7 +
 drivers/net/hns3/hns3_regs.c | 257 ---
 2 files changed, 157 insertions(+), 107 deletions(-)

diff --git a/doc/guides/nics/hns3.rst b/doc/guides/nics/hns3.rst
index 97b4686bfabe..20765bd7d145 100644
--- a/doc/guides/nics/hns3.rst
+++ b/doc/guides/nics/hns3.rst
@@ -407,6 +407,13 @@ be provided to avoid scheduling the CPU core used by DPDK 
application threads fo
 other tasks. Before starting the Linux OS, add the kernel isolation boot 
parameter.
 For example, "isolcpus=1-18 nohz_full=1-18 rcu_nocbs=1-18".
 
+Dump registers
+--
+HNS3 supports dumping registers values with their names, and suppotrt filtering
+by module names. The available modules are ``bios``, ``ssu``, ``igu_egu``,
+``rpu``, ``ncsi``, ``rtc``, ``ppp``, ``rcb``, ``tqp``, ``rtc``, ``cmdq``,
+``common_pf``, ``common_vf``, ``ring``, ``tqp_intr``, ``32_bit_dfx``,
+``64_bit_dfx``, etc.
 
 Limitations or Known issues
 ---
diff --git a/drivers/net/hns3/hns3_regs.c b/drivers/net/hns3/hns3_regs.c
index a0d130302839..2e19add21265 100644
--- a/drivers/net/hns3/hns3_regs.c
+++ b/drivers/net/hns3/hns3_regs.c
@@ -12,7 +12,7 @@
 
 #define HNS3_64_BIT_REG_OUTPUT_SIZE (sizeof(uint64_t) / sizeof(uint32_t))
 
-static int hns3_get_dfx_reg_cnt(struct hns3_hw *hw, uint32_t *count);
+#define HNS3_MAX_MODULES_LEN 512
 
 struct hns3_dirt_reg_entry {
const char *name;
@@ -795,11 +795,39 @@ enum hns3_reg_modules {
HNS3_64_BIT_DFX,
 };
 
+#define HNS3_MODULE_MASK(x) RTE_BIT32(x)
+#define HNS3_VF_MODULES (HNS3_MODULE_MASK(HNS3_CMDQ) | 
HNS3_MODULE_MASK(HNS3_COMMON_VF) | \
+HNS3_MODULE_MASK(HNS3_RING) | 
HNS3_MODULE_MASK(HNS3_TQP_INTR))
+#define HNS3_VF_ONLY_MODULES HNS3_MODULE_MASK(HNS3_COMMON_VF)
+
 struct hns3_reg_list {
const void *reg_list;
uint32_t entry_num;
 };
 
+struct {
+   const char *name;
+   uint32_t module;
+} hns3_module_name_map[] = {
+   { "bios",   HNS3_MODULE_MASK(HNS3_BIOS_COMMON) },
+   { "ssu",HNS3_MODULE_MASK(HNS3_SSU_0) | 
HNS3_MODULE_MASK(HNS3_SSU_1) |
+   HNS3_MODULE_MASK(HNS3_SSU_2) },
+   { "igu_egu",HNS3_MODULE_MASK(HNS3_IGU_EGU) },
+   { "rpu",HNS3_MODULE_MASK(HNS3_RPU_0) | 
HNS3_MODULE_MASK(HNS3_RPU_1) },
+   { "ncsi",   HNS3_MODULE_MASK(HNS3_NCSI) },
+   { "rtc",HNS3_MODULE_MASK(HNS3_RTC) },
+   { "ppp",HNS3_MODULE_MASK(HNS3_PPP) },
+   { "rcb",HNS3_MODULE_MASK(HNS3_RCB) },
+   { "tqp",HNS3_MODULE_MASK(HNS3_TQP) },
+   { "cmdq",   HNS3_MODULE_MASK(HNS3_CMDQ) },
+   { "common_pf",  HNS3_MODULE_MASK(HNS3_COMMON_PF) },
+   { "common_vf",  HNS3_MODULE_MASK(HNS3_COMMON_VF) },
+   { "ring",   HNS3_MODULE_MASK(HNS3_RING) },
+   { "tqp_intr",   HNS3_MODULE_MASK(HNS3_TQP_INTR) },
+   { "32_bit_dfx", HNS3_MODULE_MASK(HNS3_32_BIT_DFX) },
+   { "64_bit_dfx", HNS3_MODULE_MASK(HNS3_64_BIT_DFX) },
+};
+
 static struct hns3_reg_list hns3_reg_lists[] = {
[HNS3_BIOS_COMMON]  = { dfx_bios_common_reg_list,   
RTE_DIM(dfx_bios_common_reg_list) },
[HNS3_SSU_0]= { dfx_ssu_reg_0_list, 
RTE_DIM(dfx_ssu_reg_0_list) },
@@ -863,21 +891,58 @@ hns3_get_regs_num(struct hns3_hw *hw, uint32_t 
*regs_num_32_bit,
return 0;
 }
 
-static int
-hns3_get_32_64_regs_cnt(struct hns3_hw *hw, uint32_t *count)
+static const char *
+hns3_get_name_by_module(enum hns3_reg_modules module)
 {
-   uint32_t regs_num_32_bit, regs_num_64_bit;
-   int ret;
+   size_t i;
 
-   ret = hns3_get_regs_num(hw, ®s_num_32_bit, ®s_num_64_bit);
-   if (ret) {
-   hns3_err(hw, "fail to get the number of registers, "
-"ret = %d.", ret);
-   return ret;
+   for (i = 0; i < RTE_DIM(hns3_module_name_map); i++) {
+   if (hns3_module_name_map[i].module && HNS3_MODULE_MASK(module) 
!= 0)
+   return hns3_module_name_map[i].name;
}
+   return "unknown";
+}
 
-   *count += regs_num_32_bit + regs_num_64_bit * 
HNS3_64_BIT_REG_OUTPUT_SIZE;
-   return 0;
+static void
+hns3_get_module_names(char *names, uint32_t len)
+{
+   size_t i;
+
+   for (i = 0; i < RTE_DIM(hns3_module_name_map); i++) {
+   strlcat(names, " ", len);
+   strlcat(names, hns3_module_name_map[i].name, len);
+   }
+}
+
+static uint32_t
+hns3_parse_modules_by_filter(struct hns3_hw *hw, const char *filter)
+{
+   struct hns3_adapter *hns = HNS3_DEV_HW_TO_ADAPTER(hw);
+   char names[HNS3_MAX_MODULES_LEN] = {0};
+

[PATCH v8 5/8] net/hns3: remove separators between register module

2024-09-24 Thread Jie Hai
Since the driver is going to support reporting names of
all registers, remove the counter and insert of separators
between different register modules.

Signed-off-by: Jie Hai 
Reviewed-by: Huisong Li 
Acked-by: Chengwen Feng 
---
 drivers/net/hns3/hns3_regs.c | 68 ++--
 1 file changed, 18 insertions(+), 50 deletions(-)

diff --git a/drivers/net/hns3/hns3_regs.c b/drivers/net/hns3/hns3_regs.c
index d9c546470dbe..c8e3fb118e4b 100644
--- a/drivers/net/hns3/hns3_regs.c
+++ b/drivers/net/hns3/hns3_regs.c
@@ -10,12 +10,9 @@
 #include "hns3_rxtx.h"
 #include "hns3_regs.h"
 
-#define MAX_SEPARATE_NUM   4
-#define SEPARATOR_VALUE0x
-#define REG_NUM_PER_LINE   4
-#define REG_LEN_PER_LINE   (REG_NUM_PER_LINE * sizeof(uint32_t))
+#define HNS3_64_BIT_REG_OUTPUT_SIZE (sizeof(uint64_t) / sizeof(uint32_t))
 
-static int hns3_get_dfx_reg_line(struct hns3_hw *hw, uint32_t *lines);
+static int hns3_get_dfx_reg_cnt(struct hns3_hw *hw, uint32_t *count);
 
 static const uint32_t cmdq_reg_addrs[] = {HNS3_CMDQ_TX_DEPTH_REG,
  HNS3_CMDQ_TX_TAIL_REG,
@@ -111,23 +108,21 @@ static int
 hns3_get_regs_length(struct hns3_hw *hw, uint32_t *length)
 {
struct hns3_adapter *hns = HNS3_DEV_HW_TO_ADAPTER(hw);
-   uint32_t cmdq_lines, common_lines, ring_lines, tqp_intr_lines;
uint32_t regs_num_32_bit, regs_num_64_bit;
-   uint32_t dfx_reg_lines;
+   uint32_t dfx_reg_cnt;
+   uint32_t common_cnt;
uint32_t len;
int ret;
 
-   cmdq_lines = sizeof(cmdq_reg_addrs) / REG_LEN_PER_LINE + 1;
if (hns->is_vf)
-   common_lines =
-   sizeof(common_vf_reg_addrs) / REG_LEN_PER_LINE + 1;
+   common_cnt = sizeof(common_vf_reg_addrs);
else
-   common_lines = sizeof(common_reg_addrs) / REG_LEN_PER_LINE + 1;
-   ring_lines = sizeof(ring_reg_addrs) / REG_LEN_PER_LINE + 1;
-   tqp_intr_lines = sizeof(tqp_intr_reg_addrs) / REG_LEN_PER_LINE + 1;
+   common_cnt = sizeof(common_reg_addrs);
 
-   len = (cmdq_lines + common_lines + ring_lines * hw->tqps_num +
- tqp_intr_lines * hw->intr_tqps_num) * REG_NUM_PER_LINE;
+   len = sizeof(cmdq_reg_addrs) + common_cnt +
+   sizeof(ring_reg_addrs) * hw->tqps_num +
+   sizeof(tqp_intr_reg_addrs) * hw->intr_tqps_num;
+   len /= sizeof(uint32_t);
 
if (!hns->is_vf) {
ret = hns3_get_regs_num(hw, ®s_num_32_bit, ®s_num_64_bit);
@@ -136,18 +131,16 @@ hns3_get_regs_length(struct hns3_hw *hw, uint32_t *length)
 "ret = %d.", ret);
return ret;
}
-   dfx_reg_lines = regs_num_32_bit * sizeof(uint32_t) /
-   REG_LEN_PER_LINE + 1;
-   dfx_reg_lines += regs_num_64_bit * sizeof(uint64_t) /
-   REG_LEN_PER_LINE + 1;
+   dfx_reg_cnt = regs_num_32_bit +
+ regs_num_64_bit * HNS3_64_BIT_REG_OUTPUT_SIZE;
 
-   ret = hns3_get_dfx_reg_line(hw, &dfx_reg_lines);
+   ret = hns3_get_dfx_reg_cnt(hw, &dfx_reg_cnt);
if (ret) {
hns3_err(hw, "fail to get the number of dfx registers, "
 "ret = %d.", ret);
return ret;
}
-   len += dfx_reg_lines * REG_NUM_PER_LINE;
+   len += dfx_reg_cnt;
}
 
*length = len;
@@ -268,18 +261,6 @@ hns3_get_64_bit_regs(struct hns3_hw *hw, uint32_t 
regs_num, void *data)
return 0;
 }
 
-static int
-hns3_insert_reg_separator(int reg_num, uint32_t *data)
-{
-   int separator_num;
-   int i;
-
-   separator_num = MAX_SEPARATE_NUM - reg_num % REG_NUM_PER_LINE;
-   for (i = 0; i < separator_num; i++)
-   *data++ = SEPARATOR_VALUE;
-   return separator_num;
-}
-
 static int
 hns3_direct_access_regs(struct hns3_hw *hw, uint32_t *data)
 {
@@ -294,7 +275,6 @@ hns3_direct_access_regs(struct hns3_hw *hw, uint32_t *data)
reg_num = sizeof(cmdq_reg_addrs) / sizeof(uint32_t);
for (i = 0; i < reg_num; i++)
*data++ = hns3_read_dev(hw, cmdq_reg_addrs[i]);
-   data += hns3_insert_reg_separator(reg_num, data);
 
if (hns->is_vf)
reg_num = sizeof(common_vf_reg_addrs) / sizeof(uint32_t);
@@ -305,7 +285,6 @@ hns3_direct_access_regs(struct hns3_hw *hw, uint32_t *data)
*data++ = hns3_read_dev(hw, common_vf_reg_addrs[i]);
else
*data++ = hns3_read_dev(hw, common_reg_addrs[i]);
-   data += hns3_insert_reg_separator(reg_num, data);
 
reg_num = sizeof(ring_reg_addrs) / sizeof(uint32_t);
for (j = 0; j < hw->tqps_num; j++) {
@@ -313,7 +292,6 @@ hns3_direct_access_regs(struct hns3_hw *hw, uint32

[PATCH v8 2/8] ethdev: add telemetry cmd for registers

2024-09-24 Thread Jie Hai
This patch adds a telemetry command for registers dump,
and supports obtaining the registers of a specified module.

In one way, the number of registers that can be exported
is limited by the number of elements carried by dict and
container. In another way, the length of the string
exported by telemetry is limited by MAX_OUTPUT_LEN.
Therefore, when the number of registers to be exported
exceeds, some information will be lost. Warn on the former
case.

An example usage is shown below:
--> /ethdev/regs,0,ring
{
  "/ethdev/regs": {
"registers_length": 318,
"registers_width": 4,
"register_offset": "0x0",
"version": "0x1140011",
"group_0": {
  "Q0_ring_rx_bd_num": "0x0",
  "Q0_ring_rx_bd_len": "0x0",
  ...
  },
"group_1": {
...
},
...
}

Signed-off-by: Jie Hai 
Reviewed-by: Ferruh Yigit 
---
 lib/ethdev/rte_ethdev_telemetry.c | 130 ++
 1 file changed, 130 insertions(+)

diff --git a/lib/ethdev/rte_ethdev_telemetry.c 
b/lib/ethdev/rte_ethdev_telemetry.c
index 6b873e7abe68..181390691099 100644
--- a/lib/ethdev/rte_ethdev_telemetry.c
+++ b/lib/ethdev/rte_ethdev_telemetry.c
@@ -1395,6 +1395,134 @@ eth_dev_handle_port_tm_node_caps(const char *cmd 
__rte_unused,
return ret;
 }
 
+static void
+eth_dev_add_reg_data(struct rte_tel_data *d, struct rte_dev_reg_info *reg_info,
+uint32_t idx)
+{
+   if (reg_info->width == sizeof(uint32_t))
+   rte_tel_data_add_dict_uint_hex(d, reg_info->names[idx].name,
+   *((uint32_t *)reg_info->data + idx), 0);
+   else
+   rte_tel_data_add_dict_uint_hex(d, reg_info->names[idx].name,
+   *((uint64_t *)reg_info->data + idx), 0);
+}
+
+static int
+eth_dev_store_regs(struct rte_tel_data *d, struct rte_dev_reg_info *reg_info)
+{
+   struct rte_tel_data *groups[RTE_TEL_MAX_DICT_ENTRIES];
+   char group_name[RTE_TEL_MAX_STRING_LEN] = {0};
+   struct rte_tel_data *group = NULL;
+   uint32_t grp_num = 0;
+   uint32_t i, max_cap;
+   int ret;
+
+   rte_tel_data_start_dict(d);
+   rte_tel_data_add_dict_uint(d, "register_length", reg_info->length);
+   rte_tel_data_add_dict_uint(d, "register_width", reg_info->width);
+   rte_tel_data_add_dict_uint_hex(d, "register_offset", reg_info->offset, 
0);
+   rte_tel_data_add_dict_uint_hex(d, "version", reg_info->version, 0);
+
+   max_cap = (RTE_TEL_MAX_DICT_ENTRIES - 4) * RTE_TEL_MAX_DICT_ENTRIES;
+   if (reg_info->length > max_cap) {
+   RTE_ETHDEV_LOG_LINE(WARNING,
+   "Registers to be displayed are reduced from %u to %u 
due to limited capacity",
+   reg_info->length, max_cap);
+   reg_info->length = max_cap;
+   }
+
+   for (i = 0; i < reg_info->length; i++) {
+   if (i % RTE_TEL_MAX_DICT_ENTRIES != 0) {
+   eth_dev_add_reg_data(group, reg_info, i);
+   continue;
+   }
+
+   group = rte_tel_data_alloc();
+   if (group == NULL) {
+   ret = -ENOMEM;
+   RTE_ETHDEV_LOG_LINE(WARNING, "No enough memory for 
group data");
+   goto out;
+   }
+   groups[grp_num++] = group;
+   rte_tel_data_start_dict(group);
+   eth_dev_add_reg_data(group, reg_info, i);
+   }
+
+   for (i = 0; i < grp_num; i++) {
+   snprintf(group_name, RTE_TEL_MAX_STRING_LEN, "group_%u", i);
+   rte_tel_data_add_dict_container(d, group_name, groups[i], 0);
+   }
+   return 0;
+out:
+   for (i = 0; i < grp_num; i++)
+   rte_tel_data_free(groups[i]);
+
+   return ret;
+}
+
+static int
+eth_dev_get_port_regs(int port_id, struct rte_tel_data *d, char *filter)
+{
+   struct rte_dev_reg_info reg_info;
+   int ret;
+
+   memset(®_info, 0, sizeof(reg_info));
+   reg_info.filter = filter;
+
+   ret = rte_eth_dev_get_reg_info_ext(port_id, ®_info);
+   if (ret != 0) {
+   RTE_ETHDEV_LOG_LINE(ERR, "Failed to get device reg info: %d", 
ret);
+   return ret;
+   }
+
+   reg_info.data = calloc(reg_info.length, reg_info.width);
+   if (reg_info.data == NULL) {
+   RTE_ETHDEV_LOG_LINE(ERR, "Failed to allocate memory for 
reg_info.data");
+   return -ENOMEM;
+   }
+
+   reg_info.names = calloc(reg_info.length, sizeof(struct 
rte_eth_reg_name));
+   if (reg_info.names == NULL) {
+   RTE_ETHDEV_LOG_LINE(ERR, "Failed to allocate memory for 
reg_info.names");
+   free(reg_info.data);
+   return -ENOMEM;
+   }
+
+   ret = rte_eth_dev_get_reg_info_ext(port_id, ®_info);
+   if (ret != 0) {
+   RTE_ETHDEV_LOG_LINE(ERR, "Failed to get device reg info: %d", 
ret);
+   ret = -EINVAL;
+   g

[PATCH v8 4/8] net/hns3: fix dump counter of registers

2024-09-24 Thread Jie Hai
Since the driver dumps the queue interrupt registers according
to the intr_tqps_num, the counter should be the same.

Fixes: acb3260fac5c ("net/hns3: fix dump register out of range")
Fixes: 936eda25e8da ("net/hns3: support dump register")
Cc: sta...@dpdk.org

Signed-off-by: Jie Hai 
Acked-by: Huisong Li 
Acked-by: Chengwen Feng 
---
 drivers/net/hns3/hns3_regs.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/drivers/net/hns3/hns3_regs.c b/drivers/net/hns3/hns3_regs.c
index 53d829a4fc68..d9c546470dbe 100644
--- a/drivers/net/hns3/hns3_regs.c
+++ b/drivers/net/hns3/hns3_regs.c
@@ -127,7 +127,7 @@ hns3_get_regs_length(struct hns3_hw *hw, uint32_t *length)
tqp_intr_lines = sizeof(tqp_intr_reg_addrs) / REG_LEN_PER_LINE + 1;
 
len = (cmdq_lines + common_lines + ring_lines * hw->tqps_num +
- tqp_intr_lines * hw->num_msi) * REG_NUM_PER_LINE;
+ tqp_intr_lines * hw->intr_tqps_num) * REG_NUM_PER_LINE;
 
if (!hns->is_vf) {
ret = hns3_get_regs_num(hw, ®s_num_32_bit, ®s_num_64_bit);
-- 
2.33.0



[PATCH v8 1/8] ethdev: support report register names and filter

2024-09-24 Thread Jie Hai
This patch adds "filter" and "names" fields to "rte_dev_reg_info"
structure. Names of registers in data fields can be reported and
the registers can be filtered by their module names.

The new API rte_eth_dev_get_reg_info_ext() is added to support
reporting names and filtering by modules. And the original API
rte_eth_dev_get_reg_info() does not use the names and filter fields.
A local variable is used in rte_eth_dev_get_reg_info for
compatibility. If the drivers does not report the names, set them
to "index_XXX", which means the location in the register table.

Signed-off-by: Jie Hai 
Acked-by: Huisong Li 
Acked-by: Chengwen Feng 
Reviewed-by: Ferruh Yigit 
---
 doc/guides/rel_notes/release_24_11.rst |  8 ++
 lib/ethdev/ethdev_trace.h  |  2 ++
 lib/ethdev/rte_dev_info.h  | 11 
 lib/ethdev/rte_ethdev.c| 38 ++
 lib/ethdev/rte_ethdev.h| 29 
 lib/ethdev/version.map |  1 +
 6 files changed, 89 insertions(+)

diff --git a/doc/guides/rel_notes/release_24_11.rst 
b/doc/guides/rel_notes/release_24_11.rst
index 5dd575fb39df..85960e71d0ca 100644
--- a/doc/guides/rel_notes/release_24_11.rst
+++ b/doc/guides/rel_notes/release_24_11.rst
@@ -60,6 +60,11 @@ New Features
   * Added SR-IOV VF support.
   * Added recent 1400/14000 and 15000 models to the supported list.
 
+* **Added support for dumping registers with names and filtering by modules.**
+
+  Added new API functions ``rte_eth_dev_get_reg_info_ext()`` to filter the
+  registers by module names and get the information (names, values and other
+  attributes) of the filtered registers.
 
 Removed Items
 -
@@ -108,6 +113,9 @@ ABI Changes
Also, make sure to start the actual text at the margin.
===
 
+* ethdev: Added ``filter`` and ``names`` fields to ``rte_dev_reg_info``
+  structure for filtering by modules and reporting names of registers.
+
 
 Known Issues
 
diff --git a/lib/ethdev/ethdev_trace.h b/lib/ethdev/ethdev_trace.h
index 3bec87bfdb70..0c4780a09ef5 100644
--- a/lib/ethdev/ethdev_trace.h
+++ b/lib/ethdev/ethdev_trace.h
@@ -1152,6 +1152,8 @@ RTE_TRACE_POINT(
rte_trace_point_emit_u32(info->length);
rte_trace_point_emit_u32(info->width);
rte_trace_point_emit_u32(info->version);
+   rte_trace_point_emit_ptr(info->names);
+   rte_trace_point_emit_ptr(info->filter);
rte_trace_point_emit_int(ret);
 )
 
diff --git a/lib/ethdev/rte_dev_info.h b/lib/ethdev/rte_dev_info.h
index 67cf0ae52668..26b777f9836e 100644
--- a/lib/ethdev/rte_dev_info.h
+++ b/lib/ethdev/rte_dev_info.h
@@ -11,6 +11,11 @@ extern "C" {
 
 #include 
 
+#define RTE_ETH_REG_NAME_SIZE 64
+struct rte_eth_reg_name {
+   char name[RTE_ETH_REG_NAME_SIZE];
+};
+
 /*
  * Placeholder for accessing device registers
  */
@@ -20,6 +25,12 @@ struct rte_dev_reg_info {
uint32_t length; /**< Number of registers to fetch */
uint32_t width; /**< Size of device register */
uint32_t version; /**< Device version */
+   /**
+* Name of target module, filter for target subset of registers.
+* This field could affects register selection for data/length/names.
+*/
+   const char *filter;
+   struct rte_eth_reg_name *names; /**< Registers name saver */
 };
 
 /*
diff --git a/lib/ethdev/rte_ethdev.c b/lib/ethdev/rte_ethdev.c
index f1c658f49e80..30ca4a0043c5 100644
--- a/lib/ethdev/rte_ethdev.c
+++ b/lib/ethdev/rte_ethdev.c
@@ -6388,8 +6388,37 @@ rte_eth_read_clock(uint16_t port_id, uint64_t *clock)
 
 int
 rte_eth_dev_get_reg_info(uint16_t port_id, struct rte_dev_reg_info *info)
+{
+   struct rte_dev_reg_info reg_info = { 0 };
+   int ret;
+
+   if (info == NULL) {
+   RTE_ETHDEV_LOG_LINE(ERR,
+   "Cannot get ethdev port %u register info to NULL",
+   port_id);
+   return -EINVAL;
+   }
+
+   reg_info.length = info->length;
+   reg_info.data = info->data;
+
+   ret = rte_eth_dev_get_reg_info_ext(port_id, ®_info);
+   if (ret != 0)
+   return ret;
+
+   info->length = reg_info.length;
+   info->width = reg_info.width;
+   info->version = reg_info.version;
+   info->offset = reg_info.offset;
+
+   return 0;
+}
+
+int
+rte_eth_dev_get_reg_info_ext(uint16_t port_id, struct rte_dev_reg_info *info)
 {
struct rte_eth_dev *dev;
+   uint32_t i;
int ret;
 
RTE_ETH_VALID_PORTID_OR_ERR_RET(port_id, -ENODEV);
@@ -6402,12 +6431,21 @@ rte_eth_dev_get_reg_info(uint16_t port_id, struct 
rte_dev_reg_info *info)
return -EINVAL;
}
 
+   if (info->names != NULL && info->length != 0)
+   memset(info->names, 0, sizeof(struct rte_eth_reg_name) * 
info->length);
+
if (*dev->dev_ops->get_reg == NULL)
return -ENOTSUP;
ret

Re: [PATCH v3 1/2] dts: add symbolic link to dpdk-devbind script

2024-09-24 Thread Juraj Linkeš




On 24. 9. 2024 18:28, jspew...@iol.unh.edu wrote:

From: Jeremy Spewock 

The devbind script is used throughout DTS to manage drivers on the
remote hosts. Currently, the only way to copy this script onto a host
is to either copy the entire DPDK directory onto a host, or reach out
of the dts directory into its parent DPDK directory to access the
script. The first is undesirable if the host doesn't require any other
DPDK tools since you would be copying extra unneeded information and
the second is undesirable since it enforces the assumption that DTS is
being run from within the DPDK directory. To solve this issue a symbolic
link is added which links the devbind script from the parent into the
DTS directory.

Since this file is not part of DTS and therefore is not expected to
follow DTS formatting rules, it is excluded from the DTS formatting
script.

Signed-off-by: Jeremy Spewock 
---


Reviewed-by: Juraj Linkeš 



Re: [PATCH v2 1/1] dts: add binding to different drivers to TG node

2024-09-24 Thread Juraj Linkeš

I have some thoughts for the future:
1a. The traffic generator is specified per-node, so maybe we could also 
change the binding to be for the whole lifetime of the TG node,
1b. But the same is true for the SUT node as well, right? After we do 
the port update (with kernel driver), we can just bind to DPDK driver. 
With SUT in the mix, this looks like a change for a different patch,
2. We could add a symlink to the devbind script with the target being in 
the dts directory. This way we don't have to go outside the dts 
directory and if DTS ever become a python package, we could just copy 
the script to the appropriate place. This is also something we don't 
really need to do.


And also two minor comments. A lot of suggestions for separate patches 
overall. :-)


On 19. 9. 2024 20:16, jspew...@iol.unh.edu wrote:

From: Jeremy Spewock 

The DTS framework in its current state supports binding ports to
different drivers on the SUT node but not the TG node. The TG node
already has the information that it needs about the different drivers
that it has available in the configuration file, but it did not
previously have access to the devbind script, so it did not use that
information for anything.

This patch moves the location of the tmp directory as well as the method
for binding ports into the node class rather than the SUT node class and
adds an abstract method for getting the path to the devbind script into
the node class. Then, binding ports to the correct drivers is moved into
the build target setup and run on both nodes.

Bugzilla ID: 1420

Signed-off-by: Jeremy Spewock 
---


With the two minor comments,
Reviewed-by: Juraj Linkeš 


diff --git a/dts/framework/testbed_model/node.py 
b/dts/framework/testbed_model/node.py



@@ -58,8 +65,10 @@ class Node(ABC):
  lcores: list[LogicalCore]
  ports: list[Port]
  _logger: DTSLogger
+_remote_tmp_dir: PurePath
  _other_sessions: list[OSSession]
  _test_run_config: TestRunConfiguration
+_path_to_devbind_script: PurePath | None


A note on the naming. We have _remote_tmp_dir and 
_path_to_devbind_script. Both are pointing to a remote file/dir, but 
only one has the _remote prefix. They should probably be unified.


I've thought a bit about what the right name is. Dropping the prefix 
makes sense; sut_node.tmp_dir should mean the tmp dir on the SUT node 
(which would make it remote from the execution host's point of view, but 
not the node's view; the file is local to SUT node). This could be a 
good separate patch (improving the remote/local naming scheme to make it 
consistent and as sensible as possible).




diff --git a/dts/framework/utils.py b/dts/framework/utils.py



@@ -29,6 +29,8 @@
  from .exception import ConfigurationError, InternalError
  
  REGEX_FOR_PCI_ADDRESS: str = "/[0-9a-fA-F]{4}:[0-9a-fA-F]{2}:[0-9a-fA-F]{2}.[0-9]{1}/"

+#: Path to DPDK directory on the host where DTS is being run.
+LOCAL_DPDK_DIR: PurePath = PurePath(__file__).parents[2]


Local paths can be just pathlib.Path. PurePaths are for path 
manipulations only (useful for remote paths in RemoteSessions, 
OSSessions and Nodes), but for local existing paths we should use Path.


The OSSession and subclasses need a bit of an update in this regard - 
use Path for local paths and PurePaths for remote ones. We added this 
into our pre-built DPDK patch.


  
  
  def expand_range(range_str: str) -> list[int]:




[PATCH v4 00/11] dts: add test skipping based on capabilities

2024-09-24 Thread Juraj Linkeš
Add an automated way to gather available capabilities of the tested
hardware and skip test suites or cases which require capabilities that
are not available.

This is done through two decorators:
1. The first marks a test suite method as test case. This populates the
   default attributes of each test case.
2. The seconds adds the required capabilities to a test suite or case,
   using the attributes from 1).

Two types of capabilities are added:
1. NIC capabilities. These are gathered once DPDK has been built because
   we use testpmd for this. It's possible to add a function that will
   add configuration before assessing capabilities associated with the
   function. This is because some capabilities return different status
   with different configuration present.
2. The topology capability. Each test case is marked as requiring a
   default topology. The required topology of a test case (or a whole
   test suite) may be change with the second decorator.

This is how it all works:
1. The required capabilities are first all gathered from all test suites
   and test cases.
2. The list of required capabilities is divided into supported and
   unsupported capabilities. In this step, the probing of hardware
   and/or anything else that needs to happen to gauge whether a
   capability is supported is done.
3. Each test suite and test case is then marked to be skipped if any of
   their required capabilities are not supported.

v4:
Rebased on next-dts.
Adjusted the decorator for setting and reverting 9000 MTU to be
parametrized.
Added references to DPDK libraries and testpmd code to testpmd command
parsing methods.
Updated capability docstrings. Moved the capability decorator to
NicCapability.

Juraj Linkeš (11):
  dts: add the aenum dependency
  dts: add test case decorators
  dts: add mechanism to skip test cases or suites
  dts: add support for simpler topologies
  dts: add basic capability support
  dts: add NIC capability support
  dts: add NIC capabilities from show rxq info
  dts: add topology capability
  doc: add DTS capability doc sources
  dts: add Rx offload capabilities
  dts: add NIC capabilities from show port info

 .../framework.testbed_model.capability.rst|   6 +
 doc/api/dts/framework.testbed_model.rst   |   2 +
 .../dts/framework.testbed_model.topology.rst  |   6 +
 dts/framework/remote_session/testpmd_shell.py | 461 +++-
 dts/framework/runner.py   | 155 +++---
 dts/framework/test_result.py  | 119 ++--
 dts/framework/test_suite.py   | 161 +-
 dts/framework/testbed_model/capability.py | 507 ++
 dts/framework/testbed_model/node.py   |   2 +-
 dts/framework/testbed_model/port.py   |   4 +-
 dts/framework/testbed_model/topology.py   | 127 +
 dts/poetry.lock   |  14 +-
 dts/pyproject.toml|   1 +
 dts/tests/TestSuite_hello_world.py|  10 +-
 dts/tests/TestSuite_os_udp.py |   3 +-
 dts/tests/TestSuite_pmd_buffer_scatter.py |  14 +-
 dts/tests/TestSuite_smoke_tests.py|   8 +-
 17 files changed, 1447 insertions(+), 153 deletions(-)
 create mode 100644 doc/api/dts/framework.testbed_model.capability.rst
 create mode 100644 doc/api/dts/framework.testbed_model.topology.rst
 create mode 100644 dts/framework/testbed_model/capability.py
 create mode 100644 dts/framework/testbed_model/topology.py

-- 
2.43.0