Re: [net-next V2 08/15] net/mlx5e: Add TX PTP port object support

2020-12-06 Thread Eran Ben Elisha
On 12/4/2020 11:57 PM, Saeed Mahameed wrote: We only forward ptp traffic to the new special queue but we create more than one to avoid internal locking as we will utilize the tx softirq percpu. After double checking the code it seems Eran and Tariq have decided to forward all UDP traffic, let

Re: [net-next V2 08/15] net/mlx5e: Add TX PTP port object support

2020-12-06 Thread Eran Ben Elisha
On 12/5/2020 1:17 AM, Jakub Kicinski wrote: We only forward ptp traffic to the new special queue but we create more than one to avoid internal locking as we will utilize the tx softirq percpu. In other words to make the driver implementation simpler we'll have a pretty basic feature hidden be

Re: [net-next V2 08/15] net/mlx5e: Add TX PTP port object support

2020-12-06 Thread Eran Ben Elisha
On 12/5/2020 2:24 AM, Jakub Kicinski wrote: On Fri, 04 Dec 2020 15:57:36 -0800 Saeed Mahameed wrote: On Fri, 2020-12-04 at 15:17 -0800, Jakub Kicinski wrote: On Fri, 04 Dec 2020 13:57:49 -0800 Saeed Mahameed wrote: option 2) route PTP traffic to a special SQs per ring, this SQ will be PTP p

Re: [net-next V2 08/15] net/mlx5e: Add TX PTP port object support

2020-12-07 Thread Eran Ben Elisha
On 12/7/2020 10:37 AM, Saeed Mahameed wrote: On Sun, 2020-12-06 at 09:08 -0800, Richard Cochran wrote: On Sun, Dec 06, 2020 at 03:37:47PM +0200, Eran Ben Elisha wrote: Adding a new enum to the ioctl means we have to add (HWTSTAMP_TX_ON_TIME_CRITICAL_ONLY for example) all the way - drivers

[PATCH net-next 0/2] Dissect PTP L2 packet header

2021-01-07 Thread Eran Ben Elisha
Hi Jakub, Dave, This series adds support for dissecting PTP L2 packet header (EtherType 0x88F7). For packet header dissecting, skb->protocol is needed. Add protocol parsing operation to vlan ops, to guarantee skb->protocol is set, as EtherType 0x88F7 occasionally follows a vlan header. Er

[PATCH net-next 1/2] net: vlan: Add parse protocol header ops

2021-01-07 Thread Eran Ben Elisha
Add parse protocol header ops for vlan device. Before this patch, vlan tagged packet transmitted by af_packet had skb->protocol unset. Some kernel methods (like __skb_flow_dissect()) rely on this missing information for its packet processing. Signed-off-by: Eran Ben Elisha Reviewed-by: Ta

[PATCH net-next 2/2] net: flow_dissector: Parse PTP L2 packet header

2021-01-07 Thread Eran Ben Elisha
Add support for parsing PTP L2 packet header. Such packet consists of an L2 header (with ethertype of ETH_P_1588), PTP header, body and an optional suffix. Signed-off-by: Eran Ben Elisha Reviewed-by: Tariq Toukan --- net/core/flow_dissector.c | 16 1 file changed, 16
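
For illustration, locating the PTP header in an L2 frame might look like the user-space sketch below. It is not the code from the patch: the helper name `ptp_l2_offset` and the macro names are assumptions made for this example, and it handles at most one 802.1Q tag, matching the cover letter's remark that EtherType 0x88F7 occasionally follows a vlan header.

```c
#include <stddef.h>
#include <stdint.h>

#define ETHERTYPE_8021Q 0x8100u /* VLAN tag (802.1Q) */
#define ETHERTYPE_1588  0x88F7u /* PTP over Ethernet (ETH_P_1588 in the kernel) */

/* Read a big-endian 16-bit value from a byte buffer. */
static uint16_t read_be16(const uint8_t *p)
{
    return (uint16_t)((p[0] << 8) | p[1]);
}

/*
 * Hypothetical helper: return the byte offset of the PTP header inside an
 * Ethernet frame, or -1 if the frame is too short or is not PTP over L2.
 */
static int ptp_l2_offset(const uint8_t *frame, size_t len)
{
    size_t off = 12; /* skip destination and source MAC addresses */
    uint16_t ethertype;

    if (len < off + 2)
        return -1;
    ethertype = read_be16(frame + off);
    off += 2;

    if (ethertype == ETHERTYPE_8021Q) {
        /* 2 bytes of TCI, then the encapsulated EtherType */
        if (len < off + 4)
            return -1;
        ethertype = read_be16(frame + off + 2);
        off += 4;
    }

    return ethertype == ETHERTYPE_1588 ? (int)off : -1;
}
```

The sketch only finds where the PTP header would start; it does not validate the PTP header, body or suffix themselves.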

[PATCH net-next v2 1/2] net: vlan: Add parse protocol header ops

2021-01-11 Thread Eran Ben Elisha
Add parse protocol header ops for vlan device. Before this patch, vlan tagged packet transmitted by af_packet had skb->protocol unset. Some kernel methods (like __skb_flow_dissect()) rely on this missing information for its packet processing. Signed-off-by: Eran Ben Elisha Reviewed-by: Ta

[PATCH net-next v2 0/2] Dissect PTP L2 packet header

2021-01-11 Thread Eran Ben Elisha
header. Changelog: v2: - Add more people to CC list. Eran Ben Elisha (2): net: vlan: Add parse protocol header ops net: flow_dissector: Parse PTP L2 packet header net/8021q/vlan_dev.c | 9 + net/core/flow_dissector.c | 16 2 files changed, 25 insertions(+) -- 2.17.1

[PATCH net-next v2 2/2] net: flow_dissector: Parse PTP L2 packet header

2021-01-11 Thread Eran Ben Elisha
Add support for parsing PTP L2 packet header. Such packet consists of an L2 header (with ethertype of ETH_P_1588), PTP header, body and an optional suffix. Signed-off-by: Eran Ben Elisha Reviewed-by: Tariq Toukan --- net/core/flow_dissector.c | 16 1 file changed, 16

Re: [PATCH net-next v2 0/2] Dissect PTP L2 packet header

2021-01-11 Thread Eran Ben Elisha
On 1/11/2021 5:49 PM, Andrew Lunn wrote: On Mon, Jan 11, 2021 at 11:46:50AM +0200, Eran Ben Elisha wrote: Hi Jakub, Dave, This series adds support for dissecting PTP L2 packet header (EtherType 0x88F7). For packet header dissecting, skb->protocol is needed. Add protocol parsing operat

[PATCH net-next v3 0/2] Dissect PTP L2 packet header

2021-01-11 Thread Eran Ben Elisha
header. Changelog: v2, v3: - Add more people to the CC list. Eran Ben Elisha (2): net: vlan: Add parse protocol header ops net: flow_dissector: Parse PTP L2 packet header net/8021q/vlan_dev.c | 9 + net/core/flow_dissector.c | 16 2 files changed, 25 insertions(+) -- 2.17.1

[PATCH net-next v3 1/2] net: vlan: Add parse protocol header ops

2021-01-11 Thread Eran Ben Elisha
Add parse protocol header ops for vlan device. Before this patch, vlan tagged packet transmitted by af_packet had skb->protocol unset. Some kernel methods (like __skb_flow_dissect()) rely on this missing information for its packet processing. Signed-off-by: Eran Ben Elisha Reviewed-by: Ta

[PATCH net-next v3 2/2] net: flow_dissector: Parse PTP L2 packet header

2021-01-11 Thread Eran Ben Elisha
Add support for parsing PTP L2 packet header. Such packet consists of an L2 header (with ethertype of ETH_P_1588), PTP header, body and an optional suffix. Signed-off-by: Eran Ben Elisha Reviewed-by: Tariq Toukan --- net/core/flow_dissector.c | 16 1 file changed, 16

Re: [PATCH net-next v3 2/2] net: flow_dissector: Parse PTP L2 packet header

2021-01-12 Thread Eran Ben Elisha
On 1/12/2021 3:49 PM, Richard Cochran wrote: On Mon, Jan 11, 2021 at 08:17:48PM +0200, Eran Ben Elisha wrote: Add support for parsing PTP L2 packet header. Such packet consists of an L2 header (with ethertype of ETH_P_1588), PTP header, body and an optional suffix. Signed-off-by: Eran Ben

[PATCH net-next v4 2/2] net: flow_dissector: Parse PTP L2 packet header

2021-01-12 Thread Eran Ben Elisha
Add support for parsing PTP L2 packet header. Such packet consists of an L2 header (with ethertype of ETH_P_1588), PTP header, body and an optional suffix. Signed-off-by: Eran Ben Elisha Reviewed-by: Tariq Toukan --- net/core/flow_dissector.c | 16 1 file changed, 16

[PATCH net-next v4 0/2] Dissect PTP L2 packet header

2021-01-12 Thread Eran Ben Elisha
header. Changelog: v4: - Drop a redundant length check when fetching ptp header from skb. v2, v3: - Add more people to the CC list. Eran Ben Elisha (2): net: vlan: Add parse protocol header ops net: flow_dissector: Parse PTP L2 packet header net/8021q/vlan_dev.c | 9 + net/core/flow_disse

[PATCH net-next v4 1/2] net: vlan: Add parse protocol header ops

2021-01-12 Thread Eran Ben Elisha
Add parse protocol header ops for vlan device. Before this patch, vlan tagged packet transmitted by af_packet had skb->protocol unset. Some kernel methods (like __skb_flow_dissect()) rely on this missing information for its packet processing. Signed-off-by: Eran Ben Elisha Reviewed-by: Ta

Re: [PATCH mellanox-tree] net/mlx5: prevent an integer underflow in mlx5_perout_configure()

2021-03-01 Thread Eran Ben Elisha
On 2/19/2021 11:57 AM, Dan Carpenter wrote: The value of "sec" comes from the user. Negative values will lead to shift wrapping inside the perout_conf_real_time() function and trigger a UBSan warning. Add a check and return -EINVAL to prevent that from happening. Fixes: 432119de33d9 ("net/
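
The kind of check described in this message can be sketched roughly as follows. This is an illustration rather than the patch itself: `pack_real_time` is a hypothetical stand-in, and the 32-bit shift is an assumption about how the seconds value is packed.

```c
#include <errno.h>
#include <stdint.h>

/*
 * Hypothetical stand-in: pack seconds into the upper 32 bits and
 * nanoseconds into the lower 32 bits of a 64-bit value.
 * Left-shifting a negative signed value is undefined behaviour in C,
 * so the caller-supplied value is rejected before any shift happens.
 */
static int pack_real_time(int64_t sec, uint32_t nsec, uint64_t *out)
{
    if (sec < 0)
        return -EINVAL; /* user-supplied value is out of range */

    *out = ((uint64_t)sec << 32) | nsec;
    return 0;
}
```

Validating at the point where the user value enters keeps the packing helper itself free of range checks.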

Re: net/mlx5: Maintain separate page trees for ECPF and PF functions

2021-01-29 Thread Eran Ben Elisha
On 1/29/2021 2:18 PM, Colin Ian King wrote: Hi, Static analysis with Coverity has detected an issue with the following commit: commit 0aa128475d33d2d0095947eeab6b3e4d22dbd578 Author: Daniel Jurgens Date: Fri Jan 22 23:13:53 2021 +0200 net/mlx5: Maintain separate page trees for ECPF

Re: [RFC] devlink: health: add remediation type

2021-03-07 Thread Eran Ben Elisha
On 3/6/2021 4:42 AM, Jakub Kicinski wrote: Currently devlink health does not give user any clear information of what kind of remediation ->recover callback will perform. This makes it difficult to understand the impact of enabling auto-remediation, and the severity of the error itself. To a

Re: [RFC] devlink: health: add remediation type

2021-03-09 Thread Eran Ben Elisha
On 3/8/2021 7:59 PM, Jakub Kicinski wrote: On Mon, 8 Mar 2021 09:16:00 -0800 Jakub Kicinski wrote: + DLH_REMEDY_BAD_PART, BAD_PART probably indicates that the reporter (or any command line execution) cannot recover the issue. As the suggested remedy is static per reporter's recover met

Re: [RFC] devlink: health: add remediation type

2021-03-09 Thread Eran Ben Elisha
On 3/8/2021 7:16 PM, Jakub Kicinski wrote: On Sun, 7 Mar 2021 17:59:58 +0200 Eran Ben Elisha wrote: On 3/6/2021 4:42 AM, Jakub Kicinski wrote: Currently devlink health does not give user any clear information of what kind of remediation ->recover callback will perform. This makes

Re: [RFC net-next v2 3/3] devlink: add more failure modes

2021-03-11 Thread Eran Ben Elisha
On 3/11/2021 5:26 AM, Jakub Kicinski wrote: Pending vendors adding the right reporters. << Would you like Nvidia to reply with the remedy per reporter or to actually prepare the patch? Extend the applicability of devlink health reporters beyond what can be locally remedied. Add failure

Re: [RFC net-next v2 2/3] devlink: health: add remediation type

2021-03-11 Thread Eran Ben Elisha
On 3/11/2021 5:26 AM, Jakub Kicinski wrote: Currently devlink health does not give user any clear information of what kind of remediation ->recover callback will perform. This makes it difficult to understand the impact of enabling auto-remediation, and the severity of the error itself. To

Re: [RFC net-next v2 3/3] devlink: add more failure modes

2021-03-14 Thread Eran Ben Elisha
On 3/11/2021 6:49 PM, Jakub Kicinski wrote: On Thu, 11 Mar 2021 16:23:09 +0200 Eran Ben Elisha wrote: On 3/11/2021 5:26 AM, Jakub Kicinski wrote: Pending vendors adding the right reporters. << Would you like Nvidia to reply with the remedy per reporter or to actually prepare the

Re: [PATCH net-next,v5, 0/6] Add software backchannel and mlx5e HV VHCA stats

2019-08-22 Thread Eran Ben Elisha
On 8/23/2019 1:43 AM, Haiyang Zhang wrote: > > >> -Original Message- >> From: David Miller >> Sent: Thursday, August 22, 2019 3:39 PM >> To: Haiyang Zhang >> Cc: sas...@kernel.org; sae...@mellanox.com; l...@kernel.org; >> era...@mellanox.com; lorenzo.pieral...@arm.com; bhelg...@google

[PATCH net-next] net/mlx5: Fix return code in case of hyperv wrong size read

2019-08-23 Thread Eran Ben Elisha
"net/mlx5: Add wrappers for HyperV PCIe operations") Reported-by: Leon Romanovsky Signed-off-by: Eran Ben Elisha --- drivers/net/ethernet/mellanox/mlx5/core/lib/hv.c | 6 +++--- 1 file changed, 3 insertions(+), 3 deletions(-) diff --git a/drivers/net/ethernet/mellanox/mlx5/core/lib/hv.c b/d

[PATCH net-next 1/2] devlink: Rename devlink health attributes

2019-02-21 Thread Eran Ben Elisha
From: Aya Levin Rename devlink health attributes to better reflect the attributes' use. Add COUNT prefix on error counter attribute and recovery counter attribute. Fixes: 7afe335a8bed ("devlink: Add health get command") Signed-off-by: Aya Levin Signed-off-by: Eran Ben Elisha Acke

[PATCH net-next 0/2] Devlink health fixes series

2019-02-21 Thread Eran Ben Elisha
This series includes two small fixes from Aya for the devlink health infrastructure introduced earlier in this window. The first patch renames some UAPI attributes to better reflect their use. The second patch reduces the amount of data passed from the devlink to the netlink layer upon get reporter command

[PATCH net-next 2/2] devlink: Modify reply of DEVLINK_CMD_HEALTH_REPORTER_GET

2019-02-21 Thread Eran Ben Elisha
gured on a reporter that did not provide a recover operation, thus not needed to return them. Fixes: 7afe335a8bed ("devlink: Add health get command") Signed-off-by: Aya Levin Signed-off-by: Eran Ben Elisha Acked-by: Jiri Pirko --- net/core/devlink.c | 6 -- 1 file changed

[PATCH net-next 3/3] net/mlx5e: Update tx reporter status in case channels were successfully opened

2019-03-03 Thread Eran Ben Elisha
are opened After that flow, the original error was "solved", and tx reporter state should be healthy. However, as it was resolved as a side effect, and not via tx reporter recover method, driver needs to inform devlink health about it. Signed-off-by: Eran Ben Elisha Acked-by: Jiri Pirko

[PATCH net-next 2/3] devlink: Add support for direct reporter health state update

2019-03-03 Thread Eran Ben Elisha
devlink_health_reporter_state enum definition to devlink.h so it could be used from drivers as a parameter of devlink_health_reporter_state_update. In addition, add trace_devlink_health_reporter_state_update to provide user notification for reporter state change. Signed-off-by: Eran Ben Elisha Acked-by: Jiri Pirko

[PATCH net-next 0/3] Devlink health updates

2019-03-03 Thread Eran Ben Elisha
This patchset includes a fix [patch 01] to the devlink health state update, in case recover was aborted. In addition, it includes a small enhancement to the infrastructure in order to allow direct state update in run-time, and use it from mlx5e tx reporter. Eran Ben Elisha (3): devlink: Update

[PATCH net-next 1/3] devlink: Update reporter state to error even if recover aborted

2019-03-03 Thread Eran Ben Elisha
. In addition, save the previous health_state in a temporary variable, then use it in the abort check comparison instead of using reporter->health_state which might be already changed. Fixes: c8e1da0bf923 ("devlink: Add health report functionality") Signed-off-by: Eran Ben Elisha A
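
The ordering issue this patch describes can be sketched as below. The types and names (`struct reporter`, `report_error`, the two-value state enum) are simplified assumptions for illustration and do not reproduce the devlink code.

```c
#include <errno.h>
#include <stdbool.h>

enum health_state { STATE_HEALTHY, STATE_ERROR };

/* Simplified, hypothetical reporter; not the devlink structure. */
struct reporter {
    enum health_state health_state;
    bool auto_recover;
};

/*
 * Record a new error on the reporter. Returns 0 if recovery may proceed,
 * or -ECANCELED if it is aborted because the previous error was never
 * recovered.
 */
static int report_error(struct reporter *r)
{
    /* Snapshot the state before overwriting it. */
    enum health_state prev = r->health_state;

    /* The reporter moves to the error state even if recovery is aborted. */
    r->health_state = STATE_ERROR;

    /* Compare against the snapshot: r->health_state is already ERROR here. */
    if (r->auto_recover && prev != STATE_HEALTHY)
        return -ECANCELED;

    return 0;
}
```

Comparing `r->health_state` directly in the abort check would always see the error state once the assignment has run, which is why the snapshot is taken first.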

Re: [Patch net-next] net: add a generic tracepoint for TX queue timeout

2019-05-01 Thread Eran Ben Elisha
ftirqd/1-16[001] ..s2 144.043173: net_dev_xmit_timeout: dev=ens3 > driver=e1000 queue=0 > > Cc: Eran Ben Elisha > Cc: Jiri Pirko > Signed-off-by: Cong Wang > --- > include/trace/events/net.h | 23 +++ > net/sched/sch_generic.c| 2 ++ > 2 files

Re: [Patch net-next v2] net: add a generic tracepoint for TX queue timeout

2019-05-02 Thread Eran Ben Elisha
3: net_dev_xmit_timeout: dev=ens3 > driver=e1000 queue=0 > > Cc: Eran Ben Elisha > Cc: Jiri Pirko > Signed-off-by: Cong Wang Reviewed-by: Eran Ben Elisha

Re: [PATCH net-next] net: ipv6/gre: Add GRO support

2018-04-02 Thread Eran Ben Elisha
On Sun, Apr 1, 2018 at 7:35 PM, Eric Dumazet wrote: > > > On 04/01/2018 06:17 AM, Tariq Toukan wrote: >> From: Eran Ben Elisha >> >> Add GRO capability for IPv6 GRE tunnel and ip6erspan tap, via gro_cells >> infrastructure. >> >> Performance testing

Re: [PATCH net-next] net: ipv6/gre: Add GRO support

2018-04-02 Thread Eran Ben Elisha
>>> Seems good, but why isn't this handled directly in GRO native layer ? >> ip6_tunnel and ip6_gre do not share initialization flow functions (unlike >> ipv4). >> Changing the ipv6 init infrastructure should not be part of this >> patch. we prefer to keep this one minimal, simple and safe. > > >

[RFC PATCH iproute2-next] man: Add devlink health man page

2018-09-13 Thread Eran Ben Elisha
Add devlink-health man page. Devlink-health tool will control device health attributes, sensors, actions and logging. Signed-off-by: Eran Ben Elisha --- Copy paste man output to here for easier review process of the RFC. DEVLINK-HEALTH(8

[RFC PATCH iproute2-next] System specification health API

2018-09-13 Thread Eran Ben Elisha
and actions. Eran Ben Elisha (1): man: Add devlink health man page man/man8/devlink-health.8 | 171 ++ 1 file changed, 171 insertions(+) create mode 100644 man/man8/devlink-health.8 -- 1.8.3.1

Re: [RFC PATCH iproute2-next] man: Add devlink health man page

2018-09-13 Thread Eran Ben Elisha
On 9/13/2018 1:27 PM, Tobin C. Harding wrote: On Thu, Sep 13, 2018 at 11:18:16AM +0300, Eran Ben Elisha wrote: Add devlink-health man page. Devlink-health tool will control device health attributes, sensors, actions and logging. Signed-off-by: Eran Ben Elisha

Re: [RFC PATCH iproute2-next] man: Add devlink health man page

2018-09-13 Thread Eran Ben Elisha
On 9/13/2018 3:08 PM, Andrew Lunn wrote: devlink health sensor set pci/:01:00.0 name TX_COMP_ERROR action reset off action dump on Sets TX_COMP_ERROR sensor parameters for a specific device. I hope the real sensors have more understandable names. If i remember correc

Re: [RFC PATCH iproute2-next] man: Add devlink health man page

2018-09-13 Thread Eran Ben Elisha
On 9/13/2018 4:24 PM, Andrew Lunn wrote: On Thu, Sep 13, 2018 at 03:49:37PM +0300, Eran Ben Elisha wrote: On 9/13/2018 3:08 PM, Andrew Lunn wrote: devlink health sensor set pci/:01:00.0 name TX_COMP_ERROR action reset off action dump on Sets TX_COMP_ERROR sensor

Re: [RFC PATCH iproute2-next] man: Add devlink health man page

2018-09-16 Thread Eran Ben Elisha
On 9/13/2018 6:12 PM, Andrew Lunn wrote: devlink health sensor set pci/:01:00.0 name TX_COMP_ERROR action reset off action dump on Sets TX_COMP_ERROR sensor parameters for a specific device. This is what I had in mind: 1. command interface error 2. command interface

Re: [RFC PATCH iproute2-next] System specification health API

2018-09-16 Thread Eran Ben Elisha
On 9/13/2018 8:36 PM, Jakub Kicinski wrote: On Thu, 13 Sep 2018 11:18:15 +0300, Eran Ben Elisha wrote: The health spec is targeted for Real Time Alerting, in order to know when something bad had happened to a PCI device By spec you mean some standards body spec you implement or this

Re: linux-next: manual merge of the net-next tree with the net tree

2018-01-14 Thread Eran Ben Elisha
On Mon, Jan 15, 2018 at 1:52 AM, Stephen Rothwell wrote: > Hi all, > > Today's linux-next merge of the net-next tree got a conflict in: > > include/linux/mlx5/mlx5_ifc.h > > between commit: > > 8978cc921fc7 ("{net,ib}/mlx5: Don't disable local loopback multicast > traffic when needed") > > fr

[PATCH net-next 09/27] net/mlx5e: Add TX reporter support

2019-01-17 Thread Eran Ben Elisha
reporter_tx.c. Signed-off-by: Eran Ben Elisha Reviewed-by: Saeed Mahameed --- .../net/ethernet/mellanox/mlx5/core/Makefile | 2 +- drivers/net/ethernet/mellanox/mlx5/core/en.h | 18 +- .../ethernet/mellanox/mlx5/core/en/reporter.h | 14 + .../mellanox/mlx5/core/en/reporter_tx.c | 321

[PATCH net-next 19/27] net/mlx5: Refactor print health info

2019-01-17 Thread Eran Ben Elisha
From: Moshe Shemesh Refactor print health info code, split to two functions: 1. mlx5_get_health_info() - writes the health info into a buffer. 2. mlx5_print_health_info() - prints the health info to kernel log. This refactoring is done to enable using the health info data by devlink health repo

[PATCH net-next 26/27] net/mlx5: Report devlink health on FW fatal issues

2019-01-17 Thread Eran Ben Elisha
run it manually. Signed-off-by: Moshe Shemesh Signed-off-by: Eran Ben Elisha Reviewed-by: Saeed Mahameed --- .../net/ethernet/mellanox/mlx5/core/devlink.c | 19 +++ .../net/ethernet/mellanox/mlx5/core/devlink.h | 1 + .../net/ethernet/mellanox/mlx5/core/health.c | 23

[PATCH net-next 20/27] net/mlx5: Create FW devlink_health_reporter

2019-01-17 Thread Eran Ben Elisha
From: Moshe Shemesh Create mlx5_devlink_health_reporter for FW reporter. The FW reporter implements devlink_health_reporter diagnose callback. Signed-off-by: Moshe Shemesh Reviewed-by: Saeed Mahameed --- .../net/ethernet/mellanox/mlx5/core/devlink.c | 114 ++ .../net/ethernet/

[PATCH net-next 27/27] devlink: Add Documentation/networking/devlink-health.txt

2019-01-17 Thread Eran Ben Elisha
From: Aya Levin This patch adds a new file to add information about devlink health mechanism. Signed-off-by: Aya Levin Signed-off-by: Eran Ben Elisha --- Documentation/networking/devlink-health.txt | 86 + 1 file changed, 86 insertions(+) create mode 100644 Documentation

[PATCH net-next 18/27] net/mlx5: Issue SW reset on FW assert

2019-01-17 Thread Eran Ben Elisha
From: Feras Daoud If a FW assert is considered fatal, indicated by a new bit in the health buffer, reset the FW. After the reset go through the normal recovery flow. Only one PF needs to issue the reset, so an attempt is made to prevent the 2nd function from also issuing the reset. It's not an er

[PATCH net-next 21/27] net/mlx5: Add core dump register access functions

2019-01-17 Thread Eran Ben Elisha
From: Moshe Shemesh Add access functions to the core dump register to enable triggering a FW core dump. Signed-off-by: Moshe Shemesh Reviewed-by: Saeed Mahameed --- .../mellanox/mlx5/core/diag/fw_tracer.c | 32 +++ include/linux/mlx5/driver.h | 1 + include/lin

[PATCH net-next 13/27] net/mlx5: Add Crdump FW snapshot support

2019-01-17 Thread Eran Ben Elisha
From: Alex Vesker Crdump allows the driver to create a snapshot of the FW PCI crspace. This is useful in case of catastrophic issues which require FW reset. The snapshot can be used for later debug. The snapshot is exposed using devlink, cr-space address regions are registered on init and snapsh

[PATCH net-next 03/27] devlink: Add health report functionality

2019-01-17 Thread Eran Ben Elisha
already stored) * Auto recovery attempt is being done. depends on: - Auto Recovery configuration - Grace period vs. time since last recover Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/net/devlink.h | 9 include/trace/events/devlink.h | 62
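
The two conditions listed in this snippet (auto recovery configuration, and grace period versus time since the last recover) could be combined as in the sketch below. The struct, its field names and the millisecond units are assumptions for illustration; treating an elapsed time exactly equal to the grace period as eligible is also a choice made here, not something the snippet states.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical reporter configuration; the field names are assumptions. */
struct recover_cfg {
    bool auto_recover;        /* auto recovery enabled for this reporter */
    uint64_t grace_period_ms; /* minimum time between recovery attempts */
};

/* Decide whether an automatic recovery attempt should be made now. */
static bool should_auto_recover(const struct recover_cfg *cfg,
                                uint64_t ms_since_last_recover)
{
    if (!cfg->auto_recover)
        return false;

    /* Still inside the grace period: do not retry yet. */
    return ms_since_last_recover >= cfg->grace_period_ms;
}
```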

[PATCH net-next 16/27] net/mlx5: Handle SW reset of FW in error flow

2019-01-17 Thread Eran Ben Elisha
From: Feras Daoud New mlx5 adapters allow the driver to reset the FW in the event of an error; this action is called "SW Reset". When a SW reset is issued on any PF, all PFs enter reset state, which is a recoverable condition. The existing recovery flow was designed to allow the recovery of a VF afte

[PATCH net-next 02/27] devlink: Add health reporter create/destroy functionality

2019-01-17 Thread Eran Ben Elisha
eporter by devlink_health_reporter_priv function. For each reporter, devlink will hold a metadata of statistics, buffers and status. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/net/devlink.h | 59 net/core/devlink.

[PATCH net-next 22/27] net/mlx5: Add support for FW reporter dump

2019-01-17 Thread Eran Ben Elisha
From: Moshe Shemesh Add support of dump callback for mlx5 FW reporter. Once we trigger FW dump, the FW will write the core dump to its raw data buffer. The tracer translates the raw data to traces and save it to a buffer. Once dump is done, the saved traces data is filled as objects into the dump

[PATCH net-next 01/27] devlink: Add health buffer support

2019-01-17 Thread Eran Ben Elisha
-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/net/devlink.h| 76 ++ include/uapi/linux/devlink.h | 8 + net/core/devlink.c | 501 +++ 3 files changed, 585 insertions(+) diff --git a/include/net/devlink.h b/include/net

[PATCH net-next 10/27] net/mlx5e: Add TX timeout support for mlx5e TX reporter

2019-01-17 Thread Eran Ben Elisha
recover pci/:00:09 reporter TX $devlink health show pci/:00:09.0: name TX state healthy #err 1 #recover 1 last_dump_ts N/A dump_available false attributes: grace_period 500 auto_recover false Signed-off-by: Eran Ben Elisha Reviewed-by: Saeed Mahameed --- .../ethernet/mellanox

[PATCH net-next 25/27] net/mlx5: Add support for FW fatal reporter dump

2019-01-17 Thread Eran Ben Elisha
. Signed-off-by: Moshe Shemesh Signed-off-by: Eran Ben Elisha --- .../net/ethernet/mellanox/mlx5/core/devlink.c | 91 +++ 1 file changed, 91 insertions(+) diff --git a/drivers/net/ethernet/mellanox/mlx5/core/devlink.c b/drivers/net/ethernet/mellanox/mlx5/core/devlink.c index

[PATCH net-next 23/27] net/mlx5: Report devlink health on FW issues

2019-01-17 Thread Eran Ben Elisha
From: Moshe Shemesh Use devlink_health_report() to report any symptom of a FW issue, such as a FW counter miss or a new health syndrome. Signed-off-by: Moshe Shemesh Reviewed-by: Saeed Mahameed --- .../net/ethernet/mellanox/mlx5/core/devlink.c | 21 +++ .../net/ethernet/mellanox/mlx5/core/

[PATCH net-next 12/27] net/mlx5: Add Vendor Specific Capability access gateway

2019-01-17 Thread Eran Ben Elisha
From: Alex Vesker The Vendor Specific Capability (VSC) is used to activate a gateway interfacing with the device. The gateway is used to read or write device configurations, which are organized in different domains (spaces). A configuration access may result in multiple actions, reads, writes. E

[PATCH net-next 00/27] Devlink health reporting and recovery system

2019-01-17 Thread Eran Ben Elisha
dded. Alex Vesker (2): net/mlx5: Add Vendor Specific Capability access gateway net/mlx5: Add Crdump FW snapshot support Aya Levin (1): devlink: Add Documentation/networking/devlink-health.txt Eran Ben Elisha (11): devlink: Add health buffer support devlink: Add health reporter create/dest

[PATCH net-next 15/27] Documentation: mlx5: Update kernel documentation

2019-01-17 Thread Eran Ben Elisha
From: Feras Daoud Initial kernel documentation for mlx5 core driver. This change includes also documentation for the new devlink region param : DEVLINK_PARAM_GENERIC_ID_REGION_SNAPSHOT. Signed-off-by: Feras Daoud Signed-off-by: Saeed Mahameed --- Documentation/networking/mlx5.rst | 39 +++

[PATCH net-next 04/27] devlink: Add health get command

2019-01-17 Thread Eran Ben Elisha
Add devlink health get command to provide reporter/s data for user space. Add the ability to get data per reporter or dump data from all available reporters. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/uapi/linux/devlink.h | 12 +++ net/core/devlink.c | 152

[PATCH net-next 17/27] net/mlx5: Control CR-space access by different PFs

2019-01-17 Thread Eran Ben Elisha
From: Feras Daoud Since the FW can be shared between different PFs/VFs, it is common that more than one health poll will detect a failure; this can lead to multiple resets, which are unneeded. The solution is to use a FW locking mechanism using semaphore space to provide a way to allow only one

[PATCH net-next 24/27] net/mlx5: Add FW fatal devlink_health_reporter

2019-01-17 Thread Eran Ben Elisha
From: Moshe Shemesh Create mlx5_devlink_health_reporter for FW fatal reporter. The FW fatal reporter is added in addition to the fw reporter and implements only the recover callback. The point of having two reporters for FW issues, is that we don't want to run FW recover on any issue, but only fa

[PATCH net-next 07/27] devlink: Add health diagnose command

2019-01-17 Thread Eran Ben Elisha
user. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/uapi/linux/devlink.h | 1 + net/core/devlink.c | 51 2 files changed, 52 insertions(+) diff --git a/include/uapi/linux/devlink.h b/include/uapi/linux/devlink.h index 1c186fd

[PATCH net-next 05/27] devlink: Add health set command

2019-01-17 Thread Eran Ben Elisha
the reporter Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/uapi/linux/devlink.h | 1 + net/core/devlink.c | 36 2 files changed, 37 insertions(+) diff --git a/include/uapi/linux/devlink.h b/include/uapi/linux/devlink.h index

[PATCH net-next 06/27] devlink: Add health recover command

2019-01-17 Thread Eran Ben Elisha
Add devlink health recover command to the uapi, in order to allow the user to execute a recover operation over a specific reporter. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/uapi/linux/devlink.h | 1 + net/core/devlink.c | 20 2 files

[PATCH net-next 14/27] net/mlx5: Use devlink region_snapshot parameter

2019-01-17 Thread Eran Ben Elisha
From: Moshe Shemesh This parameter enables capturing region snapshot of the crspace during critical errors. The default value of this parameter is disabled, it can be enabled using devlink param commands. It is possible to configure during runtime and also driver init. Signed-off-by: Alex Vesker

[PATCH net-next 08/27] devlink: Add health dump {get,clear} commands

2019-01-17 Thread Eran Ben Elisha
fill it via the buffer descriptors API. Devlink will parse it and convert it to netlink nla API in order to pass it to the user. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/uapi/linux/devlink.h | 2 + net/core/devlink.c

[PATCH net-next 11/27] net/mlx5: Move all devlink related functions calls to devlink.c

2019-01-17 Thread Eran Ben Elisha
Centralize all devlink related callbacks in one file. In the downstream patch, some more functionality will be added, this patch is preparing the driver infrastructure for it. Currently, move devlink un/register functions calls into this file. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe

Re: [PATCH net-next 00/27] Devlink health reporting and recovery system

2019-01-17 Thread Eran Ben Elisha
On 1/17/2019 8:52 PM, Jiri Pirko wrote: > Thu, Jan 17, 2019 at 04:04:54PM CET, era...@mellanox.com wrote: > > > 27 patches is way too many to review in one go. > Please split to at least 2 patchsets. Thanks! > Ack, will post v2 soon.

[PATCH net-next v2 10/11] net/mlx5e: Add TX timeout support for mlx5e TX reporter

2019-01-17 Thread Eran Ben Elisha
recover pci/:00:09 reporter TX $devlink health show pci/:00:09.0: name TX state healthy #err 1 #recover 1 last_dump_ts N/A dump_available false attributes: grace_period 500 auto_recover false Signed-off-by: Eran Ben Elisha Reviewed-by: Saeed Mahameed --- .../ethernet/mellanox

[PATCH net-next v2 08/11] devlink: Add health dump {get,clear} commands

2019-01-17 Thread Eran Ben Elisha
fill it via the buffer descriptors API. Devlink will parse it and convert it to netlink nla API in order to pass it to the user. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/uapi/linux/devlink.h | 2 + net/core/devlink.c

[PATCH net-next v2 03/11] devlink: Add health report functionality

2019-01-17 Thread Eran Ben Elisha
already stored) * Auto recovery attempt is being done. depends on: - Auto Recovery configuration - Grace period vs. time since last recover Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/net/devlink.h | 9 include/trace/events/devlink.h | 62

[PATCH net-next v2 04/11] devlink: Add health get command

2019-01-17 Thread Eran Ben Elisha
Add devlink health get command to provide reporter/s data for user space. Add the ability to get data per reporter or dump data from all available reporters. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/uapi/linux/devlink.h | 12 +++ net/core/devlink.c | 152

[PATCH net-next v2 00/11] Devlink health reporting and recovery system

2019-01-17 Thread Eran Ben Elisha
FW* reporters to decrease the amount of patches in the patchset Aya Levin (1): devlink: Add Documentation/networking/devlink-health.txt Eran Ben Elisha (10): devlink: Add health buffer support devlink: Add health reporter create/destroy functionality devlink: Add health report functionality de

[PATCH net-next v2 11/11] devlink: Add Documentation/networking/devlink-health.txt

2019-01-17 Thread Eran Ben Elisha
From: Aya Levin This patch adds a new file to add information about devlink health mechanism. Signed-off-by: Aya Levin Signed-off-by: Eran Ben Elisha --- Documentation/networking/devlink-health.txt | 86 + 1 file changed, 86 insertions(+) create mode 100644 Documentation

[PATCH net-next v2 01/11] devlink: Add health buffer support

2019-01-17 Thread Eran Ben Elisha
-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/net/devlink.h| 76 ++ include/uapi/linux/devlink.h | 8 + net/core/devlink.c | 501 +++ 3 files changed, 585 insertions(+) diff --git a/include/net/devlink.h b/include/net

[PATCH net-next v2 02/11] devlink: Add health reporter create/destroy functionality

2019-01-17 Thread Eran Ben Elisha
eporter by devlink_health_reporter_priv function. For each reporter, devlink will hold a metadata of statistics, buffers and status. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/net/devlink.h | 59 net/core/devlink.

[PATCH net-next v2 06/11] devlink: Add health recover command

2019-01-17 Thread Eran Ben Elisha
Add devlink health recover command to the uapi, in order to allow the user to execute a recover operation over a specific reporter. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/uapi/linux/devlink.h | 1 + net/core/devlink.c | 20 2 files

[PATCH net-next v2 07/11] devlink: Add health diagnose command

2019-01-17 Thread Eran Ben Elisha
user. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/uapi/linux/devlink.h | 1 + net/core/devlink.c | 51 2 files changed, 52 insertions(+) diff --git a/include/uapi/linux/devlink.h b/include/uapi/linux/devlink.h index 1c186fd

[PATCH net-next v2 05/11] devlink: Add health set command

2019-01-17 Thread Eran Ben Elisha
the reporter Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/uapi/linux/devlink.h | 1 + net/core/devlink.c | 36 2 files changed, 37 insertions(+) diff --git a/include/uapi/linux/devlink.h b/include/uapi/linux/devlink.h index

[PATCH net-next v2 09/11] net/mlx5e: Add TX reporter support

2019-01-17 Thread Eran Ben Elisha
reporter_tx.c. Signed-off-by: Eran Ben Elisha Reviewed-by: Saeed Mahameed --- .../net/ethernet/mellanox/mlx5/core/Makefile | 2 +- drivers/net/ethernet/mellanox/mlx5/core/en.h | 18 +- .../ethernet/mellanox/mlx5/core/en/reporter.h | 14 + .../mellanox/mlx5/core/en/reporter_tx.c | 321

Re: [PATCH net-next v2 01/11] devlink: Add health buffer support

2019-01-20 Thread Eran Ben Elisha
On 1/20/2019 12:03 PM, Jiri Pirko wrote: > Thu, Jan 17, 2019 at 10:59:10PM CET, era...@mellanox.com wrote: > > [...] > >> +static void >> +devlink_health_buffers_destroy(struct devlink_health_buffer **buffers_list, >> + u64 size); > > Avoid fwd declarations. > > >>

Re: [PATCH net-next v2 01/11] devlink: Add health buffer support

2019-01-21 Thread Eran Ben Elisha
On 1/20/2019 8:45 PM, David Miller wrote: > From: Jiri Pirko > Date: Sun, 20 Jan 2019 12:08:50 +0100 > >> I haven't have time to review this due to travel. I think it was mistake >> to merge this as the buffer api is wrong in my opinion. I would vote for >> revert if possible. > > Let's spend

Re: [PATCH net-next v2 03/11] devlink: Add health report functionality

2019-01-21 Thread Eran Ben Elisha
On 1/20/2019 1:27 PM, Jiri Pirko wrote: > Thu, Jan 17, 2019 at 10:59:12PM CET, era...@mellanox.com wrote: > > [...] > >> + >> +TRACE_EVENT(devlink_health_recover_aborted, >> +TP_PROTO(const struct devlink *devlink, const char *reporter_name, >> + bool health_state, u64 time_sinc

Re: [PATCH net-next] devlink: Fix error return code in devlink_health_buffer_prepare_skb()

2019-01-21 Thread Eran Ben Elisha
On 1/21/2019 10:21 AM, Wei Yongjun wrote: > Fix to return a negative error code -EMSGSIZE from the error handling > case, otherwise 0 or uninitialized value may be returned. > > Fixes: cb5ccfbe73b3 ("devlink: Add health buffer support") > Signed-off-by: Wei Yongjun Thanks for the fix! However,

Re: [PATCH net-next v2 09/11] net/mlx5e: Add TX reporter support

2019-01-21 Thread Eran Ben Elisha
>> >> The SQ recover from error CQE flow is not a new feature in the driver, >> this patch re-organize the functions and adapt them for the devlink >> health API. For this purpose, move code from en_main.c to a new file >> named reporter_tx.c. >> >> Signed-o

Re: [PATCH net-next v2 09/11] net/mlx5e: Add TX reporter support

2019-01-21 Thread Eran Ben Elisha
On 1/21/2019 2:11 PM, Jiri Pirko wrote: > Mon, Jan 21, 2019 at 12:32:07PM CET, era...@mellanox.com wrote: >> >> >> On 1/20/2019 1:06 PM, Jiri Pirko wrote: >>> Thu, Jan 17, 2019 at 10:59:18PM CET, era...@mellanox.com wrote: > > [...] > > +static int +mlx5e_tx_reporter_build_diag

[PATCH net-next 4/7] devlink: Delete depracated health buffers API

2019-01-22 Thread Eran Ben Elisha
It is not in use anymore and can be safely removed from the kernel code. Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- include/net/devlink.h| 76 -- include/uapi/linux/devlink.h | 8 - net/core/devlink.c | 501 --- 3

[PATCH net-next 0/7] Devlink health updates

2019-01-22 Thread Eran Ben Elisha
evlink health and mlx5e TX reporter. Also, added some helpers together with the new API, which reduce the code required by the driver to fill dump and diagnose significantly. Eventually, it also deletes the old API. In addition, it includes some small fixes in the devlink and mlx5e TX reporter. Er

[PATCH net-next 5/7] devlink: Remove spaces around "=" in the logger print

2019-01-22 Thread Eran Ben Elisha
No need for spaces around "=" in the logger print. Fixes: c7af343b4e33 ("devlink: Add health report functionality") Signed-off-by: Eran Ben Elisha Reported-by: Jiri Pirko Reviewed-by: Moshe Shemesh --- include/trace/events/devlink.h | 2 +- 1 file changed, 1 insertion(+),

[PATCH net-next 6/7] devlink: Fix use-after-free at reporter destroy

2019-01-22 Thread Eran Ben Elisha
Fix a bug where reporter->devlink attribute was used after the reporter was freed. Fixes: 880ee82f0313 ("devlink: Add health reporter create/destroy functionality") Signed-off-by: Eran Ben Elisha Reviewed-by: Moshe Shemesh --- net/core/devlink.c | 6 -- 1 file changed, 4 ins

[PATCH net-next 7/7] net/mlx5e: Add RTNL lock to TX recover flow

2019-01-22 Thread Eran Ben Elisha
As part of the recover flow, driver calls mlx5e_open_locked, which eventually calls netif_set_real_num_tx_queues. RTNL lock must be held as part of this flow in advance, like in other flows in the driver. Fixes: aba25279c100 ("net/mlx5e: Add TX reporter support") Signed-off-by: Eran

[PATCH net-next 1/7] devlink: Add devlink msg API

2019-01-22 Thread Eran Ben Elisha
-by: Eran Ben Elisha CC: Wei Yongjun Reviewed-by: Moshe Shemesh --- include/net/devlink.h| 70 ++ include/uapi/linux/devlink.h | 8 + net/core/devlink.c | 455 +++ 3 files changed, 533 insertions(+) diff --git a/include/net/devlink.h b
