[dpdk-dev] [PATCH] maintainers: claim responsibility for Vhost-user, Virtio PMD & Vhost PMD

2017-02-17 Thread Maxime Coquelin
Add myself as co-maintainer for these drivers and library.

Suggested-by: Yuanhan Liu 
Signed-off-by: Maxime Coquelin 
---
 MAINTAINERS | 3 +++
 1 file changed, 3 insertions(+)

diff --git a/MAINTAINERS b/MAINTAINERS
index cc3bf98..8305237 100644
--- a/MAINTAINERS
+++ b/MAINTAINERS
@@ -381,6 +381,7 @@ F: doc/guides/nics/vmxnet3.rst
 
 Vhost-user
 M: Yuanhan Liu 
+M: Maxime Coquelin 
 T: git://dpdk.org/next/dpdk-next-virtio
 F: lib/librte_vhost/
 F: doc/guides/prog_guide/vhost_lib.rst
@@ -390,11 +391,13 @@ F: doc/guides/sample_app_ug/vhost.rst
 Vhost PMD
 M: Tetsuya Mukawa 
 M: Yuanhan Liu 
+M: Maxime Coquelin 
 T: git://dpdk.org/next/dpdk-next-virtio
 F: drivers/net/vhost/
 
 Virtio PMD
 M: Yuanhan Liu 
+M: Maxime Coquelin 
 T: git://dpdk.org/next/dpdk-next-virtio
 F: drivers/net/virtio/
 F: doc/guides/nics/virtio.rst
-- 
2.9.3



[dpdk-dev] [PATCH v3] net/i40e: fix allocating hash table on socket 0

2017-02-17 Thread Beilei Xing
Testpmd failed to start in another hugetlbfs mount point on
i40e, the root cause is that hash table is always allocated
on socket 0. Fix the issue by assigning scocket id during
hash parameter defination.

Fixes: 5c53c82c8174 ("net/i40e: store flow director filter")
Fixes: 425c3325f0b0 ("net/i40e: store tunnel filter")
Fixes: 078259773da9 ("net/i40e: store ethertype filter")

Signed-off-by: Beilei Xing 
---
v3 changes:
 Update commit log.
v2 changes:
 Update commit log.

 drivers/net/i40e/i40e_ethdev.c | 6 ++
 1 file changed, 6 insertions(+)

diff --git a/drivers/net/i40e/i40e_ethdev.c b/drivers/net/i40e/i40e_ethdev.c
index 303027b..be2e580 100644
--- a/drivers/net/i40e/i40e_ethdev.c
+++ b/drivers/net/i40e/i40e_ethdev.c
@@ -899,6 +899,8 @@ i40e_init_ethtype_filter_list(struct rte_eth_dev *dev)
.entries = I40E_MAX_ETHERTYPE_FILTER_NUM,
.key_len = sizeof(struct i40e_ethertype_filter_input),
.hash_func = rte_hash_crc,
+   .hash_func_init_val = 0,
+   .socket_id = rte_socket_id(),
};
 
/* Initialize ethertype filter rule list and hash */
@@ -942,6 +944,8 @@ i40e_init_tunnel_filter_list(struct rte_eth_dev *dev)
.entries = I40E_MAX_TUNNEL_FILTER_NUM,
.key_len = sizeof(struct i40e_tunnel_filter_input),
.hash_func = rte_hash_crc,
+   .hash_func_init_val = 0,
+   .socket_id = rte_socket_id(),
};
 
/* Initialize tunnel filter rule list and hash */
@@ -985,6 +989,8 @@ i40e_init_fdir_filter_list(struct rte_eth_dev *dev)
.entries = I40E_MAX_FDIR_FILTER_NUM,
.key_len = sizeof(struct rte_eth_fdir_input),
.hash_func = rte_hash_crc,
+   .hash_func_init_val = 0,
+   .socket_id = rte_socket_id(),
};
 
/* Initialize flow director filter rule list and hash */
-- 
2.5.5



Re: [dpdk-dev] [PATCH] maintainers: claim responsibility for Vhost-user, Virtio PMD & Vhost PMD

2017-02-17 Thread Yuanhan Liu
On Fri, Feb 17, 2017 at 09:31:42AM +0100, Maxime Coquelin wrote:
> Add myself as co-maintainer for these drivers and library.
> 
> Suggested-by: Yuanhan Liu 
> Signed-off-by: Maxime Coquelin 

I think it's obvious that Maxime is done great job at reviewing
vhost/virtio patches. So,

Acked-by: Yuanhan Liu 

--yliu


Re: [dpdk-dev] [PATCH v1] doc: add template release notes for 17.05

2017-02-17 Thread Thomas Monjalon
2017-02-16 15:01, Mcnamara, John:
> 
> > -Original Message-
> > From: Thomas Monjalon [mailto:thomas.monja...@6wind.com]
> > Sent: Wednesday, February 15, 2017 2:55 PM
> > To: Mcnamara, John 
> > Cc: dev@dpdk.org
> > Subject: Re: [dpdk-dev] [PATCH v1] doc: add template release notes for
> > 17.05
> > 
> > 2017-02-15 12:38, John McNamara:
> > > +   Build the docs and view the output file to ensure the changes are
> > correct::
> > > +
> > > +  make doc-guides-html
> > > +
> > > +  firefox build/doc/html/guides/rel_notes/release_17_05.html
> > 
> > I would suggest xdg-open instead of firefox.
> > It is more open regarding chromium ;)
> 
> I generally use xdg-open, which in my case gives me midori but for the
> docs it is probably better to have something that is easily recognizable
> as a browser. Either way I'm okay if you wish to make the change inline
> during merge. Or would you prefer a V2.

Midori? Seems another interesting choice:)

Applied with this change, thanks


[dpdk-dev] DPDK Technical Board Meeting, 2017-02-15

2017-02-17 Thread Richardson, Bruce
A meeting of the DPDK technical board was held last Wednesday, 2017-02-15. 
Below are the meeting minutes.  

Please note that future meetings will be open to all to attend, as described 
below. The next meeting is planned for March 1st, and any topics to be referred 
to the tech board for discussion at that meeting should be emailed to 
techbo...@dpdk.org by February 26th at the latest.

Regards,
/Bruce

--
Attendees:
Hemant Agrawal, Konstantin Ananyev, Jan Blunck, Stephen Hemminger, Jerin Jacob, 
Yuanhan Liu, Olivier Matz, Thomas Monjalon, Bruce Richardson

Agenda and Notes:
0. General Meeting Admin
 * Bruce Richardson volunteered to chair this meeting (Meeting chair was 
previously agreed to be a rotating role.)
 * Olivier Matz volunteered to chair the next meeting
 * At each future meetings, the chair for the next meeting will be decided and 
that person also has the responsibility of preparing the meeting agenda ahead 
of time.
 * Next meeting was agreed to be at the same time in two weeks - 1st March @ 
3pm UTC (4pm France, 11pm PRC, 7am US Pacific). 
   - Olivier to send a reminder with agenda a few days in advance.

1. Welcome new members to the board, and answer any questions they had about 
technical board operations.
 * Hemant, Jan and Yuanhan were welcomed, but had no questions on tech board at 
this point.

2. Making meetings public
 * LF charter for DPDK specifies that meetings of the techboard should be public
 * Decision made to continue to hold meetings on #dpdk-board channel on IRC
 * Meetings will be announced ahead of time:
- In minutes of previous meeting
- Via separate email a couple of days before the meeting
 * Anyone interested in the board meeting can join the IRC channel and make 
their contribution.
 * Only items on the agreed agenda to be discussed. Any topics to be covered 
must be sent in advance to techbo...@dpdk.org

3. Update board charter on website
 * The website details of the Tech board is out of date, both membership and 
its mode of operation and scope
 * Thomas has already sent one proposed update out via email, in the form of 
patch to the website, and Hemant has already provided one comment
 * Thomas to send out a new draft, and all tech board members to comment or ack 
it to signal agreement.
 * Update won't be applied until all members ack. 
 * If not agreed before next meeting, should be discussed at that meeting

4. Issue of review and timely feedback
 * Discussion focused on review of patches for existing DPDK 
components/libraries
 * Agreed that patch maintainers are primarily responsible for accepting 
patches in their area, and need to ensure sufficient reviews are done
 * Once patch has been sufficiently reviewed, the maintainer should ack the 
patch
 * Any patches which have been acked by the maintainer, and which have no 
subsequent issues raised with them, should be automatically merged to the 
relevant tree after 2 weeks.
 - in case where the ack is received less than 2 weeks before a merge 
deadline, the point at which it can be automatically merged is 2 days before 
the deadline
 - To make a release, a patch must be acked by the maintainer at least 2 
days before the merge deadline.
 * If not merged at the automatic-merge deadline itself, e.g. unavoidable delay 
by committer, any subsequent feedback received should be considered "too late", 
and must be addressed by a follow-on patch, rather than via a revision of the 
original submission.
 * Trivial patches may be merged sooner, or without maintainer ack, at the tree 
committers discretion.

5. Accept and review new libraries
 * Discussion begun on this topic, but no consensus reached before time ran out
 * Most members felt this topic was closely tied to the "scope" of DPDK as a 
project
 * Discussion to continue over email, and to be discussed at the next meeting 
if not resolved before then.

Agenda Topics Not Covered:
* Community questions/issues
* Any action for some old patches/threads?

Action Summary:
* Olivier to prepare agenda for next meeting, and send out reminders
* Thomas to send revised draft of website update for tech board membership etc.

NEXT MEETING:
* 2017 - 03 - 01 @ 3pm UTC


Re: [dpdk-dev] [PATCH] maintainers: claim responsibility for Vhost-user, Virtio PMD & Vhost PMD

2017-02-17 Thread Thomas Monjalon
2017-02-17 16:49, Yuanhan Liu:
> On Fri, Feb 17, 2017 at 09:31:42AM +0100, Maxime Coquelin wrote:
> > Add myself as co-maintainer for these drivers and library.
> > 
> > Suggested-by: Yuanhan Liu 
> > Signed-off-by: Maxime Coquelin 
> 
> I think it's obvious that Maxime is done great job at reviewing
> vhost/virtio patches. So,
> 
> Acked-by: Yuanhan Liu 

Thanks Yuanhan for proposing.

Acked-by: Thomas Monjalon 

Applied, congratulations for your new responsibilities :)



[dpdk-dev] [PATCH] version: 17.05-rc0

2017-02-17 Thread Thomas Monjalon
Signed-off-by: Thomas Monjalon 
---
 lib/librte_eal/common/include/rte_version.h | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/lib/librte_eal/common/include/rte_version.h 
b/lib/librte_eal/common/include/rte_version.h
index 979b6c3..c84c748 100644
--- a/lib/librte_eal/common/include/rte_version.h
+++ b/lib/librte_eal/common/include/rte_version.h
@@ -61,7 +61,7 @@ extern "C" {
 /**
  * Minor version/month number i.e. the mm in yy.mm.z
  */
-#define RTE_VER_MONTH 2
+#define RTE_VER_MONTH 5
 
 /**
  * Patch level number i.e. the z in yy.mm.z
@@ -71,14 +71,14 @@ extern "C" {
 /**
  * Extra string to be appended to version number
  */
-#define RTE_VER_SUFFIX ""
+#define RTE_VER_SUFFIX "-rc"
 
 /**
  * Patch release number
  *   0-15 = release candidates
  *   16   = release
  */
-#define RTE_VER_RELEASE 16
+#define RTE_VER_RELEASE 0
 
 /**
  * Macro to compute a version number usable for comparisons
-- 
2.7.0



Re: [dpdk-dev] [RFC 0/8] mbuf: structure reorganization

2017-02-17 Thread Olivier Matz
Hi Jan,

On Thu, 16 Feb 2017 18:26:39 +0100, Jan Blunck 
wrote:
> On Thu, Feb 16, 2017 at 2:48 PM, Olivier Matz
>  wrote:
> > On Mon, 6 Feb 2017 18:41:27 +, "Ananyev, Konstantin"
> >  wrote:  
> >> >
> >> > The main changes are:
> >> > - reorder structure to increase vector performance on some non-ia
> >> >   platforms.
> >> > - add a 64bits timestamp field in the 1st cache line  
> >>
> >> Wonder why it deserves to be in first cache line?
> >> How it differs from seqn below (pure SW stuff right now).  
> >
> > In case the timestamp is set from a NIC value, it is set in the Rx
> > path. So that's why I think it deserve to be located in the 1st
> > cache line.
> >
> > As you said, the seqn is a pure sw stuff right: it is set in a lib,
> > not in a PMD rx path.
> >  
> 
> If we talk about setting the timestamp value in the RX path this
> implicitly means software timestamps. Hardware timestamping usually
> works by letting the hardware inject sync events for coarse time
> tracking and additionally injecting fine granular per-packet ticks at
> a specific offset in the packet. Out of performance reasons I don't
> think it makes sense to extract this during the burst and write it
> into the mbuf again.

From what I've understand, at least it does not work like this for
mellanox NICs: timestamp is a metadata attached to a rx packet. But
maybe they (and other NIC vendors interrested in the feature) can
confirm or not.

> 
> The problem with timestamps is to get the abstraction right wrt the
> correction factors and the size of the tick vs. the timestamp in the
> events injected. From my perspective it would be better to extract the
> handling of timestamp data into a library with PMD specific
> implementation of the conversions. That way the normalized timestamp
> values can get extracted if they are present. The mbuf itself would
> only indicate the presence of timestamp metadata in that case.

I agree however that we need to properly define the meaning of this
field. My idea is:

- the timestamp is in nanosecond
- the reference is always the same for a given path: if the timestamp is
  set in a PMD, all the packets for this PMD will have the same
  reference, but for 2 different PMDs (or a sw lib), the reference
  would not be the same.

I think it's enough for many use cases.
We can later add helpers to compare timestamps with different
references.


Regards,
Olivier


Re: [dpdk-dev] [PATCH] version: 17.05-rc0

2017-02-17 Thread Bruce Richardson
On Fri, Feb 17, 2017 at 11:42:20AM +0100, Thomas Monjalon wrote:
> Signed-off-by: Thomas Monjalon 

Acked-by: Bruce Richardson 



Re: [dpdk-dev] Coverity project created for dpdk-next-net tree

2017-02-17 Thread Ferruh Yigit
On 1/11/2017 6:57 PM, Ferruh Yigit wrote:
> On 11/21/2016 11:29 PM, Ferruh Yigit wrote:
>> Coverity project created for dpdk-next-net tree:
>> https://scan.coverity.com/projects/dpdk-next-net
>>
>> This can be useful to fix coverity issues before next-net merged into
>> master branch.
>>
>> Project is open for everyone to register and to get scan reports, there
>> will be regular scans for next-net tree.
>>
>> Specially driver maintainers, please consider registering to the project
>> and checking your driver's defect status.
>> I believe there are some false positives, it is appreciated if you can
>> identify and mark them in coverity tool.
>>
>> PS: As a reminder, a coverity project already exists for main dpdk repo:
>> https://scan.coverity.com/projects/dpdk-data-plane-development-kit
>>
> 
> Reminder of the next-net coverity project.
> Project kept up to date with new driver patches merged in.
> 
> Driver maintainers can check and fix issues introduced in next-net
> before sub-tree merged into main tree.

A new coverity scan done for next-net tree for 17.02 release:
https://scan.coverity.com/projects/dpdk-next-net?tab=overview

Driver maintainers, please consider fixing existing defects before
adding more code into 17.05 release.


Thanks,
ferruh


Re: [dpdk-dev] [PATCH] version: 17.05-rc0

2017-02-17 Thread Ferruh Yigit
On 2/17/2017 10:59 AM, Bruce Richardson wrote:
> On Fri, Feb 17, 2017 at 11:42:20AM +0100, Thomas Monjalon wrote:
>> Signed-off-by: Thomas Monjalon 
> 
> Acked-by: Bruce Richardson 

Hi Thomas,

What do you think about tagging -rc0 releases?

I find it useful for "git describe" output, but not sure if it bloats
ref list.

Thanks,
ferruh


[dpdk-dev] decision process to accept new libraries

2017-02-17 Thread Thomas Monjalon
2017-02-17 10:22, Richardson, Bruce:
> 4. Issue of review and timely feedback
>  * Discussion focused on review of patches for existing DPDK 
> components/libraries
>  * Agreed that patch maintainers are primarily responsible for accepting 
> patches in their area, and need to ensure sufficient reviews are done
>  * Once patch has been sufficiently reviewed, the maintainer should ack the 
> patch
>  * Any patches which have been acked by the maintainer, and which have no 
> subsequent issues raised with them, should be automatically merged to the 
> relevant tree after 2 weeks.
>  - in case where the ack is received less than 2 weeks before a merge 
> deadline, the point at which it can be automatically merged is 2 days before 
> the deadline
>  - To make a release, a patch must be acked by the maintainer at least 2 
> days before the merge deadline.
>  * If not merged at the automatic-merge deadline itself, e.g. unavoidable 
> delay by committer, any subsequent feedback received should be considered 
> "too late", and must be addressed by a follow-on patch, rather than via a 
> revision of the original submission.
>  * Trivial patches may be merged sooner, or without maintainer ack, at the 
> tree committers discretion.

That's really good to have more rules like the one above.
It will make the process clearer to everyone,
and hopefully will speed up our workflow.

> 5. Accept and review new libraries
>  * Discussion begun on this topic, but no consensus reached before time ran 
> out
>  * Most members felt this topic was closely tied to the "scope" of DPDK as a 
> project
>  * Discussion to continue over email, and to be discussed at the next meeting 
> if not resolved before then.

Let's discuss this important topic.

Deciding to add a library is about deciding the scope of the project.
However I think we do not need to define the DPDK scope now,
but we can define a process to help in scope decisions.
It is a first step which will help us to progress later in scope discussions.

Below are 3 separate (and related) topics to discuss.
Please let's target a decision for the first item only.

1/
I suggest that each new library must be developed in a separate repository
on dpdk.org. Then it can be asked to integrate it in the main project/repo.
Such discussion must happen on the mailing list and the techboard will vote
for the integration of the *existing* library.
I suggest such vote should be done by majority as stated in the LF charter.

2/
We could imagine the same kind of process for removal,
i.e. moving a library or an app outside of DPDK in a separate repository.
If there is a public request with enough discussion, the techboard could vote
for moving some DPDK parts in a separate repository.
The committer would be a volunteer or the current maintainer of the code.
A dedicated mailing-list will be created for the new project.

3/
Why a project scope should be defined?
I think it is important to agree on the long-term goal of defining a scope.
My view on the goal for the main DPDK project (git://dpdk.org/dpdk):
- a consistent set of libraries
- a good quality in every libraries
- contributors interested in every libraries (no community segmentation)
- contributors able to understand every libraries
In less words, a clear project by a strong community of contributors.



Re: [dpdk-dev] [PATCH] version: 17.05-rc0

2017-02-17 Thread Thomas Monjalon
2017-02-17 10:59, Bruce Richardson:
> On Fri, Feb 17, 2017 at 11:42:20AM +0100, Thomas Monjalon wrote:
> > Signed-off-by: Thomas Monjalon 
> 
> Acked-by: Bruce Richardson 

Applied, let's start the 17.05 cycle :)


Re: [dpdk-dev] [PATCH] version: 17.05-rc0

2017-02-17 Thread Thomas Monjalon
2017-02-17 11:15, Ferruh Yigit:
> What do you think about tagging -rc0 releases?
> 
> I find it useful for "git describe" output, but not sure if it bloats
> ref list.

You're right, it could be useful for "git describe".
Unfortunately it would create a snapshot in the cgit view:
http://dpdk.org/browse/dpdk/refs/


Re: [dpdk-dev] [PATCHv7 00/47] NXP DPAA2 PMD

2017-02-17 Thread Ferruh Yigit
On 2/16/2017 1:27 PM, Bruce Richardson wrote:
> On Thu, Feb 16, 2017 at 08:22:49AM -0500, Neil Horman wrote:
>> On Thu, Feb 16, 2017 at 06:08:59AM +0530, Hemant Agrawal wrote:
>>> The patch series adds NXP’s QorIQ-Layerscape DPAA2 Architecture based
>>> fsl-mc bus driver and network SoC PMD.  This version of the driver
>>> supports NXP LS208xA, LS204xA and LS108x families Network SoCs.
>>>
>>> DPAA2, or Data Path Acceleration Architecture, is a hardware architecture
>>> designed for high-speed network packet processing. It uses a bus name
>>> ‘fsl-mc’, part of Linux Kernel Staging tree [1], for resource management.
>>>
>>> A brief description of architecture is given below; detailed description
>>> is part of the documentation in the patches itself.
>>>
>>> DPAA2 contains hardware component called the Management Complex (or MC).
>>> It manages the DPAA2 hardware resources.  The MC provides an object-based
>>> abstraction for software drivers to use the DPAA2 hardware.
>>>
>>> Some of the key objects are:
>>> - DPNI, which refers to the network interface object.
>>> - DPBP, which refers to HW based memory pool object
>>> - DPIO, refers to processing context for accessing QBMAN
>>>
>>> Besides the MC, DPAA2 also includes a Hardware based Queue and Buffer 
>>> Manager
>>> called QBMAN. Prime responsibility of QBMAN is to allow lockless access to
>>> software/user-space to the queues and buffers implemented in the hardware.
>>>
>>> The patch series could be logically structured into following sub-areas:
>>> 1. Make file changes for crc in armv8 core machine type and driver 
>>> dependency
>>> 2. Common dpaa2 hw accelerator drivers for QBMAN.
>>> 3. Indroducing fsl-mc bus as rte_bus, it's componenets.
>>> 4. Introducing dpaa2 pmd driver
>>> 5. Introducing dpaa2 mempool 
>>> 6. Support for DPAA2 Ethernet Device (ethdev)
>>> 7. Additional functionality in DPAA2 ethdev.
>>>
>>> The following design decisions are made during development:
>>>
>>> 1. DPAA2 implements a new bus called "fsl-mc" and some common accelerator 
>>> drivers.
>>>These drivers will be shared with dpaa2 based crypto drivers.
>>>
>>> 2. DPAA2 implements the HW mempool offload with DPBP object.
>>>  - The new pool is being configured using compile time option and pool name
>>>as "dpaa2".
>>>
>>> 3. It maintains per lcore DPIO objects and affine the DPIO instance to the
>>>processing threads accessing the QBMAN HW.
>>>
>>> Prerequisites:
>>>  - For running the PMD, NXP's SoC (board) and SDK (software/BSP) is 
>>> required.
>>>Information about obtaining relevant software is available in the docs
>>>as part of the patch.
>>
>> NAK.  The SDK requires registration to obtain, and appears to be non-open
>> source.  This driver is unmaintainable given that.
>>
> Hi Hemant,
> 
> can you perhaps clarify things here. What is the requirement to:
> * build the driver/DPDK for the platform
> * run applications using DPDK on the platform
> 
> Also what is the license/availability for those requirements.

Hi Bruce, Hemant, Neil, Thomas,

I did able to compile the driver without the SDK. It looks like that SDK
is a runtime dependency.

What is the DPDK requirement here?
If it is not breaking the build, this PMD provides it.

Thanks,
ferruh




Re: [dpdk-dev] [PATCH] maintainers: claim responsibility for Vhost-user, Virtio PMD & Vhost PMD

2017-02-17 Thread Maxime Coquelin



On 02/17/2017 11:35 AM, Thomas Monjalon wrote:

2017-02-17 16:49, Yuanhan Liu:

On Fri, Feb 17, 2017 at 09:31:42AM +0100, Maxime Coquelin wrote:

Add myself as co-maintainer for these drivers and library.

Suggested-by: Yuanhan Liu 
Signed-off-by: Maxime Coquelin 


I think it's obvious that Maxime is done great job at reviewing
vhost/virtio patches. So,

Acked-by: Yuanhan Liu 


Thanks Yuanhan for proposing.

Acked-by: Thomas Monjalon 

Applied, congratulations for your new responsibilities :)



Thanks Thomas :)
I'll do my best to be as good as Yuanhan in this role!

Cheers,
Maxime



[dpdk-dev] [PATCH 0/3] cryptodev: change device configuration API

2017-02-17 Thread Fan Zhang
This patchset changes the cryptodev library's device configuration API
and updates all crypto PMD to comply with this change.

Fan Zhang (3):
  cryptodev: change device configuration API
  crypto/scheduler: improve slave configuration
  doc: remove deprecation notice

 app/test/test_cryptodev.c  | 24 +---
 doc/guides/rel_notes/deprecation.rst   |  4 
 drivers/crypto/aesni_gcm/aesni_gcm_pmd_ops.c   |  3 ++-
 drivers/crypto/aesni_mb/rte_aesni_mb_pmd_ops.c |  3 ++-
 drivers/crypto/armv8/rte_armv8_pmd_ops.c   |  3 ++-
 drivers/crypto/kasumi/rte_kasumi_pmd_ops.c |  3 ++-
 drivers/crypto/null/null_crypto_pmd_ops.c  |  3 ++-
 drivers/crypto/openssl/rte_openssl_pmd_ops.c   |  3 ++-
 drivers/crypto/qat/qat_crypto.c|  5 +++--
 drivers/crypto/qat/qat_crypto.h|  3 ++-
 drivers/crypto/scheduler/scheduler_pmd_ops.c   | 20 +++-
 drivers/crypto/snow3g/rte_snow3g_pmd_ops.c |  3 ++-
 drivers/crypto/zuc/rte_zuc_pmd_ops.c   |  3 ++-
 lib/librte_cryptodev/rte_cryptodev.c   |  8 +++-
 lib/librte_cryptodev/rte_cryptodev_pmd.h   |  4 +++-
 15 files changed, 47 insertions(+), 45 deletions(-)

-- 
2.7.4



[dpdk-dev] [PATCH 1/3] cryptodev: change device configuration API

2017-02-17 Thread Fan Zhang
This patch changes the device configuration API for rte_cryptodev_ops
function prototype, and update all cryptodev PMDs for this change.

Signed-off-by: Fan Zhang 
---
 drivers/crypto/aesni_gcm/aesni_gcm_pmd_ops.c   | 3 ++-
 drivers/crypto/aesni_mb/rte_aesni_mb_pmd_ops.c | 3 ++-
 drivers/crypto/armv8/rte_armv8_pmd_ops.c   | 3 ++-
 drivers/crypto/kasumi/rte_kasumi_pmd_ops.c | 3 ++-
 drivers/crypto/null/null_crypto_pmd_ops.c  | 3 ++-
 drivers/crypto/openssl/rte_openssl_pmd_ops.c   | 3 ++-
 drivers/crypto/qat/qat_crypto.c| 5 +++--
 drivers/crypto/qat/qat_crypto.h| 3 ++-
 drivers/crypto/scheduler/scheduler_pmd_ops.c   | 6 --
 drivers/crypto/snow3g/rte_snow3g_pmd_ops.c | 3 ++-
 drivers/crypto/zuc/rte_zuc_pmd_ops.c   | 3 ++-
 lib/librte_cryptodev/rte_cryptodev.c   | 8 +++-
 lib/librte_cryptodev/rte_cryptodev_pmd.h   | 4 +++-
 13 files changed, 35 insertions(+), 15 deletions(-)

diff --git a/drivers/crypto/aesni_gcm/aesni_gcm_pmd_ops.c 
b/drivers/crypto/aesni_gcm/aesni_gcm_pmd_ops.c
index 2362006..7e7e477 100644
--- a/drivers/crypto/aesni_gcm/aesni_gcm_pmd_ops.c
+++ b/drivers/crypto/aesni_gcm/aesni_gcm_pmd_ops.c
@@ -114,7 +114,8 @@ static const struct rte_cryptodev_capabilities 
aesni_gcm_pmd_capabilities[] = {
 
 /** Configure device */
 static int
-aesni_gcm_pmd_config(__rte_unused struct rte_cryptodev *dev)
+aesni_gcm_pmd_config(__rte_unused struct rte_cryptodev *dev,
+   __rte_unused struct rte_cryptodev_config *config)
 {
return 0;
 }
diff --git a/drivers/crypto/aesni_mb/rte_aesni_mb_pmd_ops.c 
b/drivers/crypto/aesni_mb/rte_aesni_mb_pmd_ops.c
index 3d49e2a..b08bf05 100644
--- a/drivers/crypto/aesni_mb/rte_aesni_mb_pmd_ops.c
+++ b/drivers/crypto/aesni_mb/rte_aesni_mb_pmd_ops.c
@@ -233,7 +233,8 @@ static const struct rte_cryptodev_capabilities 
aesni_mb_pmd_capabilities[] = {
 
 /** Configure device */
 static int
-aesni_mb_pmd_config(__rte_unused struct rte_cryptodev *dev)
+aesni_mb_pmd_config(__rte_unused struct rte_cryptodev *dev,
+   __rte_unused struct rte_cryptodev_config *config)
 {
return 0;
 }
diff --git a/drivers/crypto/armv8/rte_armv8_pmd_ops.c 
b/drivers/crypto/armv8/rte_armv8_pmd_ops.c
index 2bf6475..78c0b4e 100644
--- a/drivers/crypto/armv8/rte_armv8_pmd_ops.c
+++ b/drivers/crypto/armv8/rte_armv8_pmd_ops.c
@@ -111,7 +111,8 @@ static const struct rte_cryptodev_capabilities
 
 /** Configure device */
 static int
-armv8_crypto_pmd_config(__rte_unused struct rte_cryptodev *dev)
+armv8_crypto_pmd_config(__rte_unused struct rte_cryptodev *dev,
+   __rte_unused struct rte_cryptodev_config *config)
 {
return 0;
 }
diff --git a/drivers/crypto/kasumi/rte_kasumi_pmd_ops.c 
b/drivers/crypto/kasumi/rte_kasumi_pmd_ops.c
index b9285a4..40997d6 100644
--- a/drivers/crypto/kasumi/rte_kasumi_pmd_ops.c
+++ b/drivers/crypto/kasumi/rte_kasumi_pmd_ops.c
@@ -89,7 +89,8 @@ static const struct rte_cryptodev_capabilities 
kasumi_pmd_capabilities[] = {
 
 /** Configure device */
 static int
-kasumi_pmd_config(__rte_unused struct rte_cryptodev *dev)
+kasumi_pmd_config(__rte_unused struct rte_cryptodev *dev,
+   __rte_unused struct rte_cryptodev_config *config)
 {
return 0;
 }
diff --git a/drivers/crypto/null/null_crypto_pmd_ops.c 
b/drivers/crypto/null/null_crypto_pmd_ops.c
index 26ff631..801b5f9 100644
--- a/drivers/crypto/null/null_crypto_pmd_ops.c
+++ b/drivers/crypto/null/null_crypto_pmd_ops.c
@@ -85,7 +85,8 @@ static const struct rte_cryptodev_capabilities 
null_crypto_pmd_capabilities[] =
 
 /** Configure device */
 static int
-null_crypto_pmd_config(__rte_unused struct rte_cryptodev *dev)
+null_crypto_pmd_config(__rte_unused struct rte_cryptodev *dev,
+   __rte_unused struct rte_cryptodev_config *config)
 {
return 0;
 }
diff --git a/drivers/crypto/openssl/rte_openssl_pmd_ops.c 
b/drivers/crypto/openssl/rte_openssl_pmd_ops.c
index 875550c..03b053e 100644
--- a/drivers/crypto/openssl/rte_openssl_pmd_ops.c
+++ b/drivers/crypto/openssl/rte_openssl_pmd_ops.c
@@ -449,7 +449,8 @@ static const struct rte_cryptodev_capabilities 
openssl_pmd_capabilities[] = {
 
 /** Configure device */
 static int
-openssl_pmd_config(__rte_unused struct rte_cryptodev *dev)
+openssl_pmd_config(__rte_unused struct rte_cryptodev *dev,
+   __rte_unused struct rte_cryptodev_config *config)
 {
return 0;
 }
diff --git a/drivers/crypto/qat/qat_crypto.c b/drivers/crypto/qat/qat_crypto.c
index 43e1d00..67cb8f8 100644
--- a/drivers/crypto/qat/qat_crypto.c
+++ b/drivers/crypto/qat/qat_crypto.c
@@ -1326,10 +1326,11 @@ void qat_crypto_sym_session_init(struct rte_mempool 
*mp, void *sym_sess)
offsetof(struct rte_cryptodev_sym_session, _private);
 }
 
-int qat_dev_config(__rte_unused struct rte_cryptodev *dev)
+int qat_dev_config(__rte_unused struct rte_cryptodev *dev,
+   __rte_unused struct rte_cryptodev_config *config)
 {
PMD_I

[dpdk-dev] [PATCH 2/3] crypto/scheduler: improve slave configuration

2017-02-17 Thread Fan Zhang
Since the new device configuration API is updated, we can make use of
this feature to the crypto scheduler PMD to configure its slaves
automatically with the same configurations it got. As originally the
slaves have to be manually configured one by one, this patch should
help reducing the coding complexity.

Signed-off-by: Fan Zhang 
---
 app/test/test_cryptodev.c| 24 +---
 drivers/crypto/scheduler/scheduler_pmd_ops.c | 18 +-
 2 files changed, 14 insertions(+), 28 deletions(-)

diff --git a/app/test/test_cryptodev.c b/app/test/test_cryptodev.c
index 357a92e..6fe5362 100644
--- a/app/test/test_cryptodev.c
+++ b/app/test/test_cryptodev.c
@@ -7382,17 +7382,8 @@ test_scheduler_attach_slave_op(void)
 {
struct crypto_testsuite_params *ts_params = &testsuite_params;
uint8_t sched_id = ts_params->valid_devs[0];
-   uint32_t nb_devs, qp_id, i, nb_devs_attached = 0;
+   uint32_t nb_devs, i, nb_devs_attached = 0;
int ret;
-   struct rte_cryptodev_config config = {
-   .nb_queue_pairs = 8,
-   .socket_id = SOCKET_ID_ANY,
-   .session_mp = {
-   .nb_objs = 2048,
-   .cache_size = 256
-   }
-   };
-   struct rte_cryptodev_qp_conf qp_conf = {2048};
 
/* create 2 AESNI_MB if necessary */
nb_devs = rte_cryptodev_count_devtype(
@@ -7418,19 +7409,6 @@ test_scheduler_attach_slave_op(void)
if (info.dev_type != RTE_CRYPTODEV_AESNI_MB_PMD)
continue;
 
-   ret = rte_cryptodev_configure(i, &config);
-   TEST_ASSERT(ret == 0,
-   "Failed to configure device %u of pmd : %s", i,
-   RTE_STR(CRYPTODEV_NAME_AESNI_MB_PMD));
-
-   for (qp_id = 0; qp_id < info.max_nb_queue_pairs; qp_id++) {
-   TEST_ASSERT_SUCCESS(rte_cryptodev_queue_pair_setup(
-   i, qp_id, &qp_conf,
-   rte_cryptodev_socket_id(i)),
-   "Failed to setup queue pair %u on "
-   "cryptodev %u", qp_id, i);
-   }
-
ret = rte_cryptodev_scheduler_slave_attach(sched_id,
(uint8_t)i);
 
diff --git a/drivers/crypto/scheduler/scheduler_pmd_ops.c 
b/drivers/crypto/scheduler/scheduler_pmd_ops.c
index 79be119..ea755e0 100644
--- a/drivers/crypto/scheduler/scheduler_pmd_ops.c
+++ b/drivers/crypto/scheduler/scheduler_pmd_ops.c
@@ -52,11 +52,8 @@ scheduler_pmd_config(struct rte_cryptodev *dev,
 
for (i = 0; i < sched_ctx->nb_slaves; i++) {
uint8_t slave_dev_id = sched_ctx->slaves[i].dev_id;
-   struct rte_cryptodev *slave_dev =
-   rte_cryptodev_pmd_get_dev(slave_dev_id);
 
-   ret = (*slave_dev->dev_ops->dev_configure)(slave_dev,
-   config);
+   ret = rte_cryptodev_configure(slave_dev_id, config);
if (ret < 0)
break;
}
@@ -340,11 +337,13 @@ scheduler_pmd_qp_release(struct rte_cryptodev *dev, 
uint16_t qp_id)
 /** Setup a queue pair */
 static int
 scheduler_pmd_qp_setup(struct rte_cryptodev *dev, uint16_t qp_id,
-   __rte_unused const struct rte_cryptodev_qp_conf *qp_conf, int socket_id)
+   const struct rte_cryptodev_qp_conf *qp_conf, int socket_id)
 {
struct scheduler_ctx *sched_ctx = dev->data->dev_private;
struct scheduler_qp_ctx *qp_ctx;
char name[RTE_CRYPTODEV_NAME_MAX_LEN];
+   uint32_t i;
+   int ret;
 
if (snprintf(name, RTE_CRYPTODEV_NAME_MAX_LEN,
"CRYTO_SCHE PMD %u QP %u",
@@ -357,6 +356,15 @@ scheduler_pmd_qp_setup(struct rte_cryptodev *dev, uint16_t 
qp_id,
if (dev->data->queue_pairs[qp_id] != NULL)
scheduler_pmd_qp_release(dev, qp_id);
 
+   for (i = 0; i < sched_ctx->nb_slaves; i++) {
+   uint8_t slave_id = sched_ctx->slaves[i].dev_id;
+
+   ret = rte_cryptodev_queue_pair_setup(slave_id, qp_id,
+   qp_conf, socket_id);
+   if (ret < 0)
+   return ret;
+   }
+
/* Allocate the queue pair data structure. */
qp_ctx = rte_zmalloc_socket(name, sizeof(*qp_ctx), RTE_CACHE_LINE_SIZE,
socket_id);
-- 
2.7.4



[dpdk-dev] [PATCH 3/3] doc: remove deprecation notice

2017-02-17 Thread Fan Zhang
Signed-off-by: Fan Zhang 
---
 doc/guides/rel_notes/deprecation.rst | 4 
 1 file changed, 4 deletions(-)

diff --git a/doc/guides/rel_notes/deprecation.rst 
b/doc/guides/rel_notes/deprecation.rst
index 9d4dfcc..3e17b20 100644
--- a/doc/guides/rel_notes/deprecation.rst
+++ b/doc/guides/rel_notes/deprecation.rst
@@ -119,10 +119,6 @@ Deprecation Notices
   or VHOST feature, but KNI_VHOST which is a KNI feature enabled via a compile
   time option, and disabled by default.
 
-* ABI changes are planned for 17.05 in the ``rte_cryptodev_ops`` structure.
-  A pointer to a rte_cryptodev_config structure will be added to the
-  function prototype ``cryptodev_configure_t``, as a new parameter.
-
 * cryptodev: A new parameter ``max_nb_sessions_per_qp`` will be added to
   ``rte_cryptodev_info.sym``. Some drivers may support limited number of
   sessions per queue_pair. With this new parameter application will know
-- 
2.7.4



Re: [dpdk-dev] [PATCHv7 00/47] NXP DPAA2 PMD

2017-02-17 Thread Bruce Richardson
On Fri, Feb 17, 2017 at 11:34:43AM +, Ferruh Yigit wrote:
> On 2/16/2017 1:27 PM, Bruce Richardson wrote:
> > On Thu, Feb 16, 2017 at 08:22:49AM -0500, Neil Horman wrote:
> >> On Thu, Feb 16, 2017 at 06:08:59AM +0530, Hemant Agrawal wrote:
> >>> The patch series adds NXP’s QorIQ-Layerscape DPAA2 Architecture based
> >>> fsl-mc bus driver and network SoC PMD.  This version of the driver
> >>> supports NXP LS208xA, LS204xA and LS108x families Network SoCs.
> >>>
> >>> DPAA2, or Data Path Acceleration Architecture, is a hardware architecture
> >>> designed for high-speed network packet processing. It uses a bus name
> >>> ‘fsl-mc’, part of Linux Kernel Staging tree [1], for resource management.
> >>>
> >>> A brief description of architecture is given below; detailed description
> >>> is part of the documentation in the patches itself.
> >>>
> >>> DPAA2 contains hardware component called the Management Complex (or MC).
> >>> It manages the DPAA2 hardware resources.  The MC provides an object-based
> >>> abstraction for software drivers to use the DPAA2 hardware.
> >>>
> >>> Some of the key objects are:
> >>> - DPNI, which refers to the network interface object.
> >>> - DPBP, which refers to HW based memory pool object
> >>> - DPIO, refers to processing context for accessing QBMAN
> >>>
> >>> Besides the MC, DPAA2 also includes a Hardware based Queue and Buffer 
> >>> Manager
> >>> called QBMAN. Prime responsibility of QBMAN is to allow lockless access to
> >>> software/user-space to the queues and buffers implemented in the hardware.
> >>>
> >>> The patch series could be logically structured into following sub-areas:
> >>> 1. Make file changes for crc in armv8 core machine type and driver 
> >>> dependency
> >>> 2. Common dpaa2 hw accelerator drivers for QBMAN.
> >>> 3. Indroducing fsl-mc bus as rte_bus, it's componenets.
> >>> 4. Introducing dpaa2 pmd driver
> >>> 5. Introducing dpaa2 mempool 
> >>> 6. Support for DPAA2 Ethernet Device (ethdev)
> >>> 7. Additional functionality in DPAA2 ethdev.
> >>>
> >>> The following design decisions are made during development:
> >>>
> >>> 1. DPAA2 implements a new bus called "fsl-mc" and some common accelerator 
> >>> drivers.
> >>>These drivers will be shared with dpaa2 based crypto drivers.
> >>>
> >>> 2. DPAA2 implements the HW mempool offload with DPBP object.
> >>>  - The new pool is being configured using compile time option and pool 
> >>> name
> >>>as "dpaa2".
> >>>
> >>> 3. It maintains per lcore DPIO objects and affine the DPIO instance to the
> >>>processing threads accessing the QBMAN HW.
> >>>
> >>> Prerequisites:
> >>>  - For running the PMD, NXP's SoC (board) and SDK (software/BSP) is 
> >>> required.
> >>>Information about obtaining relevant software is available in the docs
> >>>as part of the patch.
> >>
> >> NAK.  The SDK requires registration to obtain, and appears to be non-open
> >> source.  This driver is unmaintainable given that.
> >>
> > Hi Hemant,
> > 
> > can you perhaps clarify things here. What is the requirement to:
> > * build the driver/DPDK for the platform
> > * run applications using DPDK on the platform
> > 
> > Also what is the license/availability for those requirements.
> 
> Hi Bruce, Hemant, Neil, Thomas,
> 
> I did able to compile the driver without the SDK. It looks like that SDK
> is a runtime dependency.
> 
> What is the DPDK requirement here?
> If it is not breaking the build, this PMD provides it.
> 
If it builds without an SDK dependency I'd be happy enough to see this
merged into DPDK. Since I can't run it without the correct hardware, I
don't see needing the correct runtime software being a difficulty too.
So long as we can compile this, we can check for breakages in it due to
refactoring in other parts of DPDK, like we do for other drivers. As
with those other drivers, it's up to the maintainers/vendors to do the
functional checking for additional issues. Nobody is likely to have the
hardware to functionality test all drivers in DPDK anyway.

Regards,
/Bruce


Re: [dpdk-dev] [PATCHv7 00/47] NXP DPAA2 PMD

2017-02-17 Thread Vincent JARDIN

Le 17/02/2017 à 13:13, Bruce Richardson a écrit :

If it builds without an SDK dependency I'd be happy enough to see this
merged into DPDK.


+1, the patch is clean enough to be compiled.

Boards or CPUs can rely on specific SDK, firmware, etc., it is up to the 
vendors.


regards,
  Vincent


[dpdk-dev] [PATCH 1/2] mempool: remove deprecated count functions

2017-02-17 Thread Olivier Matz
As announced in the deprecation notice, remove these functions.

Signed-off-by: Olivier Matz 
---
 doc/guides/rel_notes/deprecation.rst   |  5 
 lib/librte_mempool/rte_mempool.c   |  6 -
 lib/librte_mempool/rte_mempool.h   | 41 --
 lib/librte_mempool/rte_mempool_version.map |  1 -
 4 files changed, 53 deletions(-)

diff --git a/doc/guides/rel_notes/deprecation.rst 
b/doc/guides/rel_notes/deprecation.rst
index 9d4dfcc..79660ec 100644
--- a/doc/guides/rel_notes/deprecation.rst
+++ b/doc/guides/rel_notes/deprecation.rst
@@ -87,11 +87,6 @@ Deprecation Notices
   PKT_RX_QINQ_STRIPPED, that are better described. The old flags and
   their behavior will be kept until 17.02 and will be removed in 17.05.
 
-* mempool: The functions ``rte_mempool_count`` and ``rte_mempool_free_count``
-  will be removed in 17.05.
-  They are replaced by ``rte_mempool_avail_count`` and
-  ``rte_mempool_in_use_count`` respectively.
-
 * mempool: The functions for single/multi producer/consumer are deprecated
   and will be removed in 17.05.
   It is replaced by ``rte_mempool_generic_get/put`` functions.
diff --git a/lib/librte_mempool/rte_mempool.c b/lib/librte_mempool/rte_mempool.c
index 1c2aed8..40d3afd 100644
--- a/lib/librte_mempool/rte_mempool.c
+++ b/lib/librte_mempool/rte_mempool.c
@@ -997,12 +997,6 @@ rte_mempool_in_use_count(const struct rte_mempool *mp)
return mp->size - rte_mempool_avail_count(mp);
 }
 
-unsigned int
-rte_mempool_count(const struct rte_mempool *mp)
-{
-   return rte_mempool_avail_count(mp);
-}
-
 /* dump the cache status */
 static unsigned
 rte_mempool_dump_cache(FILE *f, const struct rte_mempool *mp)
diff --git a/lib/librte_mempool/rte_mempool.h b/lib/librte_mempool/rte_mempool.h
index d0f5b27..7a31685 100644
--- a/lib/librte_mempool/rte_mempool.h
+++ b/lib/librte_mempool/rte_mempool.h
@@ -1508,22 +1508,6 @@ rte_mempool_get(struct rte_mempool *mp, void **obj_p)
 unsigned int rte_mempool_avail_count(const struct rte_mempool *mp);
 
 /**
- * @deprecated
- * Return the number of entries in the mempool.
- *
- * When cache is enabled, this function has to browse the length of
- * all lcores, so it should not be used in a data path, but only for
- * debug purposes.
- *
- * @param mp
- *   A pointer to the mempool structure.
- * @return
- *   The number of entries in the mempool.
- */
-__rte_deprecated
-unsigned rte_mempool_count(const struct rte_mempool *mp);
-
-/**
  * Return the number of elements which have been allocated from the mempool
  *
  * When cache is enabled, this function has to browse the length of
@@ -1539,31 +1523,6 @@ unsigned int
 rte_mempool_in_use_count(const struct rte_mempool *mp);
 
 /**
- * @deprecated
- * Return the number of free entries in the mempool ring.
- * i.e. how many entries can be freed back to the mempool.
- *
- * NOTE: This corresponds to the number of elements *allocated* from the
- * memory pool, not the number of elements in the pool itself. To count
- * the number elements currently available in the pool, use "rte_mempool_count"
- *
- * When cache is enabled, this function has to browse the length of
- * all lcores, so it should not be used in a data path, but only for
- * debug purposes. User-owned mempool caches are not accounted for.
- *
- * @param mp
- *   A pointer to the mempool structure.
- * @return
- *   The number of free entries in the mempool.
- */
-__rte_deprecated
-static inline unsigned
-rte_mempool_free_count(const struct rte_mempool *mp)
-{
-   return rte_mempool_in_use_count(mp);
-}
-
-/**
  * Test if the mempool is full.
  *
  * When cache is enabled, this function has to browse the length of all
diff --git a/lib/librte_mempool/rte_mempool_version.map 
b/lib/librte_mempool/rte_mempool_version.map
index dee1c99..f9c0794 100644
--- a/lib/librte_mempool/rte_mempool_version.map
+++ b/lib/librte_mempool/rte_mempool_version.map
@@ -3,7 +3,6 @@ DPDK_2.0 {
 
rte_mempool_audit;
rte_mempool_calc_obj_size;
-   rte_mempool_count;
rte_mempool_create;
rte_mempool_dump;
rte_mempool_list_dump;
-- 
2.8.1



[dpdk-dev] [PATCH 2/2] mempool: remove deprecated get and put functions

2017-02-17 Thread Olivier Matz
As announced in the deprecation notice, remove the functions for
single/multi producer/consumer enqueue/dequeue.

Signed-off-by: Olivier Matz 
---
 doc/guides/rel_notes/deprecation.rst |   4 -
 lib/librte_mempool/rte_mempool.h | 180 ---
 2 files changed, 184 deletions(-)

diff --git a/doc/guides/rel_notes/deprecation.rst 
b/doc/guides/rel_notes/deprecation.rst
index 79660ec..642f770 100644
--- a/doc/guides/rel_notes/deprecation.rst
+++ b/doc/guides/rel_notes/deprecation.rst
@@ -87,10 +87,6 @@ Deprecation Notices
   PKT_RX_QINQ_STRIPPED, that are better described. The old flags and
   their behavior will be kept until 17.02 and will be removed in 17.05.
 
-* mempool: The functions for single/multi producer/consumer are deprecated
-  and will be removed in 17.05.
-  It is replaced by ``rte_mempool_generic_get/put`` functions.
-
 * ethdev: the legacy filter API, including
   ``rte_eth_dev_filter_supported()``, ``rte_eth_dev_filter_ctrl()`` as well
   as filter types MACVLAN, ETHERTYPE, FLEXIBLE, SYN, NTUPLE, TUNNEL, FDIR,
diff --git a/lib/librte_mempool/rte_mempool.h b/lib/librte_mempool/rte_mempool.h
index 7a31685..991feaa 100644
--- a/lib/librte_mempool/rte_mempool.h
+++ b/lib/librte_mempool/rte_mempool.h
@@ -1108,46 +1108,6 @@ rte_mempool_generic_put(struct rte_mempool *mp, void * 
const *obj_table,
 }
 
 /**
- * @deprecated
- * Put several objects back in the mempool (multi-producers safe).
- *
- * @param mp
- *   A pointer to the mempool structure.
- * @param obj_table
- *   A pointer to a table of void * pointers (objects).
- * @param n
- *   The number of objects to add in the mempool from the obj_table.
- */
-__rte_deprecated
-static inline void __attribute__((always_inline))
-rte_mempool_mp_put_bulk(struct rte_mempool *mp, void * const *obj_table,
-   unsigned n)
-{
-   struct rte_mempool_cache *cache;
-   cache = rte_mempool_default_cache(mp, rte_lcore_id());
-   rte_mempool_generic_put(mp, obj_table, n, cache, 0);
-}
-
-/**
- * @deprecated
- * Put several objects back in the mempool (NOT multi-producers safe).
- *
- * @param mp
- *   A pointer to the mempool structure.
- * @param obj_table
- *   A pointer to a table of void * pointers (objects).
- * @param n
- *   The number of objects to add in the mempool from obj_table.
- */
-__rte_deprecated
-static inline void __attribute__((always_inline))
-rte_mempool_sp_put_bulk(struct rte_mempool *mp, void * const *obj_table,
-   unsigned n)
-{
-   rte_mempool_generic_put(mp, obj_table, n, NULL, MEMPOOL_F_SP_PUT);
-}
-
-/**
  * Put several objects back in the mempool.
  *
  * This function calls the multi-producer or the single-producer
@@ -1171,40 +1131,6 @@ rte_mempool_put_bulk(struct rte_mempool *mp, void * 
const *obj_table,
 }
 
 /**
- * @deprecated
- * Put one object in the mempool (multi-producers safe).
- *
- * @param mp
- *   A pointer to the mempool structure.
- * @param obj
- *   A pointer to the object to be added.
- */
-__rte_deprecated
-static inline void __attribute__((always_inline))
-rte_mempool_mp_put(struct rte_mempool *mp, void *obj)
-{
-   struct rte_mempool_cache *cache;
-   cache = rte_mempool_default_cache(mp, rte_lcore_id());
-   rte_mempool_generic_put(mp, &obj, 1, cache, 0);
-}
-
-/**
- * @deprecated
- * Put one object back in the mempool (NOT multi-producers safe).
- *
- * @param mp
- *   A pointer to the mempool structure.
- * @param obj
- *   A pointer to the object to be added.
- */
-__rte_deprecated
-static inline void __attribute__((always_inline))
-rte_mempool_sp_put(struct rte_mempool *mp, void *obj)
-{
-   rte_mempool_generic_put(mp, &obj, 1, NULL, MEMPOOL_F_SP_PUT);
-}
-
-/**
  * Put one object back in the mempool.
  *
  * This function calls the multi-producer or the single-producer
@@ -1332,62 +1258,6 @@ rte_mempool_generic_get(struct rte_mempool *mp, void 
**obj_table, unsigned n,
 }
 
 /**
- * @deprecated
- * Get several objects from the mempool (multi-consumers safe).
- *
- * If cache is enabled, objects will be retrieved first from cache,
- * subsequently from the common pool. Note that it can return -ENOENT when
- * the local cache and common pool are empty, even if cache from other
- * lcores are full.
- *
- * @param mp
- *   A pointer to the mempool structure.
- * @param obj_table
- *   A pointer to a table of void * pointers (objects) that will be filled.
- * @param n
- *   The number of objects to get from mempool to obj_table.
- * @return
- *   - 0: Success; objects taken.
- *   - -ENOENT: Not enough entries in the mempool; no object is retrieved.
- */
-__rte_deprecated
-static inline int __attribute__((always_inline))
-rte_mempool_mc_get_bulk(struct rte_mempool *mp, void **obj_table, unsigned n)
-{
-   struct rte_mempool_cache *cache;
-   cache = rte_mempool_default_cache(mp, rte_lcore_id());
-   return rte_mempool_generic_get(mp, obj_table, n, cache, 0);
-}
-
-/**
- * @deprecated
- * Get 

Re: [dpdk-dev] [PATCH] igb_uio: map dummy dma forcing iommu domain attachment

2017-02-17 Thread Ferruh Yigit
On 1/18/2017 12:27 PM, Alejandro Lucero wrote:
> For using a DPDK app when iommu is enabled, it requires to
> add iommu=pt to the kernel command line. But using igb_uio driver
> makes DMAR errors because the device has not an IOMMU domain.
> 
> Since kernel 3.15, iommu=pt requires to use the internal kernel
> DMA API for attaching the device to the IOMMU 1:1 mapping, aka
> si_domain. Previous versions did attach the device to that
> domain when intel iommu notifier was called.
> 
> This is not a problem if the driver does later some call to the
> DMA API because the mapping can be done then. But DPDK apps do
> not use that DMA API at all.
> 
> Doing this dma map and unmap is harmless even when iommu is not
> enabled at all.
> 
> Signed-off-by: Alejandro Lucero 

Acked-by: Ferruh Yigit 

(I suggest getting this early in 17.05 release, so it can be tested more)


Re: [dpdk-dev] [PATCHv7 00/47] NXP DPAA2 PMD

2017-02-17 Thread Hemant Agrawal

On 2/16/2017 6:57 PM, Bruce Richardson wrote:

On Thu, Feb 16, 2017 at 08:22:49AM -0500, Neil Horman wrote:

On Thu, Feb 16, 2017 at 06:08:59AM +0530, Hemant Agrawal wrote:

The patch series adds NXP’s QorIQ-Layerscape DPAA2 Architecture based
fsl-mc bus driver and network SoC PMD.  This version of the driver
supports NXP LS208xA, LS204xA and LS108x families Network SoCs.

DPAA2, or Data Path Acceleration Architecture, is a hardware architecture
designed for high-speed network packet processing. It uses a bus name
‘fsl-mc’, part of Linux Kernel Staging tree [1], for resource management.

A brief description of architecture is given below; detailed description
is part of the documentation in the patches itself.

DPAA2 contains hardware component called the Management Complex (or MC).
It manages the DPAA2 hardware resources.  The MC provides an object-based
abstraction for software drivers to use the DPAA2 hardware.

Some of the key objects are:
- DPNI, which refers to the network interface object.
- DPBP, which refers to HW based memory pool object
- DPIO, refers to processing context for accessing QBMAN

Besides the MC, DPAA2 also includes a Hardware based Queue and Buffer Manager
called QBMAN. Prime responsibility of QBMAN is to allow lockless access to
software/user-space to the queues and buffers implemented in the hardware.

The patch series could be logically structured into following sub-areas:
1. Make file changes for crc in armv8 core machine type and driver dependency
2. Common dpaa2 hw accelerator drivers for QBMAN.
3. Indroducing fsl-mc bus as rte_bus, it's componenets.
4. Introducing dpaa2 pmd driver
5. Introducing dpaa2 mempool
6. Support for DPAA2 Ethernet Device (ethdev)
7. Additional functionality in DPAA2 ethdev.

The following design decisions are made during development:

1. DPAA2 implements a new bus called "fsl-mc" and some common accelerator 
drivers.
   These drivers will be shared with dpaa2 based crypto drivers.

2. DPAA2 implements the HW mempool offload with DPBP object.
 - The new pool is being configured using compile time option and pool name
   as "dpaa2".

3. It maintains per lcore DPIO objects and affine the DPIO instance to the
   processing threads accessing the QBMAN HW.

Prerequisites:
 - For running the PMD, NXP's SoC (board) and SDK (software/BSP) is required.
   Information about obtaining relevant software is available in the docs
   as part of the patch.


NAK.  The SDK requires registration to obtain, and appears to be non-open
source.  This driver is unmaintainable given that.


Hi Hemant,

can you perhaps clarify things here. What is the requirement to:
* build the driver/DPDK for the platform
* run applications using DPDK on the platform

Also what is the license/availability for those requirements.

/Bruce



Hi Neil, Bruce,
	I thought SDK is a simpler choice to get the required components in one 
place. However there is no such restriction to get the components only 
from the NXP SDK.

We will update the documentation with the same.

Following is a list of open source components required:

1. ARM 64 tool chain.
e.g. *aarch64* Linaro Toolchain:
https://releases.linaro.org/components/toolchain/binaries/4.9-2017.01/aarch64-linux-gnu/

2. Linux Kernel

http://git.freescale.com/git/cgit.cgi/ppc/sdk/linux.git/log/?h=sdk-v2.0.x
or,
https://github.com/qoriq-open-source/linux
Please note that the particular linux kernel, I have used for my testing 
is 4.1.8 (part of our SDK 2.0-17.01),  I will publish the tree at github 
shortly.


3. Rootfile system : any aarch64 supported e.g.
Ubuntu 15.10 (Wily) or 16.04 LTS (Xenial) userland
http://cdimage.ubuntu.com/ubuntu-base/releases/16.04/release/ubuntu-base-16.04.1-base-arm64.tar.gz

Initial kernel is best built in a Yocto build environment, then deployed 
to target with a disk based Ubuntu userland.
However, both kernel and DPDK release can be natively built on the 
target platform, using Ubuntu devtools.


Regards,
Hemant





Re: [dpdk-dev] [RFC 0/8] mbuf: structure reorganization

2017-02-17 Thread Nélio Laranjeiro
Hi Olivier, Jan,

On Fri, Feb 17, 2017 at 11:51:53AM +0100, Olivier Matz wrote:
> Hi Jan,
> 
> On Thu, 16 Feb 2017 18:26:39 +0100, Jan Blunck 
> wrote:
> > On Thu, Feb 16, 2017 at 2:48 PM, Olivier Matz
> >  wrote:
> > > On Mon, 6 Feb 2017 18:41:27 +, "Ananyev, Konstantin"
> > >  wrote:  
> > >> >
> > >> > The main changes are:
> > >> > - reorder structure to increase vector performance on some non-ia
> > >> >   platforms.
> > >> > - add a 64bits timestamp field in the 1st cache line  
> > >>
> > >> Wonder why it deserves to be in first cache line?
> > >> How it differs from seqn below (pure SW stuff right now).  
> > >
> > > In case the timestamp is set from a NIC value, it is set in the Rx
> > > path. So that's why I think it deserve to be located in the 1st
> > > cache line.
> > >
> > > As you said, the seqn is a pure sw stuff right: it is set in a lib,
> > > not in a PMD rx path.
> > >  
> > 
> > If we talk about setting the timestamp value in the RX path this
> > implicitly means software timestamps. Hardware timestamping usually
> > works by letting the hardware inject sync events for coarse time
> > tracking and additionally injecting fine granular per-packet ticks at
> > a specific offset in the packet. Out of performance reasons I don't
> > think it makes sense to extract this during the burst and write it
> > into the mbuf again.
> 
> From what I've understand, at least it does not work like this for
> mellanox NICs: timestamp is a metadata attached to a rx packet. But
> maybe they (and other NIC vendors interrested in the feature) can
> confirm or not.

Olivier is right, this timestamp information is returned by the hardware
as the other fields describing the Rx packet (length, RSS hash, checksum
...).  The PMD only copy it into the Mbuf.

> > The problem with timestamps is to get the abstraction right wrt the
> > correction factors and the size of the tick vs. the timestamp in the
> > events injected. From my perspective it would be better to extract the
> > handling of timestamp data into a library with PMD specific
> > implementation of the conversions. That way the normalized timestamp
> > values can get extracted if they are present. The mbuf itself would
> > only indicate the presence of timestamp metadata in that case.
> 
> I agree however that we need to properly define the meaning of this
> field. My idea is:
> 
> - the timestamp is in nanosecond
> - the reference is always the same for a given path: if the timestamp is
>   set in a PMD, all the packets for this PMD will have the same
>   reference, but for 2 different PMDs (or a sw lib), the reference
>   would not be the same.
> 
> I think it's enough for many use cases.
> We can later add helpers to compare timestamps with different
> references.
> 
> Regards,
> Olivier

Regards,

-- 
Nélio Laranjeiro
6WIND


Re: [dpdk-dev] [RFC 0/8] mbuf: structure reorganization

2017-02-17 Thread Jan Blunck
On Fri, Feb 17, 2017 at 11:51 AM, Olivier Matz  wrote:
> Hi Jan,
>
> On Thu, 16 Feb 2017 18:26:39 +0100, Jan Blunck 
> wrote:
>> On Thu, Feb 16, 2017 at 2:48 PM, Olivier Matz
>>  wrote:
>> > On Mon, 6 Feb 2017 18:41:27 +, "Ananyev, Konstantin"
>> >  wrote:
>> >> >
>> >> > The main changes are:
>> >> > - reorder structure to increase vector performance on some non-ia
>> >> >   platforms.
>> >> > - add a 64bits timestamp field in the 1st cache line
>> >>
>> >> Wonder why it deserves to be in first cache line?
>> >> How it differs from seqn below (pure SW stuff right now).
>> >
>> > In case the timestamp is set from a NIC value, it is set in the Rx
>> > path. So that's why I think it deserve to be located in the 1st
>> > cache line.
>> >
>> > As you said, the seqn is a pure sw stuff right: it is set in a lib,
>> > not in a PMD rx path.
>> >
>>
>> If we talk about setting the timestamp value in the RX path this
>> implicitly means software timestamps. Hardware timestamping usually
>> works by letting the hardware inject sync events for coarse time
>> tracking and additionally injecting fine granular per-packet ticks at
>> a specific offset in the packet. Out of performance reasons I don't
>> think it makes sense to extract this during the burst and write it
>> into the mbuf again.
>
> From what I've understand, at least it does not work like this for
> mellanox NICs: timestamp is a metadata attached to a rx packet. But
> maybe they (and other NIC vendors interrested in the feature) can
> confirm or not.
>

Mellanox NICs use a 48bit cycle counter split into a high and low
part. To convert the cycle values into a timestamp you need to
initialize and maintainer a timecounter that shifts the cycle count
e.g. nanosecs. IIRC Mellanox doesn't generate explicit clock events
but the cycle counter is large enough so that the user can easily
maintain the timecounter by manually updating it.

>>
>> The problem with timestamps is to get the abstraction right wrt the
>> correction factors and the size of the tick vs. the timestamp in the
>> events injected. From my perspective it would be better to extract the
>> handling of timestamp data into a library with PMD specific
>> implementation of the conversions. That way the normalized timestamp
>> values can get extracted if they are present. The mbuf itself would
>> only indicate the presence of timestamp metadata in that case.
>
> I agree however that we need to properly define the meaning of this
> field. My idea is:
>
> - the timestamp is in nanosecond
> - the reference is always the same for a given path: if the timestamp is
>   set in a PMD, all the packets for this PMD will have the same
>   reference, but for 2 different PMDs (or a sw lib), the reference
>   would not be the same.
>
> I think it's enough for many use cases.
> We can later add helpers to compare timestamps with different
> references.

My point is that I still doubt that it belongs into the first
cacheline. It requires accessing other structures for converting into
nanoseconds anyway. Optimally I would like to see this happening on
access instead but if that isn't achievable at least in a second step.


Re: [dpdk-dev] [PATCHv7 00/47] NXP DPAA2 PMD

2017-02-17 Thread Thomas Monjalon
2017-02-17 12:13, Bruce Richardson:
> On Fri, Feb 17, 2017 at 11:34:43AM +, Ferruh Yigit wrote:
> > On 2/16/2017 1:27 PM, Bruce Richardson wrote:
> > > On Thu, Feb 16, 2017 at 08:22:49AM -0500, Neil Horman wrote:
> > >> On Thu, Feb 16, 2017 at 06:08:59AM +0530, Hemant Agrawal wrote:
> > >>> Prerequisites:
> > >>>  - For running the PMD, NXP's SoC (board) and SDK (software/BSP) is 
> > >>> required.
> > >>>Information about obtaining relevant software is available in the 
> > >>> docs
> > >>>as part of the patch.
> > >>
> > >> NAK.  The SDK requires registration to obtain, and appears to be non-open
> > >> source.  This driver is unmaintainable given that.
> > >>
> > > Hi Hemant,
> > > 
> > > can you perhaps clarify things here. What is the requirement to:
> > > * build the driver/DPDK for the platform
> > > * run applications using DPDK on the platform
> > > 
> > > Also what is the license/availability for those requirements.
> > 
> > Hi Bruce, Hemant, Neil, Thomas,
> > 
> > I did able to compile the driver without the SDK. It looks like that SDK
> > is a runtime dependency.
> > 
> > What is the DPDK requirement here?
> > If it is not breaking the build, this PMD provides it.
> > 
> If it builds without an SDK dependency I'd be happy enough to see this
> merged into DPDK. Since I can't run it without the correct hardware, I
> don't see needing the correct runtime software being a difficulty too.
> So long as we can compile this, we can check for breakages in it due to
> refactoring in other parts of DPDK, like we do for other drivers. As
> with those other drivers, it's up to the maintainers/vendors to do the
> functional checking for additional issues. Nobody is likely to have the
> hardware to functionality test all drivers in DPDK anyway.

+1, I agree with Bruce


[dpdk-dev] [PATCH v9] net/kni: add KNI PMD

2017-02-17 Thread Ferruh Yigit
Add KNI PMD which wraps librte_kni for ease of use.

KNI PMD can be used as any regular PMD to send / receive packets to the
Linux networking stack.

Signed-off-by: Ferruh Yigit 
Reviewed-by: Yong Wang 
---

v9:
* update for 17.05

v8:
* Don't try to link against librte_pmd_kni if librte_kni is disabled

v7:
* Add dependency to CONFIG_RTE_LIBRTE_KNI config

v6:
* documentation typos fixed

v5:
* add kvargs "no_request_thread" to disable a specific pthread creation
to handle control requests.
* add documentation

v4:
* allow only single queue
* use driver.name as name

v3:
* rebase on top of latest master

v2:
* updated driver name eth_kni -> net_kni
---
 MAINTAINERS |   5 +
 config/common_base  |   1 +
 config/common_linuxapp  |   1 +
 doc/guides/nics/features/kni.ini|   7 +
 doc/guides/nics/index.rst   |   1 +
 doc/guides/nics/kni.rst | 197 
 drivers/net/Makefile|   4 +
 drivers/net/kni/Makefile|  64 
 drivers/net/kni/rte_eth_kni.c   | 515 
 drivers/net/kni/rte_pmd_kni_version.map |   4 +
 mk/rte.app.mk   |  12 +-
 11 files changed, 806 insertions(+), 5 deletions(-)
 create mode 100644 doc/guides/nics/features/kni.ini
 create mode 100644 doc/guides/nics/kni.rst
 create mode 100644 drivers/net/kni/Makefile
 create mode 100644 drivers/net/kni/rte_eth_kni.c
 create mode 100644 drivers/net/kni/rte_pmd_kni_version.map

diff --git a/MAINTAINERS b/MAINTAINERS
index 8305237..b4617fc 100644
--- a/MAINTAINERS
+++ b/MAINTAINERS
@@ -412,6 +412,11 @@ M: Keith Wiles 
 F: drivers/net/tap/
 F: doc/guides/nics/tap.rst
 
+KNI PMD
+M: Ferruh Yigit 
+F: drivers/net/kni/
+F: doc/guides/nics/kni.rst
+
 Ring PMD
 M: Bruce Richardson 
 F: drivers/net/ring/
diff --git a/config/common_base b/config/common_base
index 71a4fcb..63756c4 100644
--- a/config/common_base
+++ b/config/common_base
@@ -581,6 +581,7 @@ CONFIG_RTE_PIPELINE_STATS_COLLECT=n
 # Compile librte_kni
 #
 CONFIG_RTE_LIBRTE_KNI=n
+CONFIG_RTE_LIBRTE_PMD_KNI=n
 CONFIG_RTE_KNI_KMOD=n
 CONFIG_RTE_KNI_KMOD_ETHTOOL=n
 CONFIG_RTE_KNI_PREEMPT_DEFAULT=y
diff --git a/config/common_linuxapp b/config/common_linuxapp
index 00ebaac..d03a60a 100644
--- a/config/common_linuxapp
+++ b/config/common_linuxapp
@@ -39,6 +39,7 @@ CONFIG_RTE_EAL_IGB_UIO=y
 CONFIG_RTE_EAL_VFIO=y
 CONFIG_RTE_KNI_KMOD=y
 CONFIG_RTE_LIBRTE_KNI=y
+CONFIG_RTE_LIBRTE_PMD_KNI=y
 CONFIG_RTE_LIBRTE_VHOST=y
 CONFIG_RTE_LIBRTE_PMD_VHOST=y
 CONFIG_RTE_LIBRTE_PMD_AF_PACKET=y
diff --git a/doc/guides/nics/features/kni.ini b/doc/guides/nics/features/kni.ini
new file mode 100644
index 000..6deb66a
--- /dev/null
+++ b/doc/guides/nics/features/kni.ini
@@ -0,0 +1,7 @@
+;
+; Supported features of the 'kni' network poll mode driver.
+;
+; Refer to default.ini for the full list of available PMD features.
+;
+[Features]
+Usage doc= Y
diff --git a/doc/guides/nics/index.rst b/doc/guides/nics/index.rst
index 87f9334..5248625 100644
--- a/doc/guides/nics/index.rst
+++ b/doc/guides/nics/index.rst
@@ -46,6 +46,7 @@ Network Interface Controller Drivers
 i40e
 ixgbe
 intel_vf
+kni
 mlx4
 mlx5
 nfp
diff --git a/doc/guides/nics/kni.rst b/doc/guides/nics/kni.rst
new file mode 100644
index 000..77542b5
--- /dev/null
+++ b/doc/guides/nics/kni.rst
@@ -0,0 +1,197 @@
+..  BSD LICENSE
+Copyright(c) 2017 Intel Corporation. All rights reserved.
+All rights reserved.
+
+Redistribution and use in source and binary forms, with or without
+modification, are permitted provided that the following conditions
+are met:
+
+* Redistributions of source code must retain the above copyright
+notice, this list of conditions and the following disclaimer.
+* Redistributions in binary form must reproduce the above copyright
+notice, this list of conditions and the following disclaimer in
+the documentation and/or other materials provided with the
+distribution.
+* Neither the name of Intel Corporation nor the names of its
+contributors may be used to endorse or promote products derived
+from this software without specific prior written permission.
+
+THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
+"AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
+LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
+A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
+OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
+SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
+LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
+DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
+THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
+(INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY 

Re: [dpdk-dev] [PATCH v9] net/kni: add KNI PMD

2017-02-17 Thread Thomas Monjalon
2017-02-17 13:42, Ferruh Yigit:
> Add KNI PMD which wraps librte_kni for ease of use.
> 
> KNI PMD can be used as any regular PMD to send / receive packets to the
> Linux networking stack.
> 
> Signed-off-by: Ferruh Yigit 
> Reviewed-by: Yong Wang 
> ---
> 
> v9:
> * update for 17.05

You keep updating this patch in the hope that someone would be interested :)

Please let's make clear that I am OK to merge it
but you asked me to wait for someone supporting its inclusion.


Re: [dpdk-dev] [RFC 0/8] mbuf: structure reorganization

2017-02-17 Thread Jan Blunck
On Fri, Feb 17, 2017 at 1:49 PM, Nélio Laranjeiro
 wrote:
> Hi Olivier, Jan,
>
> On Fri, Feb 17, 2017 at 11:51:53AM +0100, Olivier Matz wrote:
>> Hi Jan,
>>
>> On Thu, 16 Feb 2017 18:26:39 +0100, Jan Blunck 
>> wrote:
>> >
>> > If we talk about setting the timestamp value in the RX path this
>> > implicitly means software timestamps. Hardware timestamping usually
>> > works by letting the hardware inject sync events for coarse time
>> > tracking and additionally injecting fine granular per-packet ticks at
>> > a specific offset in the packet. Out of performance reasons I don't
>> > think it makes sense to extract this during the burst and write it
>> > into the mbuf again.
>>
>> From what I've understand, at least it does not work like this for
>> mellanox NICs: timestamp is a metadata attached to a rx packet. But
>> maybe they (and other NIC vendors interrested in the feature) can
>> confirm or not.
>
> Olivier is right, this timestamp information is returned by the hardware
> as the other fields describing the Rx packet (length, RSS hash, checksum
> ...).  The PMD only copy it into the Mbuf.
>

Indeed, for Mellanox the timestamp is stored in the CQ entry.
Solarflares write it relative to the packet header.


Re: [dpdk-dev] [PATCH v9] net/kni: add KNI PMD

2017-02-17 Thread Eelco Chaudron

On 17/02/17 14:47, Thomas Monjalon wrote:

2017-02-17 13:42, Ferruh Yigit:

Add KNI PMD which wraps librte_kni for ease of use.

KNI PMD can be used as any regular PMD to send / receive packets to the
Linux networking stack.

Signed-off-by: Ferruh Yigit 
Reviewed-by: Yong Wang 
---

v9:
* update for 17.05

You keep updating this patch in the hope that someone would be interested :)


We needed this a while back in DPDK for warp17, however I ended up 
implementing this myself, 
https://github.com/Juniper/warp17/blob/master/src/kni_if/tpg_kni_pmd.c


I think its useful,  so you do not have to use different APIs when 
sending/receiving over multiple types of interfaces.



Please let's make clear that I am OK to merge it
but you asked me to wait for someone supporting its inclusion.





Re: [dpdk-dev] [RFC 0/8] mbuf: structure reorganization

2017-02-17 Thread Olivier Matz
Hi Jan,

On Fri, 17 Feb 2017 14:38:32 +0100, Jan Blunck 
wrote:
> On Fri, Feb 17, 2017 at 11:51 AM, Olivier Matz
>  wrote:
> > Hi Jan,
> >
> > On Thu, 16 Feb 2017 18:26:39 +0100, Jan Blunck
> >  wrote:  
> >> On Thu, Feb 16, 2017 at 2:48 PM, Olivier Matz
> >>  wrote:  
> >> > On Mon, 6 Feb 2017 18:41:27 +, "Ananyev, Konstantin"
> >> >  wrote:  
> >> >> >
> >> >> > The main changes are:
> >> >> > - reorder structure to increase vector performance on some
> >> >> > non-ia platforms.
> >> >> > - add a 64bits timestamp field in the 1st cache line  
> >> >>
> >> >> Wonder why it deserves to be in first cache line?
> >> >> How it differs from seqn below (pure SW stuff right now).  
> >> >
> >> > In case the timestamp is set from a NIC value, it is set in the
> >> > Rx path. So that's why I think it deserve to be located in the
> >> > 1st cache line.
> >> >
> >> > As you said, the seqn is a pure sw stuff right: it is set in a
> >> > lib, not in a PMD rx path.
> >> >  
> >>
> >> If we talk about setting the timestamp value in the RX path this
> >> implicitly means software timestamps. Hardware timestamping usually
> >> works by letting the hardware inject sync events for coarse time
> >> tracking and additionally injecting fine granular per-packet ticks
> >> at a specific offset in the packet. Out of performance reasons I
> >> don't think it makes sense to extract this during the burst and
> >> write it into the mbuf again.  
> >
> > From what I've understand, at least it does not work like this for
> > mellanox NICs: timestamp is a metadata attached to a rx packet. But
> > maybe they (and other NIC vendors interrested in the feature) can
> > confirm or not.
> >  
> 
> Mellanox NICs use a 48bit cycle counter split into a high and low
> part. To convert the cycle values into a timestamp you need to
> initialize and maintainer a timecounter that shifts the cycle count
> e.g. nanosecs. IIRC Mellanox doesn't generate explicit clock events
> but the cycle counter is large enough so that the user can easily
> maintain the timecounter by manually updating it.
> 
> >>
> >> The problem with timestamps is to get the abstraction right wrt the
> >> correction factors and the size of the tick vs. the timestamp in
> >> the events injected. From my perspective it would be better to
> >> extract the handling of timestamp data into a library with PMD
> >> specific implementation of the conversions. That way the
> >> normalized timestamp values can get extracted if they are present.
> >> The mbuf itself would only indicate the presence of timestamp
> >> metadata in that case.  
> >
> > I agree however that we need to properly define the meaning of this
> > field. My idea is:
> >
> > - the timestamp is in nanosecond
> > - the reference is always the same for a given path: if the
> > timestamp is set in a PMD, all the packets for this PMD will have
> > the same reference, but for 2 different PMDs (or a sw lib), the
> > reference would not be the same.
> >
> > I think it's enough for many use cases.
> > We can later add helpers to compare timestamps with different
> > references.  
> 
> My point is that I still doubt that it belongs into the first
> cacheline. It requires accessing other structures for converting into
> nanoseconds anyway. Optimally I would like to see this happening on
> access instead but if that isn't achievable at least in a second step.

Sorry, I don't really get your point. My comprehension of the timestamp
usage in a PMD is as following:

rx_burst(struct rxq *rxq, ...)
{
unsigned long factor = rxq->timestamp_factor;
unsigned port = rxq->port;

for each hw_desc {
m = rte_pktmbuf_alloc(rxq->pool);
m->len = hw_desc->len;
m->port = port;
m->ol_flags = 
...
m->timestamp = hw_desc->timestamp * factor;
}
...
}

In that case, I think it deserves to be in the 1st cache line.


Olivier


Re: [dpdk-dev] [PATCH v9] net/kni: add KNI PMD

2017-02-17 Thread Ferruh Yigit
On 2/17/2017 1:47 PM, Thomas Monjalon wrote:
> 2017-02-17 13:42, Ferruh Yigit:
>> Add KNI PMD which wraps librte_kni for ease of use.
>>
>> KNI PMD can be used as any regular PMD to send / receive packets to the
>> Linux networking stack.
>>
>> Signed-off-by: Ferruh Yigit 
>> Reviewed-by: Yong Wang 
>> ---
>>
>> v9:
>> * update for 17.05
> 
> You keep updating this patch in the hope that someone would be interested :)
> 
> Please let's make clear that I am OK to merge it
> but you asked me to wait for someone supporting its inclusion.

Right, it is good to mention that I explicitly asked to wait community
response.

I keep updating it because I believe this is something useful.

Meanwhile adding this into repo means maintenance cost, so this should
not be merged without any usecase or interest from community.

Patch is waiting for an ACK or NAK from community.

Thanks,
ferruh


[dpdk-dev] [PATCH] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Keith Wiles
Calling strncpy with a maximum size argument of 16 bytes on destination
array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
destination string unterminated.

Signed-off-by: Keith Wiles 
---
 drivers/net/tap/rte_eth_tap.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/drivers/net/tap/rte_eth_tap.c b/drivers/net/tap/rte_eth_tap.c
index efc4426..f9938d7 100644
--- a/drivers/net/tap/rte_eth_tap.c
+++ b/drivers/net/tap/rte_eth_tap.c
@@ -297,7 +297,7 @@ tap_link_set_flags(struct pmd_internals *pmd, short flags, 
int add)
return -1;
}
memset(&ifr, 0, sizeof(ifr));
-   strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
+   strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ-1);
err = ioctl(s, SIOCGIFFLAGS, &ifr);
if (err < 0) {
RTE_LOG(WARNING, PMD, "Unable to get %s device flags: %s\n",
-- 
2.8.0.GIT



Re: [dpdk-dev] [PATCH] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Wiles, Keith

> On Feb 17, 2017, at 8:44 AM, Keith Wiles  wrote:
> 
> Calling strncpy with a maximum size argument of 16 bytes on destination
> array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
> destination string unterminated.
> 
> Signed-off-by: Keith Wiles 
> ---
> drivers/net/tap/rte_eth_tap.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/drivers/net/tap/rte_eth_tap.c b/drivers/net/tap/rte_eth_tap.c
> index efc4426..f9938d7 100644
> --- a/drivers/net/tap/rte_eth_tap.c
> +++ b/drivers/net/tap/rte_eth_tap.c
> @@ -297,7 +297,7 @@ tap_link_set_flags(struct pmd_internals *pmd, short 
> flags, int add)
>   return -1;
>   }
>   memset(&ifr, 0, sizeof(ifr));
> - strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
> + strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ-1);
>   err = ioctl(s, SIOCGIFFLAGS, &ifr);
>   if (err < 0) {
>   RTE_LOG(WARNING, PMD, "Unable to get %s device flags: %s\n”,

NAK missed the spaces around ‘-‘ :-(

> -- 
> 2.8.0.GIT
> 

Regards,
Keith



[dpdk-dev] [PATCH v2] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Keith Wiles
Calling strncpy with a maximum size argument of 16 bytes on destination
array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
destination string unterminated.

Signed-off-by: Keith Wiles 
---
 drivers/net/tap/rte_eth_tap.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/drivers/net/tap/rte_eth_tap.c b/drivers/net/tap/rte_eth_tap.c
index efc4426..cf1cea3 100644
--- a/drivers/net/tap/rte_eth_tap.c
+++ b/drivers/net/tap/rte_eth_tap.c
@@ -297,7 +297,7 @@ tap_link_set_flags(struct pmd_internals *pmd, short flags, 
int add)
return -1;
}
memset(&ifr, 0, sizeof(ifr));
-   strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
+   strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ - 1);
err = ioctl(s, SIOCGIFFLAGS, &ifr);
if (err < 0) {
RTE_LOG(WARNING, PMD, "Unable to get %s device flags: %s\n",
-- 
2.8.0.GIT



[dpdk-dev] [PATCH v3 00/17] next-eventdev: event/sw software eventdev

2017-02-17 Thread Harry van Haaren

The following patchset adds software eventdev implementation
to the next-eventdev tree.

This implementation is based on the previous software eventdev
v2 patchset, now with comments addressed:
1) Fixed queue configure in single link, returning -EDQOUT (Jerin)
2) SW_SCHED_TYPE_DIRECT now in SW_ namespace (Jerin)
3) SW_PMD_NAME now in SW_ namespace (Jerin)
4) Replaced printfs with SW_LOG() to report errors (Jerin)
5) Update version map to 17.05 (Jerin)
6) Fixed back-to-back tests of event/sw (Jerin)
7) checkpatch issues in v2 of xstats() (Jerin)
8) Updated link() and unlink() functions to accept dev* parameter
9) Fixed bug in load-balanced port credits returning (Gage)
a) Fixed segfault if dump() called before fully configured (Vipin)
b) Removed extern sched_quanta, now tracked internally in state
c) Added compiler barriers to qe_ring datastructures
d) vdev uninit() as implemented in next-eventdev is now supported (Jerin)
c) xstats get() API has "mode" to select dev, port or queue stats (Jerin)
e) xstats reset() added (Jerin)
f) Renamed test_sw_eventdev.c to test_eventdev_sw.c (Jerin)

The first three patches in the series make new changes to the eventdev
API and tests that were not present in the v2 patchset. This was required
to pass the unit tests and provide correct behaviour of eventdev layer
checks of the event/sw implementation:
0) Fix timeout ticks test to return -ENOTSUP if dev doesn't support it
1) Increase the enqueue depth in dev_config and port_conf
2) Create dummy Links before start stop test (details in commit msg)

Next then the software implementation is added, and finally
more tests are added for the sw eventdev implementation.

Git log is clean, while checkpatch issues:
2 Errors on Complex Macro (which I beleive cannot be fixed)
4 Warnings in the scheduler logic, which reduce readability when fixed.

This patchset contains the work of multiple developers,
please see signoffs on each patch.

Signed-off-by: Harry van Haaren 




Bruce Richardson (14):
  eventdev: add APIs for extended stats
  event/sw: add new software-only eventdev driver
  event/sw: add device capabilities function
  event/sw: add configure function
  event/sw: add fns to return default port/queue config
  event/sw: add support for event queues
  event/sw: add support for event ports
  event/sw: add support for linking queues to ports
  event/sw: add worker core functions
  event/sw: add scheduling logic
  event/sw: add start stop and close functions
  event/sw: add dump function for easier debugging
  event/sw: add xstats support
  app/test: add unit tests for SW eventdev driver

Harry van Haaren (3):
  eventdev: fix API docs and test for timeout ticks
  eventdev: increase size of enq deq conf variables
  app/test: eventdev link all queues before start

 app/test/Makefile |5 +-
 app/test/autotest_data.py |   26 +
 app/test/test_eventdev.c  |   20 +-
 app/test/test_eventdev_sw.c   | 2249 +
 config/common_base|5 +
 drivers/event/Makefile|1 +
 drivers/event/sw/Makefile |   69 +
 drivers/event/sw/event_ring.h |  185 ++
 drivers/event/sw/iq_ring.h|  176 ++
 drivers/event/sw/rte_pmd_evdev_sw_version.map |3 +
 drivers/event/sw/sw_evdev.c   |  793 +
 drivers/event/sw/sw_evdev.h   |  306 
 drivers/event/sw/sw_evdev_scheduler.c |  602 +++
 drivers/event/sw/sw_evdev_worker.c|  188 +++
 drivers/event/sw/sw_evdev_xstats.c|  511 ++
 lib/librte_eventdev/rte_eventdev.c|   78 +
 lib/librte_eventdev/rte_eventdev.h|  130 +-
 lib/librte_eventdev/rte_eventdev_pmd.h|   70 +
 lib/librte_eventdev/rte_eventdev_version.map  |4 +
 mk/rte.app.mk |1 +
 20 files changed, 5415 insertions(+), 7 deletions(-)
 create mode 100644 app/test/test_eventdev_sw.c
 create mode 100644 drivers/event/sw/Makefile
 create mode 100644 drivers/event/sw/event_ring.h
 create mode 100644 drivers/event/sw/iq_ring.h
 create mode 100644 drivers/event/sw/rte_pmd_evdev_sw_version.map
 create mode 100644 drivers/event/sw/sw_evdev.c
 create mode 100644 drivers/event/sw/sw_evdev.h
 create mode 100644 drivers/event/sw/sw_evdev_scheduler.c
 create mode 100644 drivers/event/sw/sw_evdev_worker.c
 create mode 100644 drivers/event/sw/sw_evdev_xstats.c

-- 
2.7.4



[dpdk-dev] [PATCH v3 01/17] eventdev: fix API docs and test for timeout ticks

2017-02-17 Thread Harry van Haaren
This commit improves the documentation of the api return
values for rte_event_dequeue_timeout_ticks(), and allows
-ENOTSUP to be returned by devices which do not support
timeouts.

The unit test is modified to accept -ENOTSUP as a pass,
as the device doesn't implement the timeout_ticks function.

Fixes: 4c9a26e419a7 ("app/test: unit test case for eventdev APIs")

Signed-off-by: Harry van Haaren 
---
 app/test/test_eventdev.c   | 4 +++-
 lib/librte_eventdev/rte_eventdev.h | 6 --
 2 files changed, 7 insertions(+), 3 deletions(-)

diff --git a/app/test/test_eventdev.c b/app/test/test_eventdev.c
index 042a446..756bc32 100644
--- a/app/test/test_eventdev.c
+++ b/app/test/test_eventdev.c
@@ -519,7 +519,9 @@ test_eventdev_timeout_ticks(void)
uint64_t timeout_ticks;
 
ret = rte_event_dequeue_timeout_ticks(TEST_DEV_ID, 100, &timeout_ticks);
-   TEST_ASSERT_SUCCESS(ret, "Fail to get timeout_ticks");
+   /* -ENOTSUP is a valid return if timeout is not supported by device  */
+   if (ret != -ENOTSUP)
+   TEST_ASSERT_SUCCESS(ret, "Fail to get timeout_ticks");
 
return TEST_SUCCESS;
 }
diff --git a/lib/librte_eventdev/rte_eventdev.h 
b/lib/librte_eventdev/rte_eventdev.h
index b619160..b0c7f9c 100644
--- a/lib/librte_eventdev/rte_eventdev.h
+++ b/lib/librte_eventdev/rte_eventdev.h
@@ -1158,7 +1158,9 @@ rte_event_enqueue_burst(uint8_t dev_id, uint8_t port_id,
  *
  * @return
  *  - 0 on success.
- *  - <0 on failure.
+ *  - -ENOTSUP if the device doesn't support timeouts.
+ *  - -EINVAL if *dev_id* is invalid or *timeout_ticks* is a null pointer.
+ *  - other values < 0 on failure.
  *
  * @see rte_event_dequeue_burst(), RTE_EVENT_DEV_CFG_PER_DEQUEUE_TIMEOUT
  * @see rte_event_dev_configure()
@@ -1166,7 +1168,7 @@ rte_event_enqueue_burst(uint8_t dev_id, uint8_t port_id,
  */
 int
 rte_event_dequeue_timeout_ticks(uint8_t dev_id, uint64_t ns,
-   uint64_t *timeout_ticks);
+   uint64_t *timeout_ticks);
 
 /**
  * Dequeue a burst of events objects or an event object from the event port
-- 
2.7.4



[dpdk-dev] [PATCH v3 02/17] eventdev: increase size of enq deq conf variables

2017-02-17 Thread Harry van Haaren
Large port enqueue sizes were not supported as the value
it was stored in was a uint8_t. Using uint8_ts to save
space in config apis makes no sense - increasing the 3
instances of uint8_t enqueue / dequeue depths to more
appropriate values (based on the context around them).

Signed-off-by: Harry van Haaren 
---
 lib/librte_eventdev/rte_eventdev.h | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/lib/librte_eventdev/rte_eventdev.h 
b/lib/librte_eventdev/rte_eventdev.h
index b0c7f9c..8b6cb7a 100644
--- a/lib/librte_eventdev/rte_eventdev.h
+++ b/lib/librte_eventdev/rte_eventdev.h
@@ -426,7 +426,7 @@ struct rte_event_dev_config {
 * This value cannot exceed the *max_event_queue_flows* which previously
 * provided in rte_event_dev_info_get()
 */
-   uint8_t nb_event_port_dequeue_depth;
+   uint32_t nb_event_port_dequeue_depth;
/**< Maximum number of events can be dequeued at a time from an
 * event port by this device.
 * This value cannot exceed the *max_event_port_dequeue_depth*
@@ -637,12 +637,12 @@ struct rte_event_port_conf {
 * which was previously supplied to rte_event_dev_configure().
 * This should be set to '-1' for *open system*.
 */
-   uint8_t dequeue_depth;
+   uint16_t dequeue_depth;
/**< Configure number of bulk dequeues for this event port.
 * This value cannot exceed the *nb_event_port_dequeue_depth*
 * which previously supplied to rte_event_dev_configure()
 */
-   uint8_t enqueue_depth;
+   uint16_t enqueue_depth;
/**< Configure number of bulk enqueues for this event port.
 * This value cannot exceed the *nb_event_port_enqueue_depth*
 * which previously supplied to rte_event_dev_configure()
-- 
2.7.4



[dpdk-dev] [PATCH v3 04/17] eventdev: add APIs for extended stats

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Add in APIs for extended stats so that eventdev implementations can report
out information on their internal state. The APIs are based on, but not
identical to, the equivalent ethdev functions.

Signed-off-by: Bruce Richardson 
Signed-off-by: Harry van Haaren 
---
 lib/librte_eventdev/rte_eventdev.c   |  78 ++
 lib/librte_eventdev/rte_eventdev.h   | 118 +++
 lib/librte_eventdev/rte_eventdev_pmd.h   |  70 
 lib/librte_eventdev/rte_eventdev_version.map |   4 +
 4 files changed, 270 insertions(+)

diff --git a/lib/librte_eventdev/rte_eventdev.c 
b/lib/librte_eventdev/rte_eventdev.c
index 68bfc3b..628b94c 100644
--- a/lib/librte_eventdev/rte_eventdev.c
+++ b/lib/librte_eventdev/rte_eventdev.c
@@ -920,6 +920,84 @@ rte_event_dev_dump(uint8_t dev_id, FILE *f)
 
 }
 
+static int
+xstats_get_count(uint8_t dev_id, enum rte_event_dev_xstats_mode mode,
+   uint8_t queue_port_id)
+{
+   struct rte_eventdev *dev = &rte_eventdevs[dev_id];
+   if (dev->dev_ops->xstats_get_names != NULL)
+   return (*dev->dev_ops->xstats_get_names)(dev, mode,
+   queue_port_id, NULL, 0);
+   return 0;
+}
+
+int
+rte_event_dev_xstats_names_get(uint8_t dev_id,
+   enum rte_event_dev_xstats_mode mode, uint8_t queue_port_id,
+   struct rte_event_dev_xstats_name *xstats_names,
+   unsigned int size)
+{
+   RTE_EVENTDEV_VALID_DEVID_OR_ERR_RET(dev_id, -EINVAL);
+   const int cnt_expected_entries = xstats_get_count(dev_id, mode,
+ queue_port_id);
+   if (xstats_names == NULL || cnt_expected_entries < 0 ||
+   (int)size < cnt_expected_entries)
+   return cnt_expected_entries;
+
+   /* dev_id checked above */
+   const struct rte_eventdev *dev = &rte_eventdevs[dev_id];
+
+   if (dev->dev_ops->xstats_get_names != NULL)
+   return (*dev->dev_ops->xstats_get_names)(dev, mode,
+   queue_port_id, xstats_names, size);
+
+   return -ENOTSUP;
+}
+
+/* retrieve eventdev extended statistics */
+int
+rte_event_dev_xstats_get(uint8_t dev_id, enum rte_event_dev_xstats_mode mode,
+   uint8_t queue_port_id, const unsigned int ids[],
+   uint64_t values[], unsigned int n)
+{
+   RTE_EVENTDEV_VALID_DEVID_OR_ERR_RET(dev_id, -EINVAL);
+   const struct rte_eventdev *dev = &rte_eventdevs[dev_id];
+
+   /* implemented by the driver */
+   if (dev->dev_ops->xstats_get != NULL)
+   return (*dev->dev_ops->xstats_get)(dev, mode, queue_port_id,
+   ids, values, n);
+   return -ENOTSUP;
+}
+
+uint64_t
+rte_event_dev_xstats_by_name_get(uint8_t dev_id, const char *name,
+   unsigned int *id)
+{
+   RTE_EVENTDEV_VALID_DEVID_OR_ERR_RET(dev_id, 0);
+   const struct rte_eventdev *dev = &rte_eventdevs[dev_id];
+   unsigned int temp = -1;
+
+   if (id != NULL)
+   *id = (unsigned int)-1;
+   else
+   id = &temp; /* ensure driver never gets a NULL value */
+
+   /* implemented by driver */
+   if (dev->dev_ops->xstats_get_by_name != NULL)
+   return (*dev->dev_ops->xstats_get_by_name)(dev, name, id);
+   return -ENOTSUP;
+}
+
+int rte_event_dev_xstats_reset(uint8_t dev_id)
+{
+   RTE_EVENTDEV_VALID_DEVID_OR_ERR_RET(dev_id, -EINVAL);
+   struct rte_eventdev *dev = &rte_eventdevs[dev_id];
+   if (dev->dev_ops->xstats_reset != NULL)
+   return (*dev->dev_ops->xstats_reset)(dev);
+   return -ENOTSUP;
+}
+
 int
 rte_event_dev_start(uint8_t dev_id)
 {
diff --git a/lib/librte_eventdev/rte_eventdev.h 
b/lib/librte_eventdev/rte_eventdev.h
index 8b6cb7a..242b259 100644
--- a/lib/librte_eventdev/rte_eventdev.h
+++ b/lib/librte_eventdev/rte_eventdev.h
@@ -1405,6 +1405,124 @@ rte_event_port_links_get(uint8_t dev_id, uint8_t 
port_id,
 int
 rte_event_dev_dump(uint8_t dev_id, FILE *f);
 
+/** Maximum name length for extended statistics counters */
+#define RTE_EVENT_DEV_XSTATS_NAME_SIZE 64
+
+/**
+ * Selects the component of the eventdev to retrieve statistics from.
+ */
+enum rte_event_dev_xstats_mode {
+   RTE_EVENT_DEV_XSTATS_DEVICE,
+   RTE_EVENT_DEV_XSTATS_PORT,
+   RTE_EVENT_DEV_XSTATS_QUEUE,
+};
+
+/**
+ * A name-key lookup element for extended statistics.
+ *
+ * This structure is used to map between names and ID numbers
+ * for extended ethdev statistics.
+ */
+struct rte_event_dev_xstats_name {
+   char name[RTE_EVENT_DEV_XSTATS_NAME_SIZE];
+};
+
+/**
+ * Retrieve names of extended statistics of an event device.
+ *
+ * @param dev_id
+ *   The identifier of the event device.
+ * @param mode
+ *   The mode of statistics to retrieve. Choices include the device statistics,
+ *   port statistics or queue statistics.

[dpdk-dev] [PATCH v3 05/17] event/sw: add new software-only eventdev driver

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

This adds the minimal changes to allow a SW eventdev implementation to
be compiled, linked and created at run time. The eventdev does nothing,
but can be created via vdev on commandline, e.g.

  sudo ./x86_64-native-linuxapp-gcc/app/test --vdev=event_sw0
  ...
  PMD: Creating eventdev sw device event_sw0, numa_node=0, sched_quanta=128
  RTE>>

Signed-off-by: Bruce Richardson 
Signed-off-by: Harry van Haaren 
---
 config/common_base|   5 +
 drivers/event/Makefile|   1 +
 drivers/event/sw/Makefile |  66 ++
 drivers/event/sw/rte_pmd_evdev_sw_version.map |   3 +
 drivers/event/sw/sw_evdev.c   | 177 ++
 drivers/event/sw/sw_evdev.h   | 148 +
 mk/rte.app.mk |   1 +
 7 files changed, 401 insertions(+)
 create mode 100644 drivers/event/sw/Makefile
 create mode 100644 drivers/event/sw/rte_pmd_evdev_sw_version.map
 create mode 100644 drivers/event/sw/sw_evdev.c
 create mode 100644 drivers/event/sw/sw_evdev.h

diff --git a/config/common_base b/config/common_base
index 2538f4a..1121e56 100644
--- a/config/common_base
+++ b/config/common_base
@@ -458,6 +458,11 @@ CONFIG_RTE_LIBRTE_PMD_SKELETON_EVENTDEV=y
 CONFIG_RTE_LIBRTE_PMD_SKELETON_EVENTDEV_DEBUG=n
 
 #
+# Compile PMD for software event device
+#
+CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV=y
+
+#
 # Compile librte_ring
 #
 CONFIG_RTE_LIBRTE_RING=y
diff --git a/drivers/event/Makefile b/drivers/event/Makefile
index 678279f..353441c 100644
--- a/drivers/event/Makefile
+++ b/drivers/event/Makefile
@@ -32,5 +32,6 @@
 include $(RTE_SDK)/mk/rte.vars.mk
 
 DIRS-$(CONFIG_RTE_LIBRTE_PMD_SKELETON_EVENTDEV) += skeleton
+DIRS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw
 
 include $(RTE_SDK)/mk/rte.subdir.mk
diff --git a/drivers/event/sw/Makefile b/drivers/event/sw/Makefile
new file mode 100644
index 000..d6836e3
--- /dev/null
+++ b/drivers/event/sw/Makefile
@@ -0,0 +1,66 @@
+#   BSD LICENSE
+#
+#   Copyright(c) 2016-2017 Intel Corporation. All rights reserved.
+#
+#   Redistribution and use in source and binary forms, with or without
+#   modification, are permitted provided that the following conditions
+#   are met:
+#
+# * Redistributions of source code must retain the above copyright
+#   notice, this list of conditions and the following disclaimer.
+# * Redistributions in binary form must reproduce the above copyright
+#   notice, this list of conditions and the following disclaimer in
+#   the documentation and/or other materials provided with the
+#   distribution.
+# * Neither the name of Intel Corporation nor the names of its
+#   contributors may be used to endorse or promote products derived
+#   from this software without specific prior written permission.
+#
+#   THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
+#   "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
+#   LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
+#   A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
+#   OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
+#   SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
+#   LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
+#   DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
+#   THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
+#   (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
+#   OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
+
+include $(RTE_SDK)/mk/rte.vars.mk
+
+
+# library name
+LIB = librte_pmd_sw_event.a
+
+# build flags
+CFLAGS += -O3
+CFLAGS += $(WERROR_FLAGS)
+# for older GCC versions, allow us to initialize an event using
+# designated initializers.
+ifeq ($(CONFIG_RTE_TOOLCHAIN_GCC),y)
+ifeq ($(shell test $(GCC_VERSION) -le 50 && echo 1), 1)
+CFLAGS += -Wno-missing-field-initializers
+endif
+endif
+
+# library version
+LIBABIVER := 1
+
+# versioning export map
+EXPORT_MAP := rte_pmd_evdev_sw_version.map
+
+# library source files
+SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw_evdev.c
+
+# export include files
+SYMLINK-y-include +=
+
+# library dependencies
+DEPDIRS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += lib/librte_eal
+DEPDIRS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += lib/librte_eventdev
+DEPDIRS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += lib/librte_kvargs
+DEPDIRS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += lib/librte_ring
+
+include $(RTE_SDK)/mk/rte.lib.mk
diff --git a/drivers/event/sw/rte_pmd_evdev_sw_version.map 
b/drivers/event/sw/rte_pmd_evdev_sw_version.map
new file mode 100644
index 000..5352e7e
--- /dev/null
+++ b/drivers/event/sw/rte_pmd_evdev_sw_version.map
@@ -0,0 +1,3 @@
+DPDK_17.05 {
+   local: *;
+};
diff --git a/drivers/event/sw/sw_evdev.c b/drivers/event/sw/sw_evdev.c
new

[dpdk-dev] [PATCH v3 03/17] app/test: eventdev link all queues before start

2017-02-17 Thread Harry van Haaren
The software eventdev can lock-up if not all queues are
linked to a port. For this reason, the software evendev
fails to start if queues are not linked to anything.

This commit creates dummy links from all queues to port
0 in the eventdev setup function and start/stop test,
which would otherwise fail due to unlinked queues.

Signed-off-by: Harry van Haaren 
---
 app/test/test_eventdev.c | 16 
 1 file changed, 16 insertions(+)

diff --git a/app/test/test_eventdev.c b/app/test/test_eventdev.c
index 756bc32..7d4160d 100644
--- a/app/test/test_eventdev.c
+++ b/app/test/test_eventdev.c
@@ -545,6 +545,14 @@ test_eventdev_start_stop(void)
TEST_ASSERT_SUCCESS(ret, "Failed to setup port%d", i);
}
 
+   for (i = 0; i < rte_event_queue_count(TEST_DEV_ID); i++) {
+   uint8_t queue = i;
+   uint8_t prio  = 0;
+   ret = rte_event_port_link(TEST_DEV_ID, 0, &queue, &prio, 1);
+   TEST_ASSERT(ret == 1, "Failed to link port, device %d",
+   TEST_DEV_ID);
+   }
+
ret = rte_event_dev_start(TEST_DEV_ID);
TEST_ASSERT_SUCCESS(ret, "Failed to start device%d", TEST_DEV_ID);
 
@@ -571,6 +579,14 @@ eventdev_setup_device(void)
TEST_ASSERT_SUCCESS(ret, "Failed to setup port%d", i);
}
 
+   for (i = 0; i < rte_event_queue_count(TEST_DEV_ID); i++) {
+   uint8_t queue = i;
+   uint8_t prio  = 0;
+   ret = rte_event_port_link(TEST_DEV_ID, 0, &queue, &prio, 1);
+   TEST_ASSERT(ret == 1, "Failed to link port, device %d",
+   TEST_DEV_ID);
+   }
+
ret = rte_event_dev_start(TEST_DEV_ID);
TEST_ASSERT_SUCCESS(ret, "Failed to start device%d", TEST_DEV_ID);
 
-- 
2.7.4



[dpdk-dev] [PATCH v3 06/17] event/sw: add device capabilities function

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Add in the info_get function to return details on the queues, flow,
prioritization capabilities, etc. that this device has.

Signed-off-by: Bruce Richardson 
Signed-off-by: Harry van Haaren 
---
 drivers/event/sw/sw_evdev.c | 23 +++
 1 file changed, 23 insertions(+)

diff --git a/drivers/event/sw/sw_evdev.c b/drivers/event/sw/sw_evdev.c
index 4de9bc1..9d8517a 100644
--- a/drivers/event/sw/sw_evdev.c
+++ b/drivers/event/sw/sw_evdev.c
@@ -44,6 +44,28 @@
 #define SCHED_QUANTA_ARG "sched_quanta"
 #define CREDIT_QUANTA_ARG "credit_quanta"
 
+static void
+sw_info_get(struct rte_eventdev *dev, struct rte_event_dev_info *info)
+{
+   RTE_SET_USED(dev);
+
+   static const struct rte_event_dev_info evdev_sw_info = {
+   .driver_name = SW_PMD_NAME,
+   .max_event_queues = RTE_EVENT_MAX_QUEUES_PER_DEV,
+   .max_event_queue_flows = SW_QID_NUM_FIDS,
+   .max_event_queue_priority_levels = SW_Q_PRIORITY_MAX,
+   .max_event_priority_levels = SW_IQS_MAX,
+   .max_event_ports = SW_PORTS_MAX,
+   .max_event_port_dequeue_depth = MAX_SW_CONS_Q_DEPTH,
+   .max_event_port_enqueue_depth = MAX_SW_PROD_Q_DEPTH,
+   .max_num_events = SW_INFLIGHT_EVENTS_TOTAL,
+   .event_dev_cap = (RTE_EVENT_DEV_CAP_QUEUE_QOS |
+   RTE_EVENT_DEV_CAP_EVENT_QOS),
+   };
+
+   *info = evdev_sw_info;
+}
+
 static int
 assign_numa_node(const char *key __rte_unused, const char *value, void *opaque)
 {
@@ -78,6 +100,7 @@ static int
 sw_probe(const char *name, const char *params)
 {
static const struct rte_eventdev_ops evdev_sw_ops = {
+   .dev_infos_get = sw_info_get,
};
 
static const char *const args[] = {
-- 
2.7.4



[dpdk-dev] [PATCH v3 07/17] event/sw: add configure function

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Signed-off-by: Bruce Richardson 
Signed-off-by: Harry van Haaren 
---
 drivers/event/sw/sw_evdev.c | 15 +++
 drivers/event/sw/sw_evdev.h | 11 +++
 2 files changed, 26 insertions(+)

diff --git a/drivers/event/sw/sw_evdev.c b/drivers/event/sw/sw_evdev.c
index 9d8517a..28a2326 100644
--- a/drivers/event/sw/sw_evdev.c
+++ b/drivers/event/sw/sw_evdev.c
@@ -44,6 +44,20 @@
 #define SCHED_QUANTA_ARG "sched_quanta"
 #define CREDIT_QUANTA_ARG "credit_quanta"
 
+static int
+sw_dev_configure(const struct rte_eventdev *dev)
+{
+   struct sw_evdev *sw = sw_pmd_priv(dev);
+   const struct rte_eventdev_data *data = dev->data;
+   const struct rte_event_dev_config *conf = &data->dev_conf;
+
+   sw->qid_count = conf->nb_event_queues;
+   sw->port_count = conf->nb_event_ports;
+   sw->nb_events_limit = conf->nb_events_limit;
+
+   return 0;
+}
+
 static void
 sw_info_get(struct rte_eventdev *dev, struct rte_event_dev_info *info)
 {
@@ -100,6 +114,7 @@ static int
 sw_probe(const char *name, const char *params)
 {
static const struct rte_eventdev_ops evdev_sw_ops = {
+   .dev_configure = sw_dev_configure,
.dev_infos_get = sw_info_get,
};
 
diff --git a/drivers/event/sw/sw_evdev.h b/drivers/event/sw/sw_evdev.h
index ab315d4..fda57df 100644
--- a/drivers/event/sw/sw_evdev.h
+++ b/drivers/event/sw/sw_evdev.h
@@ -35,6 +35,7 @@
 
 #include 
 #include 
+#include 
 
 #define SW_DEFAULT_CREDIT_QUANTA 32
 #define SW_DEFAULT_SCHED_QUANTA 128
@@ -129,7 +130,17 @@ struct sw_qid {
 struct sw_evdev {
struct rte_eventdev_data *data;
 
+   uint32_t port_count;
+   uint32_t qid_count;
+
+   /*
+* max events in this instance. Cached here for performance.
+* (also available in data->conf.nb_events_limit)
+*/
+   uint32_t nb_events_limit;
+
int32_t sched_quanta;
+
uint32_t credit_update_quanta;
 };
 
-- 
2.7.4



[dpdk-dev] [PATCH v3 09/17] event/sw: add support for event queues

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Add in the data structures for the event queues, and the eventdev
functions to create and destroy those queues.

Signed-off-by: Bruce Richardson 
Signed-off-by: Harry van Haaren 
---
 drivers/event/sw/iq_ring.h  | 176 
 drivers/event/sw/sw_evdev.c | 160 
 drivers/event/sw/sw_evdev.h |   5 ++
 3 files changed, 341 insertions(+)
 create mode 100644 drivers/event/sw/iq_ring.h

diff --git a/drivers/event/sw/iq_ring.h b/drivers/event/sw/iq_ring.h
new file mode 100644
index 000..d480d15
--- /dev/null
+++ b/drivers/event/sw/iq_ring.h
@@ -0,0 +1,176 @@
+/*-
+ *   BSD LICENSE
+ *
+ *   Copyright(c) 2016-2017 Intel Corporation. All rights reserved.
+ *
+ *   Redistribution and use in source and binary forms, with or without
+ *   modification, are permitted provided that the following conditions
+ *   are met:
+ *
+ * * Redistributions of source code must retain the above copyright
+ *   notice, this list of conditions and the following disclaimer.
+ * * Redistributions in binary form must reproduce the above copyright
+ *   notice, this list of conditions and the following disclaimer in
+ *   the documentation and/or other materials provided with the
+ *   distribution.
+ * * Neither the name of Intel Corporation nor the names of its
+ *   contributors may be used to endorse or promote products derived
+ *   from this software without specific prior written permission.
+ *
+ *   THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
+ *   "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
+ *   LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
+ *   A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
+ *   OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
+ *   SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
+ *   LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
+ *   DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
+ *   THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
+ *   (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
+ *   OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
+ */
+
+/*
+ * Ring structure definitions used for the internal ring buffers of the
+ * SW eventdev implementation. These are designed for single-core use only.
+ */
+#ifndef _IQ_RING_
+#define _IQ_RING_
+
+#include 
+
+#include 
+#include 
+#include 
+#include 
+
+#define IQ_RING_NAMESIZE 12
+#define QID_IQ_DEPTH 512
+#define QID_IQ_MASK (uint16_t)(QID_IQ_DEPTH - 1)
+
+struct iq_ring {
+   char name[IQ_RING_NAMESIZE] __rte_cache_aligned;
+   uint16_t write_idx;
+   uint16_t read_idx;
+
+   struct rte_event ring[QID_IQ_DEPTH];
+};
+
+#ifndef force_inline
+#define force_inline inline __attribute__((always_inline))
+#endif
+
+static inline struct iq_ring *
+iq_ring_create(const char *name, unsigned int socket_id)
+{
+   struct iq_ring *retval;
+
+   retval = rte_malloc_socket(NULL, sizeof(*retval), 0, socket_id);
+   if (retval == NULL)
+   goto end;
+
+   snprintf(retval->name, sizeof(retval->name), "%s", name);
+   retval->write_idx = retval->read_idx = 0;
+end:
+   return retval;
+}
+
+static inline void
+iq_ring_destroy(struct iq_ring *r)
+{
+   rte_free(r);
+}
+
+static force_inline uint16_t
+iq_ring_count(const struct iq_ring *r)
+{
+   return r->write_idx - r->read_idx;
+}
+
+static force_inline uint16_t
+iq_ring_free_count(const struct iq_ring *r)
+{
+   return QID_IQ_MASK - iq_ring_count(r);
+}
+
+static force_inline uint16_t
+iq_ring_enqueue_burst(struct iq_ring *r, struct rte_event *qes, uint16_t 
nb_qes)
+{
+   const uint16_t read = r->read_idx;
+   uint16_t write = r->write_idx;
+   const uint16_t space = read + QID_IQ_MASK - write;
+   uint16_t i;
+
+   if (space < nb_qes)
+   nb_qes = space;
+
+   for (i = 0; i < nb_qes; i++, write++)
+   r->ring[write & QID_IQ_MASK] = qes[i];
+
+   r->write_idx = write;
+
+   return nb_qes;
+}
+
+static force_inline uint16_t
+iq_ring_dequeue_burst(struct iq_ring *r, struct rte_event *qes, uint16_t 
nb_qes)
+{
+   uint16_t read = r->read_idx;
+   const uint16_t write = r->write_idx;
+   const uint16_t items = write - read;
+   uint16_t i;
+
+   for (i = 0; i < nb_qes; i++, read++)
+   qes[i] = r->ring[read & QID_IQ_MASK];
+
+   if (items < nb_qes)
+   nb_qes = items;
+
+   r->read_idx += nb_qes;
+
+   return nb_qes;
+}
+
+/* assumes there is space, from a previous dequeue_burst */
+static force_inline uint16_t
+iq_ring_put_back(struct iq_ring *r, struct rte_event *qes, uint16_t nb_qes)
+{
+   uint16_t i, read = r->read_idx;
+
+   for (i = nb_qes; i-- > 0; )
+  

[dpdk-dev] [PATCH v3 08/17] event/sw: add fns to return default port/queue config

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Signed-off-by: Bruce Richardson 
---
 drivers/event/sw/sw_evdev.c | 32 
 1 file changed, 32 insertions(+)

diff --git a/drivers/event/sw/sw_evdev.c b/drivers/event/sw/sw_evdev.c
index 28a2326..d1fa3a7 100644
--- a/drivers/event/sw/sw_evdev.c
+++ b/drivers/event/sw/sw_evdev.c
@@ -44,6 +44,35 @@
 #define SCHED_QUANTA_ARG "sched_quanta"
 #define CREDIT_QUANTA_ARG "credit_quanta"
 
+static void
+sw_queue_def_conf(struct rte_eventdev *dev, uint8_t queue_id,
+struct rte_event_queue_conf *conf)
+{
+   RTE_SET_USED(dev);
+   RTE_SET_USED(queue_id);
+
+   static const struct rte_event_queue_conf default_conf = {
+   .nb_atomic_flows = 4096,
+   .nb_atomic_order_sequences = 1,
+   .event_queue_cfg = RTE_EVENT_QUEUE_CFG_ATOMIC_ONLY,
+   .priority = RTE_EVENT_DEV_PRIORITY_NORMAL,
+   };
+
+   *conf = default_conf;
+}
+
+static void
+sw_port_def_conf(struct rte_eventdev *dev, uint8_t port_id,
+struct rte_event_port_conf *port_conf)
+{
+   RTE_SET_USED(dev);
+   RTE_SET_USED(port_id);
+
+   port_conf->new_event_threshold = 1024;
+   port_conf->dequeue_depth = 16;
+   port_conf->enqueue_depth = 16;
+}
+
 static int
 sw_dev_configure(const struct rte_eventdev *dev)
 {
@@ -116,6 +145,9 @@ sw_probe(const char *name, const char *params)
static const struct rte_eventdev_ops evdev_sw_ops = {
.dev_configure = sw_dev_configure,
.dev_infos_get = sw_info_get,
+
+   .queue_def_conf = sw_queue_def_conf,
+   .port_def_conf = sw_port_def_conf,
};
 
static const char *const args[] = {
-- 
2.7.4



[dpdk-dev] [PATCH v3 10/17] event/sw: add support for event ports

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Add in the data-structures for the ports used by workers to send
packets to/from the scheduler. Also add in the functions to
create/destroy those ports.

Signed-off-by: Bruce Richardson 
Signed-off-by: Harry van Haaren 
---
 drivers/event/sw/event_ring.h | 185 ++
 drivers/event/sw/sw_evdev.c   |  76 +
 drivers/event/sw/sw_evdev.h   |  78 ++
 3 files changed, 339 insertions(+)
 create mode 100644 drivers/event/sw/event_ring.h

diff --git a/drivers/event/sw/event_ring.h b/drivers/event/sw/event_ring.h
new file mode 100644
index 000..cdaee95
--- /dev/null
+++ b/drivers/event/sw/event_ring.h
@@ -0,0 +1,185 @@
+/*-
+ *   BSD LICENSE
+ *
+ *   Copyright(c) 2016-2017 Intel Corporation. All rights reserved.
+ *
+ *   Redistribution and use in source and binary forms, with or without
+ *   modification, are permitted provided that the following conditions
+ *   are met:
+ *
+ * * Redistributions of source code must retain the above copyright
+ *   notice, this list of conditions and the following disclaimer.
+ * * Redistributions in binary form must reproduce the above copyright
+ *   notice, this list of conditions and the following disclaimer in
+ *   the documentation and/or other materials provided with the
+ *   distribution.
+ * * Neither the name of Intel Corporation nor the names of its
+ *   contributors may be used to endorse or promote products derived
+ *   from this software without specific prior written permission.
+ *
+ *   THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
+ *   "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
+ *   LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
+ *   A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
+ *   OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
+ *   SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
+ *   LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
+ *   DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
+ *   THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
+ *   (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
+ *   OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
+ */
+
+/*
+ * Generic ring structure for passing events from one core to another.
+ *
+ * Used by the software scheduler for the producer and consumer rings for
+ * each port, i.e. for passing events from worker cores to scheduler and
+ * vice-versa. Designed for single-producer, single-consumer use with two
+ * cores working on each ring.
+ */
+
+#ifndef _EVENT_RING_
+#define _EVENT_RING_
+
+#include 
+
+#include 
+#include 
+#include 
+
+#define QE_RING_NAMESIZE 32
+
+struct qe_ring {
+   char name[QE_RING_NAMESIZE] __rte_cache_aligned;
+   uint32_t ring_size; /* size of memory block allocated to the ring */
+   uint32_t mask;  /* mask for read/write values == ring_size -1 */
+   uint32_t size;  /* actual usable space in the ring */
+   volatile uint32_t write_idx __rte_cache_aligned;
+   volatile uint32_t read_idx __rte_cache_aligned;
+
+   struct rte_event ring[0] __rte_cache_aligned;
+};
+
+#ifndef force_inline
+#define force_inline inline __attribute__((always_inline))
+#endif
+
+static inline struct qe_ring *
+qe_ring_create(const char *name, unsigned int size, unsigned int socket_id)
+{
+   struct qe_ring *retval;
+   const uint32_t ring_size = rte_align32pow2(size + 1);
+   size_t memsize = sizeof(*retval) +
+   (ring_size * sizeof(retval->ring[0]));
+
+   retval = rte_zmalloc_socket(NULL, memsize, 0, socket_id);
+   if (retval == NULL)
+   goto end;
+
+   snprintf(retval->name, sizeof(retval->name), "EVDEV_RG_%s", name);
+   retval->ring_size = ring_size;
+   retval->mask = ring_size - 1;
+   retval->size = size;
+end:
+   return retval;
+}
+
+static inline void
+qe_ring_destroy(struct qe_ring *r)
+{
+   rte_free(r);
+}
+
+static force_inline unsigned int
+qe_ring_count(const struct qe_ring *r)
+{
+   return r->write_idx - r->read_idx;
+}
+
+static force_inline unsigned int
+qe_ring_free_count(const struct qe_ring *r)
+{
+   return r->size - qe_ring_count(r);
+}
+
+static force_inline unsigned int
+qe_ring_enqueue_burst(struct qe_ring *r, const struct rte_event *qes,
+   unsigned int nb_qes, uint16_t *free_count)
+{
+   const uint32_t size = r->size;
+   const uint32_t mask = r->mask;
+   const uint32_t read = r->read_idx;
+   uint32_t write = r->write_idx;
+   const uint32_t space = read + size - write;
+   uint32_t i;
+
+   if (space < nb_qes)
+   nb_qes = space;
+
+   for (i = 0; i < nb_qes; i++, write++)
+   r->ring[write & mask] = qes[i];
+
+

[dpdk-dev] [PATCH v3 11/17] event/sw: add support for linking queues to ports

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Signed-off-by: Bruce Richardson 
Signed-off-by: Harry van Haaren 
---
 drivers/event/sw/sw_evdev.c | 74 +
 1 file changed, 74 insertions(+)

diff --git a/drivers/event/sw/sw_evdev.c b/drivers/event/sw/sw_evdev.c
index 37f8846..b809d5d 100644
--- a/drivers/event/sw/sw_evdev.c
+++ b/drivers/event/sw/sw_evdev.c
@@ -36,6 +36,7 @@
 #include 
 #include 
 #include 
+#include 
 
 #include "sw_evdev.h"
 #include "iq_ring.h"
@@ -50,6 +51,77 @@ static void
 sw_info_get(struct rte_eventdev *dev, struct rte_event_dev_info *info);
 
 static int
+sw_port_link(struct rte_eventdev *dev, void *port, const uint8_t queues[],
+   const uint8_t priorities[], uint16_t num)
+{
+   struct sw_port *p = (void *)port;
+   struct sw_evdev *sw = sw_pmd_priv(dev);
+   int i;
+
+   RTE_SET_USED(priorities);
+   for (i = 0; i < num; i++) {
+   struct sw_qid *q = &sw->qids[queues[i]];
+
+   /* check for qid map overflow */
+   if (q->cq_num_mapped_cqs >= RTE_DIM(q->cq_map))
+   break;
+
+   if (p->is_directed && p->num_qids_mapped > 0)
+   break;
+
+   if (q->type == SW_SCHED_TYPE_DIRECT) {
+   /* check directed qids only map to one port */
+   if (p->num_qids_mapped > 0) {
+   rte_errno = -EDQUOT;
+   break;
+   }
+   /* check port only takes a directed flow */
+   if (num > 1) {
+   rte_errno = -EDQUOT;
+   break;
+   }
+
+   p->is_directed = 1;
+   p->num_qids_mapped = 1;
+   } else if (q->type == RTE_SCHED_TYPE_ORDERED) {
+   p->num_ordered_qids++;
+   p->num_qids_mapped++;
+   } else if (q->type == RTE_SCHED_TYPE_ATOMIC) {
+   p->num_qids_mapped++;
+   }
+
+   q->cq_map[q->cq_num_mapped_cqs] = p->id;
+   rte_smp_wmb();
+   q->cq_num_mapped_cqs++;
+   }
+   return i;
+}
+
+static int
+sw_port_unlink(struct rte_eventdev *dev, void *port, uint8_t queues[],
+   uint16_t nb_unlinks)
+{
+   struct sw_port *p = (void *)port;
+   struct sw_evdev *sw = sw_pmd_priv(dev);
+   unsigned int i, j;
+
+   int unlinked = 0;
+   for (i = 0; i < nb_unlinks; i++) {
+   struct sw_qid *q = &sw->qids[queues[i]];
+   for (j = 0; j < q->cq_num_mapped_cqs; j++)
+   if (q->cq_map[j] == p->id) {
+   q->cq_map[j] =
+   q->cq_map[q->cq_num_mapped_cqs - 1];
+   rte_smp_wmb();
+   q->cq_num_mapped_cqs--;
+   unlinked++;
+   continue;
+   }
+   }
+   return unlinked;
+}
+
+static int
 sw_port_setup(struct rte_eventdev *dev, uint8_t port_id,
const struct rte_event_port_conf *conf)
 {
@@ -384,6 +456,8 @@ sw_probe(const char *name, const char *params)
.port_def_conf = sw_port_def_conf,
.port_setup = sw_port_setup,
.port_release = sw_port_release,
+   .port_link = sw_port_link,
+   .port_unlink = sw_port_unlink,
};
 
static const char *const args[] = {
-- 
2.7.4



[dpdk-dev] [PATCH v3 14/17] event/sw: add start stop and close functions

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Signed-off-by: Bruce Richardson 
Signed-off-by: Harry van Haaren 
---
 drivers/event/sw/sw_evdev.c | 74 +
 1 file changed, 74 insertions(+)

diff --git a/drivers/event/sw/sw_evdev.c b/drivers/event/sw/sw_evdev.c
index 68c4d0b..6fa7d71 100644
--- a/drivers/event/sw/sw_evdev.c
+++ b/drivers/event/sw/sw_evdev.c
@@ -415,6 +415,77 @@ sw_info_get(struct rte_eventdev *dev, struct 
rte_event_dev_info *info)
 }
 
 static int
+sw_start(struct rte_eventdev *dev)
+{
+   unsigned int i, j;
+   struct sw_evdev *sw = sw_pmd_priv(dev);
+   /* check all ports are set up */
+   for (i = 0; i < sw->port_count; i++)
+   if (sw->ports[i].rx_worker_ring == NULL) {
+   printf("%s %d: port %d not configured\n",
+  __func__, __LINE__, i);
+   return -1;
+   }
+
+   /* check all queues are configured and mapped to ports*/
+   for (i = 0; i < sw->qid_count; i++)
+   if (sw->qids[i].iq[0] == NULL ||
+   sw->qids[i].cq_num_mapped_cqs == 0) {
+   printf("%s %d: queue %d not configured\n",
+  __func__, __LINE__, i);
+   return -1;
+   }
+
+   /* build up our prioritized array of qids */
+   /* We don't use qsort here, as if all/multiple entries have the same
+* priority, the result is non-deterministic. From "man 3 qsort":
+* "If two members compare as equal, their order in the sorted
+* array is undefined."
+*/
+   uint32_t qidx = 0;
+   for (j = 0; j <= RTE_EVENT_DEV_PRIORITY_LOWEST; j++) {
+   for (i = 0; i < sw->qid_count; i++) {
+   if (sw->qids[i].priority == j) {
+   sw->qids_prioritized[qidx] = &sw->qids[i];
+   qidx++;
+   }
+   }
+   }
+   sw->started = 1;
+   return 0;
+}
+
+static void
+sw_stop(struct rte_eventdev *dev)
+{
+   struct sw_evdev *sw = sw_pmd_priv(dev);
+   sw->started = 0;
+}
+
+static int
+sw_close(struct rte_eventdev *dev)
+{
+   struct sw_evdev *sw = sw_pmd_priv(dev);
+   uint32_t i;
+
+   for (i = 0; i < sw->qid_count; i++)
+   sw_queue_release(dev, i);
+   sw->qid_count = 0;
+
+   for (i = 0; i < sw->port_count; i++)
+   sw_port_release(&sw->ports[i]);
+   sw->port_count = 0;
+
+   memset(&sw->stats, 0, sizeof(sw->stats));
+   sw->sched_called = 0;
+   sw->sched_no_iq_enqueues = 0;
+   sw->sched_no_cq_enqueues = 0;
+   sw->sched_cq_qid_called = 0;
+
+   return 0;
+}
+
+static int
 assign_numa_node(const char *key __rte_unused, const char *value, void *opaque)
 {
int *socket_id = opaque;
@@ -450,6 +521,9 @@ sw_probe(const char *name, const char *params)
static const struct rte_eventdev_ops evdev_sw_ops = {
.dev_configure = sw_dev_configure,
.dev_infos_get = sw_info_get,
+   .dev_close = sw_close,
+   .dev_start = sw_start,
+   .dev_stop = sw_stop,
 
.queue_def_conf = sw_queue_def_conf,
.queue_setup = sw_queue_setup,
-- 
2.7.4



[dpdk-dev] [PATCH v3 13/17] event/sw: add scheduling logic

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Add in the scheduling function which takes the events from the
producer queues and buffers them before scheduling them to consumer
queues. The scheduling logic includes support for atomic, reordered,
and parallel scheduling of flows.

Signed-off-by: Bruce Richardson 
Signed-off-by: Gage Eads 
Signed-off-by: Harry van Haaren 
---
 drivers/event/sw/Makefile |   1 +
 drivers/event/sw/sw_evdev.c   |   1 +
 drivers/event/sw/sw_evdev.h   |  11 +
 drivers/event/sw/sw_evdev_scheduler.c | 602 ++
 4 files changed, 615 insertions(+)
 create mode 100644 drivers/event/sw/sw_evdev_scheduler.c

diff --git a/drivers/event/sw/Makefile b/drivers/event/sw/Makefile
index b6ecd91..a7f5b3d 100644
--- a/drivers/event/sw/Makefile
+++ b/drivers/event/sw/Makefile
@@ -54,6 +54,7 @@ EXPORT_MAP := rte_pmd_evdev_sw_version.map
 # library source files
 SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw_evdev.c
 SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw_evdev_worker.c
+SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw_evdev_scheduler.c
 
 # export include files
 SYMLINK-y-include +=
diff --git a/drivers/event/sw/sw_evdev.c b/drivers/event/sw/sw_evdev.c
index adff729..68c4d0b 100644
--- a/drivers/event/sw/sw_evdev.c
+++ b/drivers/event/sw/sw_evdev.c
@@ -530,6 +530,7 @@ sw_probe(const char *name, const char *params)
dev->enqueue_burst = sw_event_enqueue_burst;
dev->dequeue = sw_event_dequeue;
dev->dequeue_burst = sw_event_dequeue_burst;
+   dev->schedule = sw_event_schedule;
 
sw = dev->data->dev_private;
sw->data = dev->data;
diff --git a/drivers/event/sw/sw_evdev.h b/drivers/event/sw/sw_evdev.h
index ab372fd..7c157c7 100644
--- a/drivers/event/sw/sw_evdev.h
+++ b/drivers/event/sw/sw_evdev.h
@@ -248,8 +248,18 @@ struct sw_evdev {
/* Cache how many packets are in each cq */
uint16_t cq_ring_space[SW_PORTS_MAX] __rte_cache_aligned;
 
+   /* Array of pointers to load-balanced QIDs sorted by priority level */
+   struct sw_qid *qids_prioritized[RTE_EVENT_MAX_QUEUES_PER_DEV];
+
+   /* Stats */
+   struct sw_point_stats stats __rte_cache_aligned;
+   uint64_t sched_called;
int32_t sched_quanta;
+   uint64_t sched_no_iq_enqueues;
+   uint64_t sched_no_cq_enqueues;
+   uint64_t sched_cq_qid_called;
 
+   uint8_t started;
uint32_t credit_update_quanta;
 };
 
@@ -272,5 +282,6 @@ uint16_t sw_event_enqueue_burst(void *port, const struct 
rte_event ev[],
 uint16_t sw_event_dequeue(void *port, struct rte_event *ev, uint64_t wait);
 uint16_t sw_event_dequeue_burst(void *port, struct rte_event *ev, uint16_t num,
uint64_t wait);
+void sw_event_schedule(struct rte_eventdev *dev);
 
 #endif /* _SW_EVDEV_H_ */
diff --git a/drivers/event/sw/sw_evdev_scheduler.c 
b/drivers/event/sw/sw_evdev_scheduler.c
new file mode 100644
index 000..2aecc95
--- /dev/null
+++ b/drivers/event/sw/sw_evdev_scheduler.c
@@ -0,0 +1,602 @@
+/*-
+ *   BSD LICENSE
+ *
+ *   Copyright(c) 2016-2017 Intel Corporation. All rights reserved.
+ *
+ *   Redistribution and use in source and binary forms, with or without
+ *   modification, are permitted provided that the following conditions
+ *   are met:
+ *
+ * * Redistributions of source code must retain the above copyright
+ *   notice, this list of conditions and the following disclaimer.
+ * * Redistributions in binary form must reproduce the above copyright
+ *   notice, this list of conditions and the following disclaimer in
+ *   the documentation and/or other materials provided with the
+ *   distribution.
+ * * Neither the name of Intel Corporation nor the names of its
+ *   contributors may be used to endorse or promote products derived
+ *   from this software without specific prior written permission.
+ *
+ *   THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
+ *   "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
+ *   LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
+ *   A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
+ *   OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
+ *   SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
+ *   LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
+ *   DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
+ *   THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
+ *   (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
+ *   OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
+ */
+
+#include 
+#include 
+#include "sw_evdev.h"
+#include "iq_ring.h"
+#include "event_ring.h"
+
+#define SW_IQS_MASK (SW_IQS_MAX-1)
+
+/* Retrieve the highest priority IQ or -1 if no pkts available. Doing the
+ * CLZ twice is faster than caching the value due to

[dpdk-dev] [PATCH v3 16/17] event/sw: add xstats support

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Add support for xstats to report out on the state of the eventdev.
Useful for debugging and for unit tests, as well as observability
at runtime and performance tuning of apps to work well with the
scheduler.

Signed-off-by: Bruce Richardson 
Signed-off-by: Harry van Haaren 
---
 drivers/event/sw/Makefile  |   1 +
 drivers/event/sw/sw_evdev.c|   8 +
 drivers/event/sw/sw_evdev.h|  21 +-
 drivers/event/sw/sw_evdev_xstats.c | 511 +
 4 files changed, 540 insertions(+), 1 deletion(-)
 create mode 100644 drivers/event/sw/sw_evdev_xstats.c

diff --git a/drivers/event/sw/Makefile b/drivers/event/sw/Makefile
index a7f5b3d..eb0dc4c 100644
--- a/drivers/event/sw/Makefile
+++ b/drivers/event/sw/Makefile
@@ -55,6 +55,7 @@ EXPORT_MAP := rte_pmd_evdev_sw_version.map
 SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw_evdev.c
 SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw_evdev_worker.c
 SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw_evdev_scheduler.c
+SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw_evdev_xstats.c
 
 # export include files
 SYMLINK-y-include +=
diff --git a/drivers/event/sw/sw_evdev.c b/drivers/event/sw/sw_evdev.c
index 8b7d8ed..c273399 100644
--- a/drivers/event/sw/sw_evdev.c
+++ b/drivers/event/sw/sw_evdev.c
@@ -598,6 +598,8 @@ sw_start(struct rte_eventdev *dev)
}
}
}
+   if (sw_xstats_init(sw) < 0)
+   return -1;
sw->started = 1;
return 0;
 }
@@ -606,6 +608,7 @@ static void
 sw_stop(struct rte_eventdev *dev)
 {
struct sw_evdev *sw = sw_pmd_priv(dev);
+   sw_xstats_uninit(sw);
sw->started = 0;
 }
 
@@ -681,6 +684,11 @@ sw_probe(const char *name, const char *params)
.port_release = sw_port_release,
.port_link = sw_port_link,
.port_unlink = sw_port_unlink,
+
+   .xstats_get = sw_xstats_get,
+   .xstats_get_names = sw_xstats_get_names,
+   .xstats_get_by_name = sw_xstats_get_by_name,
+   .xstats_reset = sw_xstats_reset,
};
 
static const char *const args[] = {
diff --git a/drivers/event/sw/sw_evdev.h b/drivers/event/sw/sw_evdev.h
index 7c157c7..690bfa1 100644
--- a/drivers/event/sw/sw_evdev.h
+++ b/drivers/event/sw/sw_evdev.h
@@ -62,6 +62,8 @@
 
 #define SW_SCHED_TYPE_DIRECT (RTE_SCHED_TYPE_PARALLEL + 1)
 
+#define SW_NUM_POLL_BUCKETS (MAX_SW_CONS_Q_DEPTH >> SW_DEQ_STAT_BUCKET_SHIFT)
+
 enum {
QE_FLAG_VALID_SHIFT = 0,
QE_FLAG_COMPLETE_SHIFT,
@@ -203,7 +205,7 @@ struct sw_port {
uint64_t avg_pkt_ticks;  /* tracks average over NUM_SAMPLES burst */
uint64_t total_polls;/* how many polls were counted in stats */
uint64_t zero_polls; /* tracks polls returning nothing */
-   uint32_t poll_buckets[MAX_SW_CONS_Q_DEPTH >> SW_DEQ_STAT_BUCKET_SHIFT];
+   uint32_t poll_buckets[SW_NUM_POLL_BUCKETS];
/* bucket values in 4s for shorter reporting */
 
/* History list structs, containing info on pkts egressed to worker */
@@ -230,6 +232,11 @@ struct sw_evdev {
 
uint32_t port_count;
uint32_t qid_count;
+   uint32_t xstats_count;
+   struct sw_xstats_entry *xstats;
+   uint32_t xstats_count_mode_dev;
+   uint32_t xstats_count_mode_port;
+   uint32_t xstats_count_mode_queue;
 
/* Contains all ports - load balanced and directed */
struct sw_port ports[SW_PORTS_MAX] __rte_cache_aligned;
@@ -283,5 +290,17 @@ uint16_t sw_event_dequeue(void *port, struct rte_event 
*ev, uint64_t wait);
 uint16_t sw_event_dequeue_burst(void *port, struct rte_event *ev, uint16_t num,
uint64_t wait);
 void sw_event_schedule(struct rte_eventdev *dev);
+int sw_xstats_init(struct sw_evdev *dev);
+int sw_xstats_uninit(struct sw_evdev *dev);
+int sw_xstats_get_names(const struct rte_eventdev *dev,
+   enum rte_event_dev_xstats_mode mode, uint8_t queue_port_id,
+   struct rte_event_dev_xstats_name *xstats_names, unsigned int size);
+int sw_xstats_get(const struct rte_eventdev *dev,
+   enum rte_event_dev_xstats_mode mode, uint8_t queue_port_id,
+   const unsigned int ids[], uint64_t values[], unsigned int n);
+uint64_t sw_xstats_get_by_name(const struct rte_eventdev *dev,
+   const char *name, unsigned int *id);
+int sw_xstats_reset(struct rte_eventdev *dev);
+
 
 #endif /* _SW_EVDEV_H_ */
diff --git a/drivers/event/sw/sw_evdev_xstats.c 
b/drivers/event/sw/sw_evdev_xstats.c
new file mode 100644
index 000..3354522
--- /dev/null
+++ b/drivers/event/sw/sw_evdev_xstats.c
@@ -0,0 +1,511 @@
+/*-
+ *   BSD LICENSE
+ *
+ *   Copyright(c) 2016-2017 Intel Corporation. All rights reserved.
+ *
+ *   Redistribution and use in source and binary forms, with or without
+ *   modification, are permitted provided t

[dpdk-dev] [PATCH v3 17/17] app/test: add unit tests for SW eventdev driver

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Since the sw driver is a standalone lookaside device that has no HW
requirements, we can provide a set of unit tests that test its
functionality across the different queue types and with different input
scenarios.

This also adds the tests to be automatically run by autotest.py

Signed-off-by: Bruce Richardson 
Signed-off-by: David Hunt 
Signed-off-by: Harry van Haaren 
---
 app/test/Makefile   |5 +-
 app/test/autotest_data.py   |   26 +
 app/test/test_eventdev_sw.c | 2249 +++
 3 files changed, 2279 insertions(+), 1 deletion(-)
 create mode 100644 app/test/test_eventdev_sw.c

diff --git a/app/test/Makefile b/app/test/Makefile
index a426548..dc92d9c 100644
--- a/app/test/Makefile
+++ b/app/test/Makefile
@@ -197,7 +197,10 @@ SRCS-$(CONFIG_RTE_LIBRTE_CRYPTODEV) += 
test_cryptodev_blockcipher.c
 SRCS-$(CONFIG_RTE_LIBRTE_CRYPTODEV) += test_cryptodev_perf.c
 SRCS-$(CONFIG_RTE_LIBRTE_CRYPTODEV) += test_cryptodev.c
 
-SRCS-$(CONFIG_RTE_LIBRTE_EVENTDEV) += test_eventdev.c
+ifeq ($(CONFIG_RTE_LIBRTE_EVENTDEV),y)
+SRCS-y += test_eventdev.c
+SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += test_eventdev_sw.c
+endif
 
 SRCS-$(CONFIG_RTE_LIBRTE_KVARGS) += test_kvargs.c
 
diff --git a/app/test/autotest_data.py b/app/test/autotest_data.py
index 0cd598b..165ed6c 100644
--- a/app/test/autotest_data.py
+++ b/app/test/autotest_data.py
@@ -346,6 +346,32 @@ def per_sockets(num):
 non_parallel_test_group_list = [
 
 {
+"Prefix":"eventdev",
+"Memory":"512",
+"Tests":
+[
+{
+"Name":"Eventdev common autotest",
+"Command": "eventdev_common_autotest",
+"Func":default_autotest,
+"Report":  None,
+},
+]
+},
+{
+"Prefix":"eventdev_sw",
+"Memory":"512",
+"Tests":
+[
+{
+"Name":"Eventdev sw autotest",
+"Command": "eventdev_sw_autotest",
+"Func":default_autotest,
+"Report":  None,
+},
+]
+},
+{
 "Prefix":"kni",
 "Memory":"512",
 "Tests":
diff --git a/app/test/test_eventdev_sw.c b/app/test/test_eventdev_sw.c
new file mode 100644
index 000..ef6a486
--- /dev/null
+++ b/app/test/test_eventdev_sw.c
@@ -0,0 +1,2249 @@
+/*-
+ *   BSD LICENSE
+ *
+ *   Copyright(c) 2016-2017 Intel Corporation. All rights reserved.
+ *   All rights reserved.
+ *
+ *   Redistribution and use in source and binary forms, with or without
+ *   modification, are permitted provided that the following conditions
+ *   are met:
+ *
+ * * Redistributions of source code must retain the above copyright
+ *   notice, this list of conditions and the following disclaimer.
+ * * Redistributions in binary form must reproduce the above copyright
+ *   notice, this list of conditions and the following disclaimer in
+ *   the documentation and/or other materials provided with the
+ *   distribution.
+ * * Neither the name of Intel Corporation nor the names of its
+ *   contributors may be used to endorse or promote products derived
+ *   from this software without specific prior written permission.
+ *
+ *   THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS AND CONTRIBUTORS
+ *   "AS IS" AND ANY EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT
+ *   LIMITED TO, THE IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR
+ *   A PARTICULAR PURPOSE ARE DISCLAIMED. IN NO EVENT SHALL THE COPYRIGHT
+ *   OWNER OR CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL,
+ *   SPECIAL, EXEMPLARY, OR CONSEQUENTIAL DAMAGES (INCLUDING, BUT NOT
+ *   LIMITED TO, PROCUREMENT OF SUBSTITUTE GOODS OR SERVICES; LOSS OF USE,
+ *   DATA, OR PROFITS; OR BUSINESS INTERRUPTION) HOWEVER CAUSED AND ON ANY
+ *   THEORY OF LIABILITY, WHETHER IN CONTRACT, STRICT LIABILITY, OR TORT
+ *   (INCLUDING NEGLIGENCE OR OTHERWISE) ARISING IN ANY WAY OUT OF THE USE
+ *   OF THIS SOFTWARE, EVEN IF ADVISED OF THE POSSIBILITY OF SUCH DAMAGE.
+ */
+
+#include 
+#include 
+#include 
+#include 
+#include 
+#include 
+
+#include 
+#include 
+#include 
+#include 
+#include 
+#include 
+#include 
+#include 
+#include 
+
+#include 
+#include "test.h"
+
+#define MAX_PORTS 16
+#define MAX_QIDS 16
+#define NUM_PACKETS (1<<18)
+
+static int evdev;
+
+struct test {
+   struct rte_mempool *mbuf_pool;
+   uint8_t port[MAX_PORTS];
+   uint8_t qid[MAX_QIDS];
+   int nb_qids;
+};
+
+static struct rte_event release_ev = {.op = RTE_EVENT_OP_RELEASE };
+
+static inline struct rte_mbuf *
+rte_gen_arp(int portid, struct rte_mempool *mp)
+{
+   /*
+* len = 14 + 46
+* ARP, Request who-has 10.0.0.1 tell 10.0.0.2, length 46
+*/
+   static const uint8_t arp_request[] = {
+   /*0x:*/ 0xff, 0xff, 0xff, 0xff, 0xff, 0xff, 0xec, 0xa8,
+  

Re: [dpdk-dev] [PATCH v9] net/kni: add KNI PMD

2017-02-17 Thread Bruce Richardson
On Fri, Feb 17, 2017 at 02:29:51PM +, Ferruh Yigit wrote:
> On 2/17/2017 1:47 PM, Thomas Monjalon wrote:
> > 2017-02-17 13:42, Ferruh Yigit:
> >> Add KNI PMD which wraps librte_kni for ease of use.
> >>
> >> KNI PMD can be used as any regular PMD to send / receive packets to the
> >> Linux networking stack.
> >>
> >> Signed-off-by: Ferruh Yigit 
> >> Reviewed-by: Yong Wang 
> >> ---
> >>
> >> v9:
> >> * update for 17.05
> > 
> > You keep updating this patch in the hope that someone would be interested :)
> > 
> > Please let's make clear that I am OK to merge it
> > but you asked me to wait for someone supporting its inclusion.
> 
> Right, it is good to mention that I explicitly asked to wait community
> response.
> 
> I keep updating it because I believe this is something useful.
> 
> Meanwhile adding this into repo means maintenance cost, so this should
> not be merged without any usecase or interest from community.
> 
> Patch is waiting for an ACK or NAK from community.
> 
I believe this is useful. No reason for KNI to have to use special
custom rx/tx functions when it can be made to use regular ethdev ones.
So:

Acked-by: Bruce Richardson 



[dpdk-dev] [PATCH v3 12/17] event/sw: add worker core functions

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

add the event enqueue, dequeue and release functions to the eventdev.
These also include tracking of stats for observability in the load of
the scheduler.
Internally in the enqueue function, the various types of enqueue
operations, to forward an existing event, to send a new event, to
drop a previous event, are converted to a series of flags which will
be used by the scheduler code to perform the needed actions for that
event.

Signed-off-by: Bruce Richardson 
Signed-off-by: Gage Eads 
Signed-off-by: Harry van Haaren 
---
 drivers/event/sw/Makefile  |   1 +
 drivers/event/sw/sw_evdev.c|   5 +
 drivers/event/sw/sw_evdev.h|  34 +++
 drivers/event/sw/sw_evdev_worker.c | 188 +
 4 files changed, 228 insertions(+)
 create mode 100644 drivers/event/sw/sw_evdev_worker.c

diff --git a/drivers/event/sw/Makefile b/drivers/event/sw/Makefile
index d6836e3..b6ecd91 100644
--- a/drivers/event/sw/Makefile
+++ b/drivers/event/sw/Makefile
@@ -53,6 +53,7 @@ EXPORT_MAP := rte_pmd_evdev_sw_version.map
 
 # library source files
 SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw_evdev.c
+SRCS-$(CONFIG_RTE_LIBRTE_PMD_SW_EVENTDEV) += sw_evdev_worker.c
 
 # export include files
 SYMLINK-y-include +=
diff --git a/drivers/event/sw/sw_evdev.c b/drivers/event/sw/sw_evdev.c
index b809d5d..adff729 100644
--- a/drivers/event/sw/sw_evdev.c
+++ b/drivers/event/sw/sw_evdev.c
@@ -387,6 +387,7 @@ sw_dev_configure(const struct rte_eventdev *dev)
sw->qid_count = conf->nb_event_queues;
sw->port_count = conf->nb_event_ports;
sw->nb_events_limit = conf->nb_events_limit;
+   rte_atomic32_set(&sw->inflights, 0);
 
return 0;
 }
@@ -525,6 +526,10 @@ sw_probe(const char *name, const char *params)
return -EFAULT;
}
dev->dev_ops = &evdev_sw_ops;
+   dev->enqueue = sw_event_enqueue;
+   dev->enqueue_burst = sw_event_enqueue_burst;
+   dev->dequeue = sw_event_dequeue;
+   dev->dequeue_burst = sw_event_dequeue_burst;
 
sw = dev->data->dev_private;
sw->data = dev->data;
diff --git a/drivers/event/sw/sw_evdev.h b/drivers/event/sw/sw_evdev.h
index 1bedd63..ab372fd 100644
--- a/drivers/event/sw/sw_evdev.h
+++ b/drivers/event/sw/sw_evdev.h
@@ -55,12 +55,36 @@
 #define SCHED_DEQUEUE_BURST_SIZE 32
 
 #define SW_PORT_HIST_LIST (MAX_SW_PROD_Q_DEPTH) /* size of our history list */
+#define NUM_SAMPLES 64 /* how many data points use for average stats */
 
 #define EVENTDEV_NAME_SW_PMD event_sw
 #define SW_PMD_NAME RTE_STR(event_sw)
 
 #define SW_SCHED_TYPE_DIRECT (RTE_SCHED_TYPE_PARALLEL + 1)
 
+enum {
+   QE_FLAG_VALID_SHIFT = 0,
+   QE_FLAG_COMPLETE_SHIFT,
+   QE_FLAG_NOT_EOP_SHIFT,
+   _QE_FLAG_COUNT
+};
+
+#define QE_FLAG_VALID(1 << QE_FLAG_VALID_SHIFT)/* for NEW FWD, FRAG */
+#define QE_FLAG_COMPLETE (1 << QE_FLAG_COMPLETE_SHIFT) /* set for FWD, DROP  */
+#define QE_FLAG_NOT_EOP  (1 << QE_FLAG_NOT_EOP_SHIFT)  /* set for FRAG only  */
+
+static const uint8_t sw_qe_flag_map[] = {
+   QE_FLAG_VALID /* NEW Event */,
+   QE_FLAG_VALID | QE_FLAG_COMPLETE /* FWD Event */,
+   QE_FLAG_COMPLETE /* RELEASE Event */,
+
+   /* Values which can be used for future support for partial
+* events, i.e. where one event comes back to the scheduler
+* as multiple which need to be tracked together
+*/
+   QE_FLAG_VALID | QE_FLAG_COMPLETE | QE_FLAG_NOT_EOP,
+};
+
 #ifdef RTE_LIBRTE_PMD_EVDEV_SW_DEBUG
 #define SW_LOG_INFO(fmt, args...) \
RTE_LOG(INFO, EVENTDEV, "[%s] %s() line %u: " fmt "\n", \
@@ -210,6 +234,8 @@ struct sw_evdev {
/* Contains all ports - load balanced and directed */
struct sw_port ports[SW_PORTS_MAX] __rte_cache_aligned;
 
+   rte_atomic32_t inflights __rte_cache_aligned;
+
/*
 * max events in this instance. Cached here for performance.
 * (also available in data->conf.nb_events_limit)
@@ -239,4 +265,12 @@ sw_pmd_priv_const(const struct rte_eventdev *eventdev)
return eventdev->data->dev_private;
 }
 
+uint16_t sw_event_enqueue(void *port, const struct rte_event *ev);
+uint16_t sw_event_enqueue_burst(void *port, const struct rte_event ev[],
+   uint16_t num);
+
+uint16_t sw_event_dequeue(void *port, struct rte_event *ev, uint64_t wait);
+uint16_t sw_event_dequeue_burst(void *port, struct rte_event *ev, uint16_t num,
+   uint64_t wait);
+
 #endif /* _SW_EVDEV_H_ */
diff --git a/drivers/event/sw/sw_evdev_worker.c 
b/drivers/event/sw/sw_evdev_worker.c
new file mode 100644
index 000..aed1597
--- /dev/null
+++ b/drivers/event/sw/sw_evdev_worker.c
@@ -0,0 +1,188 @@
+/*-
+ *   BSD LICENSE
+ *
+ *   Copyright(c) 2016-2017 Intel Corporation. All rights reserved.
+ *
+ *   Redistribution and use in source and binary forms, with or without
+ *   modification, are permitted

Re: [dpdk-dev] [PATCH] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Bruce Richardson
On Fri, Feb 17, 2017 at 08:44:26AM -0600, Keith Wiles wrote:
> Calling strncpy with a maximum size argument of 16 bytes on destination
> array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
> destination string unterminated.
> 
> Signed-off-by: Keith Wiles 
> ---
>  drivers/net/tap/rte_eth_tap.c | 2 +-
>  1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/drivers/net/tap/rte_eth_tap.c b/drivers/net/tap/rte_eth_tap.c
> index efc4426..f9938d7 100644
> --- a/drivers/net/tap/rte_eth_tap.c
> +++ b/drivers/net/tap/rte_eth_tap.c
> @@ -297,7 +297,7 @@ tap_link_set_flags(struct pmd_internals *pmd, short 
> flags, int add)
>   return -1;
>   }
>   memset(&ifr, 0, sizeof(ifr));
> - strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
> + strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ-1);
This is why I always prefer to use snprintf for copying strings, you
can't avoid null terminating.

snprintf(ifr.ifr_name, IFNAMSIZ, "%s", pmd->name);

/Bruce



[dpdk-dev] [PATCH v3 15/17] event/sw: add dump function for easier debugging

2017-02-17 Thread Harry van Haaren
From: Bruce Richardson 

Segfault issue resolved when only partially configured and
rte_event_dev_dump() is called before start(),
Reported-by: Vipin Varghese 

Signed-off-by: Bruce Richardson 
Signed-off-by: Harry van Haaren 
---
 drivers/event/sw/sw_evdev.c | 148 
 1 file changed, 148 insertions(+)

diff --git a/drivers/event/sw/sw_evdev.c b/drivers/event/sw/sw_evdev.c
index 6fa7d71..8b7d8ed 100644
--- a/drivers/event/sw/sw_evdev.c
+++ b/drivers/event/sw/sw_evdev.c
@@ -414,6 +414,153 @@ sw_info_get(struct rte_eventdev *dev, struct 
rte_event_dev_info *info)
*info = evdev_sw_info;
 }
 
+static void
+sw_dump(struct rte_eventdev *dev, FILE *f)
+{
+   const struct sw_evdev *sw = sw_pmd_priv(dev);
+
+   static const char * const q_type_strings[] = {
+   "Ordered", "Atomic", "Parallel", "Directed"
+   };
+   uint32_t i;
+   fprintf(f, "EventDev %s: ports %d, qids %d\n", "todo-fix-name",
+   sw->port_count, sw->qid_count);
+
+   fprintf(f, "\trx   %"PRIu64"\n\tdrop %"PRIu64"\n\ttx   %"PRIu64"\n",
+   sw->stats.rx_pkts, sw->stats.rx_dropped, sw->stats.tx_pkts);
+   fprintf(f, "\tsched calls: %"PRIu64"\n", sw->sched_called);
+   fprintf(f, "\tsched cq/qid call: %"PRIu64"\n", sw->sched_cq_qid_called);
+   fprintf(f, "\tsched no IQ enq: %"PRIu64"\n", sw->sched_no_iq_enqueues);
+   fprintf(f, "\tsched no CQ enq: %"PRIu64"\n", sw->sched_no_cq_enqueues);
+   uint32_t inflights = rte_atomic32_read(&sw->inflights);
+   uint32_t credits = sw->nb_events_limit - inflights;
+   fprintf(f, "\tinflight %d, credits: %d\n", inflights, credits);
+
+#define COL_RED "\x1b[31m"
+#define COL_RESET "\x1b[0m"
+
+   for (i = 0; i < sw->port_count; i++) {
+   int max, j;
+   const struct sw_port *p = &sw->ports[i];
+   if (!p->initialized) {
+   fprintf(f, "  %sPort %d not initialized.%s\n",
+   COL_RED, i, COL_RESET);
+   continue;
+   }
+   fprintf(f, "  Port %d %s\n", i,
+   p->is_directed ? " (SingleCons)" : "");
+   fprintf(f, "\trx   %"PRIu64"\tdrop %"PRIu64"\ttx   %"PRIu64
+   "\t%sinflight %d%s\n", sw->ports[i].stats.rx_pkts,
+   sw->ports[i].stats.rx_dropped,
+   sw->ports[i].stats.tx_pkts,
+   (p->inflights == p->inflight_max) ?
+   COL_RED : COL_RESET,
+   sw->ports[i].inflights, COL_RESET);
+
+   fprintf(f, "\tMax New: %u"
+   "\tAvg cycles PP: %"PRIu64"\tCredits: %u\n",
+   sw->ports[i].inflight_max,
+   sw->ports[i].avg_pkt_ticks,
+   sw->ports[i].inflight_credits);
+   fprintf(f, "\tReceive burst distribution:\n");
+   float zp_percent = p->zero_polls * 100.0 / p->total_polls;
+   fprintf(f, zp_percent < 10 ? "\t\t0:%.02f%% " : "\t\t0:%.0f%% ",
+   zp_percent);
+   for (max = (int)RTE_DIM(p->poll_buckets); max-- > 0;)
+   if (p->poll_buckets[max] != 0)
+   break;
+   for (j = 0; j <= max; j++) {
+   if (p->poll_buckets[j] != 0) {
+   float poll_pc = p->poll_buckets[j] * 100.0 /
+   p->total_polls;
+   fprintf(f, "%u-%u:%.02f%% ",
+   ((j << SW_DEQ_STAT_BUCKET_SHIFT) + 1),
+   ((j+1) << SW_DEQ_STAT_BUCKET_SHIFT),
+   poll_pc);
+   }
+   }
+   fprintf(f, "\n");
+
+   if (p->rx_worker_ring) {
+   uint64_t used = qe_ring_count(p->rx_worker_ring);
+   uint64_t space = qe_ring_free_count(p->rx_worker_ring);
+   const char *col = (space == 0) ? COL_RED : COL_RESET;
+   fprintf(f, "\t%srx ring used: %4"PRIu64"\tfree: %4"
+   PRIu64 COL_RESET"\n", col, used, space);
+   } else
+   fprintf(f, "\trx ring not initialized.\n");
+
+   if (p->cq_worker_ring) {
+   uint64_t used = qe_ring_count(p->cq_worker_ring);
+   uint64_t space = qe_ring_free_count(p->cq_worker_ring);
+   const char *col = (space == 0) ? COL_RED : COL_RESET;
+   fprintf(f, "\t%scq ring used: %4"PRIu64"\tfree: %4"
+   PRIu64 COL_RESET"\n", col, used, space);
+   } else
+   fprintf(f, "\tcq ring not initialized.\n");
+   }
+
+  

Re: [dpdk-dev] [PATCH v4] eal: Support running as unprivileged user

2017-02-17 Thread Sergio Gonzalez Monroy

On 31/01/2017 17:44, Ben Walker wrote:

For Linux kernel 4.0 and newer, the ability to obtain
physical page frame numbers for unprivileged users from
/proc/self/pagemap was removed. Instead, when an IOMMU
is present, simply choose our own DMA addresses instead.

Signed-off-by: Ben Walker 
---
  lib/librte_eal/common/eal_private.h  | 12 +
  lib/librte_eal/linuxapp/eal/eal_memory.c | 75 +++-
  lib/librte_eal/linuxapp/eal/eal_pci.c|  6 ++-
  3 files changed, 71 insertions(+), 22 deletions(-)


Acked-by: Sergio Gonzalez Monroy 

PS: Please keep a summary of changes made in each version on future 
patch sets (after the triple dash --- )


Re: [dpdk-dev] [PATCH] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Wiles, Keith

> On Feb 17, 2017, at 9:02 AM, Richardson, Bruce  
> wrote:
> 
> On Fri, Feb 17, 2017 at 08:44:26AM -0600, Keith Wiles wrote:
>> Calling strncpy with a maximum size argument of 16 bytes on destination
>> array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
>> destination string unterminated.
>> 
>> Signed-off-by: Keith Wiles 
>> ---
>> drivers/net/tap/rte_eth_tap.c | 2 +-
>> 1 file changed, 1 insertion(+), 1 deletion(-)
>> 
>> diff --git a/drivers/net/tap/rte_eth_tap.c b/drivers/net/tap/rte_eth_tap.c
>> index efc4426..f9938d7 100644
>> --- a/drivers/net/tap/rte_eth_tap.c
>> +++ b/drivers/net/tap/rte_eth_tap.c
>> @@ -297,7 +297,7 @@ tap_link_set_flags(struct pmd_internals *pmd, short 
>> flags, int add)
>>  return -1;
>>  }
>>  memset(&ifr, 0, sizeof(ifr));
>> -strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
>> +strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ-1);
> This is why I always prefer to use snprintf for copying strings, you
> can't avoid null terminating.

Normally I use snprintf to not sure why I reverted to strncpy. Maybe leftover 
from a previous driver I used as the template.

> 
>   snprintf(ifr.ifr_name, IFNAMSIZ, "%s", pmd->name);
> 
>   /Bruce

Regards,
Keith



Re: [dpdk-dev] [PATCH] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Bruce Richardson
On Fri, Feb 17, 2017 at 03:05:40PM +, Wiles, Keith wrote:
> 
> > On Feb 17, 2017, at 9:02 AM, Richardson, Bruce  
> > wrote:
> > 
> > On Fri, Feb 17, 2017 at 08:44:26AM -0600, Keith Wiles wrote:
> >> Calling strncpy with a maximum size argument of 16 bytes on destination
> >> array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
> >> destination string unterminated.
> >> 
> >> Signed-off-by: Keith Wiles 
> >> ---
> >> drivers/net/tap/rte_eth_tap.c | 2 +-
> >> 1 file changed, 1 insertion(+), 1 deletion(-)
> >> 
> >> diff --git a/drivers/net/tap/rte_eth_tap.c b/drivers/net/tap/rte_eth_tap.c
> >> index efc4426..f9938d7 100644
> >> --- a/drivers/net/tap/rte_eth_tap.c
> >> +++ b/drivers/net/tap/rte_eth_tap.c
> >> @@ -297,7 +297,7 @@ tap_link_set_flags(struct pmd_internals *pmd, short 
> >> flags, int add)
> >>return -1;
> >>}
> >>memset(&ifr, 0, sizeof(ifr));
> >> -  strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
> >> +  strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ-1);
> > This is why I always prefer to use snprintf for copying strings, you
> > can't avoid null terminating.
> 
> Normally I use snprintf to not sure why I reverted to strncpy. Maybe leftover 
> from a previous driver I used as the template.
> 
Is there a case to be made that DPDK should provide a strlcpy function
in the linuxapp EAL? [Assuming we don't want a dependency on libbsd?]
I find strncpy a horribly-error prone function to use - worse than
strcpy, since it gives a false sense of safety.

/Bruce


Re: [dpdk-dev] [PATCH] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Ferruh Yigit
On 2/17/2017 3:05 PM, Wiles, Keith wrote:
> 
>> On Feb 17, 2017, at 9:02 AM, Richardson, Bruce  
>> wrote:
>>
>> On Fri, Feb 17, 2017 at 08:44:26AM -0600, Keith Wiles wrote:
>>> Calling strncpy with a maximum size argument of 16 bytes on destination
>>> array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
>>> destination string unterminated.
>>>
>>> Signed-off-by: Keith Wiles 
>>> ---
>>> drivers/net/tap/rte_eth_tap.c | 2 +-
>>> 1 file changed, 1 insertion(+), 1 deletion(-)
>>>
>>> diff --git a/drivers/net/tap/rte_eth_tap.c b/drivers/net/tap/rte_eth_tap.c
>>> index efc4426..f9938d7 100644
>>> --- a/drivers/net/tap/rte_eth_tap.c
>>> +++ b/drivers/net/tap/rte_eth_tap.c
>>> @@ -297,7 +297,7 @@ tap_link_set_flags(struct pmd_internals *pmd, short 
>>> flags, int add)
>>> return -1;
>>> }
>>> memset(&ifr, 0, sizeof(ifr));
>>> -   strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
>>> +   strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ-1);
>> This is why I always prefer to use snprintf for copying strings, you
>> can't avoid null terminating.
> 
> Normally I use snprintf to not sure why I reverted to strncpy. Maybe leftover 
> from a previous driver I used as the template.

Since you are already updating that line, do you prefer to convert it to
snprintf instead of above modification?

> 
>>
>>  snprintf(ifr.ifr_name, IFNAMSIZ, "%s", pmd->name);
>>
>>  /Bruce
> 
> Regards,
> Keith
> 



[dpdk-dev] [PATCH 1/2] ethdev: clarify API comments of Rx queue count

2017-02-17 Thread Olivier Matz
The API comments are not consistent between each other.

The function rte_eth_rx_queue_count() returns the number of used
descriptors on a receive queue.

Signed-off-by: Olivier Matz 
Acked-by: Ferruh Yigit 
---

This commit was part of RFC patchset that will be reworked:
http://dpdk.org/dev/patchwork/patch/17233/
It can be processed separately.


 lib/librte_ether/rte_ethdev.h | 11 +++
 1 file changed, 7 insertions(+), 4 deletions(-)

diff --git a/lib/librte_ether/rte_ethdev.h b/lib/librte_ether/rte_ethdev.h
index 97f3e2d..6ef614d 100644
--- a/lib/librte_ether/rte_ethdev.h
+++ b/lib/librte_ether/rte_ethdev.h
@@ -1174,7 +1174,7 @@ typedef void (*eth_queue_release_t)(void *queue);
 
 typedef uint32_t (*eth_rx_queue_count_t)(struct rte_eth_dev *dev,
 uint16_t rx_queue_id);
-/**< @internal Get number of available descriptors on a receive queue of an 
Ethernet device. */
+/**< @internal Get number of used descriptors on a receive queue. */
 
 typedef int (*eth_rx_descriptor_done_t)(void *rxq, uint16_t offset);
 /**< @internal Check DD bit of specific RX descriptor */
@@ -1481,7 +1481,8 @@ struct eth_dev_ops {
eth_queue_stop_t   tx_queue_stop; /**< Stop TX for a queue. */
eth_rx_queue_setup_t   rx_queue_setup;/**< Set up device RX queue. 
*/
eth_queue_release_trx_queue_release; /**< Release RX queue. */
-   eth_rx_queue_count_t   rx_queue_count;/**< Get Rx queue count. */
+   eth_rx_queue_count_t   rx_queue_count;
+   /**< Get the number of used RX descriptors. */
eth_rx_descriptor_done_t   rx_descriptor_done; /**< Check rxd DD bit. */
eth_rx_enable_intr_t   rx_queue_intr_enable;  /**< Enable Rx queue 
interrupt. */
eth_rx_disable_intr_t  rx_queue_intr_disable; /**< Disable Rx queue 
interrupt. */
@@ -2723,7 +2724,7 @@ rte_eth_rx_burst(uint8_t port_id, uint16_t queue_id,
 }
 
 /**
- * Get the number of used descriptors in a specific queue
+ * Get the number of used descriptors of a rx queue
  *
  * @param port_id
  *  The port identifier of the Ethernet device.
@@ -2738,9 +2739,11 @@ static inline int
 rte_eth_rx_queue_count(uint8_t port_id, uint16_t queue_id)
 {
struct rte_eth_dev *dev = &rte_eth_devices[port_id];
+
RTE_ETH_VALID_PORTID_OR_ERR_RET(port_id, -EINVAL);
RTE_FUNC_PTR_OR_ERR_RET(*dev->dev_ops->rx_queue_count, -ENOTSUP);
-return (*dev->dev_ops->rx_queue_count)(dev, queue_id);
+
+   return (*dev->dev_ops->rx_queue_count)(dev, queue_id);
 }
 
 /**
-- 
2.8.1



[dpdk-dev] [PATCH 2/2] ethdev: move queue id check in generic layer

2017-02-17 Thread Olivier Matz
The check of queue_id is done in all drivers implementing
rte_eth_rx_queue_count(). Factorize this check in the generic function.

Note that the nfp driver was doing the check differently, which could
induce crashes if the queue index was too big.

Signed-off-by: Olivier Matz 
---

This commit was part of RFC patchset that will be reworked:
http://dpdk.org/dev/patchwork/patch/17232/
It can be processed separately.

rfc->v1:
- validate port id before accessing &rte_eth_devices[port_id]
- add enic driver


 drivers/net/e1000/em_rxtx.c| 5 -
 drivers/net/e1000/igb_rxtx.c   | 5 -
 drivers/net/enic/enic_ethdev.c | 5 -
 drivers/net/i40e/i40e_rxtx.c   | 5 -
 drivers/net/ixgbe/ixgbe_rxtx.c | 5 -
 drivers/net/nfp/nfp_net.c  | 5 -
 lib/librte_ether/rte_ethdev.h  | 7 +--
 7 files changed, 5 insertions(+), 32 deletions(-)

diff --git a/drivers/net/e1000/em_rxtx.c b/drivers/net/e1000/em_rxtx.c
index d099d6a..a265cb4 100644
--- a/drivers/net/e1000/em_rxtx.c
+++ b/drivers/net/e1000/em_rxtx.c
@@ -1436,11 +1436,6 @@ eth_em_rx_queue_count(struct rte_eth_dev *dev, uint16_t 
rx_queue_id)
struct em_rx_queue *rxq;
uint32_t desc = 0;
 
-   if (rx_queue_id >= dev->data->nb_rx_queues) {
-   PMD_RX_LOG(DEBUG, "Invalid RX queue_id=%d", rx_queue_id);
-   return 0;
-   }
-
rxq = dev->data->rx_queues[rx_queue_id];
rxdp = &(rxq->rx_ring[rxq->rx_tail]);
 
diff --git a/drivers/net/e1000/igb_rxtx.c b/drivers/net/e1000/igb_rxtx.c
index c9cf392..1bb4d85 100644
--- a/drivers/net/e1000/igb_rxtx.c
+++ b/drivers/net/e1000/igb_rxtx.c
@@ -1569,11 +1569,6 @@ eth_igb_rx_queue_count(struct rte_eth_dev *dev, uint16_t 
rx_queue_id)
struct igb_rx_queue *rxq;
uint32_t desc = 0;
 
-   if (rx_queue_id >= dev->data->nb_rx_queues) {
-   PMD_RX_LOG(ERR, "Invalid RX queue id=%d", rx_queue_id);
-   return 0;
-   }
-
rxq = dev->data->rx_queues[rx_queue_id];
rxdp = &(rxq->rx_ring[rxq->rx_tail]);
 
diff --git a/drivers/net/enic/enic_ethdev.c b/drivers/net/enic/enic_ethdev.c
index bffa870..2c2e29e 100644
--- a/drivers/net/enic/enic_ethdev.c
+++ b/drivers/net/enic/enic_ethdev.c
@@ -272,11 +272,6 @@ static uint32_t enicpmd_dev_rx_queue_count(struct 
rte_eth_dev *dev,
uint16_t cq_idx;
int rq_num;
 
-   if (rx_queue_id >= dev->data->nb_rx_queues) {
-   dev_err(enic, "Invalid RX queue id=%d", rx_queue_id);
-   return 0;
-   }
-
rq_num = enic_rte_rq_idx_to_sop_idx(rx_queue_id);
cq = &enic->cq[enic_cq_rq(enic, rq_num)];
cq_idx = cq->to_clean;
diff --git a/drivers/net/i40e/i40e_rxtx.c b/drivers/net/i40e/i40e_rxtx.c
index 48429cc..ec64a20 100644
--- a/drivers/net/i40e/i40e_rxtx.c
+++ b/drivers/net/i40e/i40e_rxtx.c
@@ -1876,11 +1876,6 @@ i40e_dev_rx_queue_count(struct rte_eth_dev *dev, 
uint16_t rx_queue_id)
struct i40e_rx_queue *rxq;
uint16_t desc = 0;
 
-   if (unlikely(rx_queue_id >= dev->data->nb_rx_queues)) {
-   PMD_DRV_LOG(ERR, "Invalid RX queue id %u", rx_queue_id);
-   return 0;
-   }
-
rxq = dev->data->rx_queues[rx_queue_id];
rxdp = &(rxq->rx_ring[rxq->rx_tail]);
while ((desc < rxq->nb_rx_desc) &&
diff --git a/drivers/net/ixgbe/ixgbe_rxtx.c b/drivers/net/ixgbe/ixgbe_rxtx.c
index 9502432..8a8da65 100644
--- a/drivers/net/ixgbe/ixgbe_rxtx.c
+++ b/drivers/net/ixgbe/ixgbe_rxtx.c
@@ -2911,11 +2911,6 @@ ixgbe_dev_rx_queue_count(struct rte_eth_dev *dev, 
uint16_t rx_queue_id)
struct ixgbe_rx_queue *rxq;
uint32_t desc = 0;
 
-   if (rx_queue_id >= dev->data->nb_rx_queues) {
-   PMD_RX_LOG(ERR, "Invalid RX queue id=%d", rx_queue_id);
-   return 0;
-   }
-
rxq = dev->data->rx_queues[rx_queue_id];
rxdp = &(rxq->rx_ring[rxq->rx_tail]);
 
diff --git a/drivers/net/nfp/nfp_net.c b/drivers/net/nfp/nfp_net.c
index d79f262..f8ed976 100644
--- a/drivers/net/nfp/nfp_net.c
+++ b/drivers/net/nfp/nfp_net.c
@@ -1184,11 +1184,6 @@ nfp_net_rx_queue_count(struct rte_eth_dev *dev, uint16_t 
queue_idx)
 
rxq = (struct nfp_net_rxq *)dev->data->rx_queues[queue_idx];
 
-   if (rxq == NULL) {
-   PMD_INIT_LOG(ERR, "Bad queue: %u", queue_idx);
-   return 0;
-   }
-
idx = rxq->rd_p;
 
count = 0;
diff --git a/lib/librte_ether/rte_ethdev.h b/lib/librte_ether/rte_ethdev.h
index 6ef614d..4be217c 100644
--- a/lib/librte_ether/rte_ethdev.h
+++ b/lib/librte_ether/rte_ethdev.h
@@ -2732,16 +2732,19 @@ rte_eth_rx_burst(uint8_t port_id, uint16_t queue_id,
  *  The queue id on the specific port.
  * @return
  *  The number of used descriptors in the specific queue, or:
- * (-EINVAL) if *port_id* is invalid
+ * (-EINVAL) if *port_id* or *queue_id* is invalid
  * (-ENOTSUP) if the device does not support this function
  */
 static inline int
 rte_eth_rx_queue_count(uint8_t port_id, 

[dpdk-dev] [PATCH v3] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Keith Wiles
Calling strncpy with a maximum size argument of 16 bytes on destination
array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
destination string unterminated.

Signed-off-by: Keith Wiles 
---
v3 - convert strncpy to use snprintf instead.
v2 - fix the checkpatch warning no spaces around '-'
v1 - fix coverity warning on strncpy using invalid length.

 drivers/net/tap/rte_eth_tap.c | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/drivers/net/tap/rte_eth_tap.c b/drivers/net/tap/rte_eth_tap.c
index efc4426..cf1cea3 100644
--- a/drivers/net/tap/rte_eth_tap.c
+++ b/drivers/net/tap/rte_eth_tap.c
@@ -297,7 +297,7 @@ tap_link_set_flags(struct pmd_internals *pmd, short flags, 
int add)
return -1;
}
memset(&ifr, 0, sizeof(ifr));
-   strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
+   strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ - 1);
err = ioctl(s, SIOCGIFFLAGS, &ifr);
if (err < 0) {
RTE_LOG(WARNING, PMD, "Unable to get %s device flags: %s\n",
-- 
2.8.0.GIT



Re: [dpdk-dev] [PATCH v3] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Wiles, Keith

> On Feb 17, 2017, at 9:37 AM, Keith Wiles  wrote:
> 
> Calling strncpy with a maximum size argument of 16 bytes on destination
> array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
> destination string unterminated.
> 
> Signed-off-by: Keith Wiles 
> ---
> v3 - convert strncpy to use snprintf instead.
> v2 - fix the checkpatch warning no spaces around '-'
> v1 - fix coverity warning on strncpy using invalid length.
> 
> drivers/net/tap/rte_eth_tap.c | 2 +-
> 1 file changed, 1 insertion(+), 1 deletion(-)
> 
> diff --git a/drivers/net/tap/rte_eth_tap.c b/drivers/net/tap/rte_eth_tap.c
> index efc4426..cf1cea3 100644
> --- a/drivers/net/tap/rte_eth_tap.c
> +++ b/drivers/net/tap/rte_eth_tap.c
> @@ -297,7 +297,7 @@ tap_link_set_flags(struct pmd_internals *pmd, short 
> flags, int add)
>   return -1;
>   }
>   memset(&ifr, 0, sizeof(ifr));
> - strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
> + strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ - 1);
>   err = ioctl(s, SIOCGIFFLAGS, &ifr);
>   if (err < 0) {
>   RTE_LOG(WARNING, PMD, "Unable to get %s device flags: %s\n”,

NAK :-( forgot to finish the rebase

> -- 
> 2.8.0.GIT
> 

Regards,
Keith



[dpdk-dev] [PATCH v4] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Keith Wiles
Calling strncpy with a maximum size argument of 16 bytes on destination
array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
destination string unterminated.

Signed-off-by: Keith Wiles 
---
v4 - Forgot to finish rebase
v3 - convert strncpy to use snprintf instead.
v2 - fix the checkpatch warning no spaces around '-'
v1 - fix coverity warning on strncpy using invalid length.

 drivers/net/tap/rte_eth_tap.c | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/drivers/net/tap/rte_eth_tap.c b/drivers/net/tap/rte_eth_tap.c
index efc4426..47a7060 100644
--- a/drivers/net/tap/rte_eth_tap.c
+++ b/drivers/net/tap/rte_eth_tap.c
@@ -129,7 +129,7 @@ tun_alloc(struct pmd_internals *pmd, uint16_t qid)
memset(&ifr, 0, sizeof(struct ifreq));
 
ifr.ifr_flags = IFF_TAP | IFF_NO_PI;
-   strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
+   snprintf(ifr.ifr_name, IFNAMSIZ, "%s", pmd->name);
 
RTE_LOG(DEBUG, PMD, "ifr_name '%s'\n", ifr.ifr_name);
 
@@ -297,7 +297,7 @@ tap_link_set_flags(struct pmd_internals *pmd, short flags, 
int add)
return -1;
}
memset(&ifr, 0, sizeof(ifr));
-   strncpy(ifr.ifr_name, pmd->name, IFNAMSIZ);
+   snprintf(ifr.ifr_name, IFNAMSIZ, "%s", pmd->name);
err = ioctl(s, SIOCGIFFLAGS, &ifr);
if (err < 0) {
RTE_LOG(WARNING, PMD, "Unable to get %s device flags: %s\n",
-- 
2.8.0.GIT



[dpdk-dev] [PATCH] doc: add default values of install variables

2017-02-17 Thread Thomas Monjalon
The variables DESTDIR and prefix are used with "make install"
to copy the files in $DESTDIR$prefix.
Their default values will be shown when calling "make help".

Signed-off-by: Thomas Monjalon 
---
 doc/build-sdk-quick.txt | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/doc/build-sdk-quick.txt b/doc/build-sdk-quick.txt
index 967ff09..57356d3 100644
--- a/doc/build-sdk-quick.txt
+++ b/doc/build-sdk-quick.txt
@@ -20,8 +20,8 @@ Build variables
V verbose
D debug dependencies
O build directory (default: build/ - install T= default: ./)
-   DESTDIR   staging install directory
-   prefixroot install directory
+   DESTDIR   staging install directory (default: empty)
+   prefixroot install directory (default: /usr/local)
T target template - used with config or install
format: 
templates in config/defconfig_*
-- 
2.7.0



Re: [dpdk-dev] [PATCH 2/2] net/ixgbe: add mac type check for all filters

2017-02-17 Thread Ferruh Yigit
On 2/13/2017 7:35 AM, Wei Zhao wrote:
> All kinds of filter need to hardware mac type check
> to make sure the hardware support that type of fliter.
> If not, it may cause serious issue.
> 
> Fixes: 11777435c727 ("net/ixgbe: parse flow director filter")
> Fixes: 672be56d76a2 ("net/ixgbe: parse n-tuple filter")
> Fixes: eb3539fc8550 ("net/ixgbe: parse ethertype filter")
> Fixes: 429f6ebb42cc ("net/ixgbe: parse TCP SYN filter")
> 
> Signed-off-by: Wei Zhao 
> Signed-off-by: Wenzhuo Lu 
> ---
>  drivers/net/ixgbe/ixgbe_flow.c | 129 
> +
>  1 file changed, 65 insertions(+), 64 deletions(-)
> 
> diff --git a/drivers/net/ixgbe/ixgbe_flow.c b/drivers/net/ixgbe/ixgbe_flow.c
> index 5a634d3..f414fa8 100644
> --- a/drivers/net/ixgbe/ixgbe_flow.c
> +++ b/drivers/net/ixgbe/ixgbe_flow.c
> @@ -84,11 +84,12 @@ cons_parse_ntuple_filter(const struct rte_flow_attr *attr,
>   struct rte_eth_ntuple_filter *filter,
>   struct rte_flow_error *error);
>  static int
> -ixgbe_parse_ntuple_filter(const struct rte_flow_attr *attr,
> - const struct rte_flow_item pattern[],
> - const struct rte_flow_action actions[],
> - struct rte_eth_ntuple_filter *filter,
> - struct rte_flow_error *error);
> +ixgbe_parse_ntuple_filter(struct rte_eth_dev *dev,
> +   const struct rte_flow_attr *attr,
> +   const struct rte_flow_item pattern[],
> +   const struct rte_flow_action actions[],
> +   struct rte_eth_ntuple_filter *filter,
> +   struct rte_flow_error *error);

Hi Wei,

You don't need these function declarations at all. What do you think
removing these first, in a separate patch, and won't need to update them
here?

Also it is possible to remove all function declarations if you move
"ixgbe_flow_ops" at the end of the file, that would be something I
prefer, but it is your call.

Thanks,
ferruh


Re: [dpdk-dev] [PATCH] examples: ethtool: Link against librte_pmd_ixgbe if necessary

2017-02-17 Thread Remy Horton


On 16/02/2017 16:17, Markos Chandras wrote:

The librte_ethtool library depends on librte_pmd_ixgbe if that
pmd driver is enabled so we need to link against it when we compile
the ethtool application. It fixes the following build problem:


For some reason this is not an issue with my Fedora box, so I'm guessing 
SUSE is stricter with sub-depenencies of libraries. Does this affect any 
of the OpenSUSE Linux distributions?


..Remy


Re: [dpdk-dev] [PATCH v4] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Ferruh Yigit
On 2/17/2017 3:43 PM, Keith Wiles wrote:
> Calling strncpy with a maximum size argument of 16 bytes on destination
> array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
> destination string unterminated.
> 
> Signed-off-by: Keith Wiles 

net/tap: fix possibly unterminated string

Coverity issue: 1407499
Fixes: 6b38b2725cdb ("net/tap: fix multi-queue support")
Cc: sta...@dpdk.org

Applied to dpdk-next-net/master, thanks.


(Updates:
- patch title:
It is preferred to mention from problem solved instead of the tool that
found it.

- Added coverity tag:
This helps to trace coverity issues, defined syntax is:

Coverity issue: xxx
Fixes: 

- Added Cc: tag for stable tree:
In case stable tree wants get this patch, to make patch visible.
)


Re: [dpdk-dev] [PATCH] examples: ethtool: Link against librte_pmd_ixgbe if necessary

2017-02-17 Thread Markos Chandras
On 02/17/2017 04:11 PM, Remy Horton wrote:
> 
> On 16/02/2017 16:17, Markos Chandras wrote:
>> The librte_ethtool library depends on librte_pmd_ixgbe if that
>> pmd driver is enabled so we need to link against it when we compile
>> the ethtool application. It fixes the following build problem:
> 
> For some reason this is not an issue with my Fedora box, so I'm guessing
> SUSE is stricter with sub-depenencies of libraries. Does this affect any
> of the OpenSUSE Linux distributions?
> 
> ..Remy

Hi Remy,

Yeah I am seen this problem in the openSUSE Tumbleweed distribution. I
think Nirmoy may have seen that in openSUSE Leap but I will let him
confirm that.

-- 
markos

SUSE LINUX GmbH | GF: Felix Imendörffer, Jane Smithard, Graham Norton
HRB 21284 (AG Nürnberg) Maxfeldstr. 5, D-90409, Nürnberg


Re: [dpdk-dev] [PATCH v4] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Wiles, Keith

> On Feb 17, 2017, at 10:21 AM, Yigit, Ferruh  wrote:
> 
> On 2/17/2017 3:43 PM, Keith Wiles wrote:
>> Calling strncpy with a maximum size argument of 16 bytes on destination
>> array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
>> destination string unterminated.
>> 
>> Signed-off-by: Keith Wiles 
> 
>net/tap: fix possibly unterminated string
> 
>Coverity issue: 1407499
>Fixes: 6b38b2725cdb ("net/tap: fix multi-queue support")
>Cc: sta...@dpdk.org
> 
> Applied to dpdk-next-net/master, thanks.
> 
> 
> (Updates:
> - patch title:
> It is preferred to mention from problem solved instead of the tool that
> found it.
> 
> - Added coverity tag:
> This helps to trace coverity issues, defined syntax is:
> 
>Coverity issue: xxx
>Fixes: 
> 
> - Added Cc: tag for stable tree:
> In case stable tree wants get this patch, to make patch visible.

I agree this is good, but to many rules not listed or checked in the tools. We 
need a much easier method to submit patches in the format that is defined and 
checked.

Today it is way to hard to know every little internal format for every type of 
patch. We need to fix this problem to make it easier to submit patches to 
dpdk.org, we can not continue like this as we grow it will become way to much 
work for the repo maintainers and the submitter.

Using a better tool then submitting via email seems like a better solution as 
long as we can add the given checks to the tool. Using a tools should also 
reduce the email traffic for most everyone, but we need to allow anyone to ask 
for all of the commits to the repo or pull requests like patches.

How can we handle these types of issues in the future?

> )

Regards,
Keith



Re: [dpdk-dev] [PATCH v4] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Ferruh Yigit
On 2/17/2017 4:34 PM, Wiles, Keith wrote:
> 
>> On Feb 17, 2017, at 10:21 AM, Yigit, Ferruh  wrote:
>>
>> On 2/17/2017 3:43 PM, Keith Wiles wrote:
>>> Calling strncpy with a maximum size argument of 16 bytes on destination
>>> array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
>>> destination string unterminated.
>>>
>>> Signed-off-by: Keith Wiles 
>>
>>net/tap: fix possibly unterminated string
>>
>>Coverity issue: 1407499
>>Fixes: 6b38b2725cdb ("net/tap: fix multi-queue support")
>>Cc: sta...@dpdk.org
>>
>> Applied to dpdk-next-net/master, thanks.
>>
>>
>> (Updates:
>> - patch title:
>> It is preferred to mention from problem solved instead of the tool that
>> found it.
>>
>> - Added coverity tag:
>> This helps to trace coverity issues, defined syntax is:
>>
>>Coverity issue: xxx
>>Fixes: 
>>
>> - Added Cc: tag for stable tree:
>> In case stable tree wants get this patch, to make patch visible.
> 
> I agree this is good, but to many rules not listed or checked in the tools. 
> We need a much easier method to submit patches in the format that is defined 
> and checked.
> 
> Today it is way to hard to know every little internal format for every type 
> of patch. We need to fix this problem to make it easier to submit patches to 
> dpdk.org, we can not continue like this as we grow it will become way to much 
> work for the repo maintainers and the submitter.

That is why I am documenting what has been changed and the reasoning in
the mail, so I am hoping this is helping others that following the mail
list sync about rules. Also gives a discussion medium about rules..

> 
> Using a better tool then submitting via email seems like a better solution as 
> long as we can add the given checks to the tool. Using a tools should also 
> reduce the email traffic for most everyone, but we need to allow anyone to 
> ask for all of the commits to the repo or pull requests like patches.
> 
> How can we handle these types of issues in the future?
> 
>> )
> 
> Regards,
> Keith
> 



Re: [dpdk-dev] [PATCH] net/mlx5: add out of buffer counter to extended statistic

2017-02-17 Thread Ferruh Yigit
On 2/14/2017 2:31 PM, Shahaf Shuler wrote:
> This commit adds RX out of buffer counter to xstats report.
> The counter counts the number of dropped occurred due to lack of buffers
> on device RX queues.
> 
> Signed-off-by: Shahaf Shuler 
> Acked-by: Nelio Laranjeiro 

Applied to dpdk-next-net/master, thanks.


Re: [dpdk-dev] [PATCH v4] net/tap: fix coverity warning on strncpy

2017-02-17 Thread Wiles, Keith

> On Feb 17, 2017, at 10:48 AM, Yigit, Ferruh  wrote:
> 
> On 2/17/2017 4:34 PM, Wiles, Keith wrote:
>> 
>>> On Feb 17, 2017, at 10:21 AM, Yigit, Ferruh  wrote:
>>> 
>>> On 2/17/2017 3:43 PM, Keith Wiles wrote:
 Calling strncpy with a maximum size argument of 16 bytes on destination
 array "ifr.ifr_ifrn.ifrn_name" of size 16 bytes might leave the
 destination string unterminated.
 
 Signed-off-by: Keith Wiles 
>>> 
>>>   net/tap: fix possibly unterminated string
>>> 
>>>   Coverity issue: 1407499
>>>   Fixes: 6b38b2725cdb ("net/tap: fix multi-queue support")
>>>   Cc: sta...@dpdk.org
>>> 
>>> Applied to dpdk-next-net/master, thanks.
>>> 
>>> 
>>> (Updates:
>>> - patch title:
>>> It is preferred to mention from problem solved instead of the tool that
>>> found it.
>>> 
>>> - Added coverity tag:
>>> This helps to trace coverity issues, defined syntax is:
>>> 
>>>   Coverity issue: xxx
>>>   Fixes: 
>>> 
>>> - Added Cc: tag for stable tree:
>>> In case stable tree wants get this patch, to make patch visible.
>> 
>> I agree this is good, but to many rules not listed or checked in the tools. 
>> We need a much easier method to submit patches in the format that is defined 
>> and checked.
>> 
>> Today it is way to hard to know every little internal format for every type 
>> of patch. We need to fix this problem to make it easier to submit patches to 
>> dpdk.org, we can not continue like this as we grow it will become way to 
>> much work for the repo maintainers and the submitter.
> 
> That is why I am documenting what has been changed and the reasoning in
> the mail, so I am hoping this is helping others that following the mail
> list sync about rules. Also gives a discussion medium about rules..

Great, but we need to document this on the web site not in email. We can not 
expect someone (or a new person) to read millions of emails to find these 
hidden gems, right?

> 
>> 
>> Using a better tool then submitting via email seems like a better solution 
>> as long as we can add the given checks to the tool. Using a tools should 
>> also reduce the email traffic for most everyone, but we need to allow anyone 
>> to ask for all of the commits to the repo or pull requests like patches.
>> 
>> How can we handle these types of issues in the future?
>> 
>>> )
>> 
>> Regards,
>> Keith

Regards,
Keith



Re: [dpdk-dev] [PATCH v9] net/kni: add KNI PMD

2017-02-17 Thread Yong Wang
> -Original Message-
> From: Ferruh Yigit [mailto:ferruh.yi...@intel.com]
> Sent: Friday, February 17, 2017 6:30 AM
> To: Thomas Monjalon 
> Cc: dev@dpdk.org; John McNamara ; Yong
> Wang 
> Subject: Re: [PATCH v9] net/kni: add KNI PMD
> 
> On 2/17/2017 1:47 PM, Thomas Monjalon wrote:
> > 2017-02-17 13:42, Ferruh Yigit:
> >> Add KNI PMD which wraps librte_kni for ease of use.
> >>
> >> KNI PMD can be used as any regular PMD to send / receive packets to the
> >> Linux networking stack.
> >>
> >> Signed-off-by: Ferruh Yigit 
> >> Reviewed-by: Yong Wang 
> >> ---

I have the impression that Reviewed-by is good enough for a change to be 
accepted, which does not seem to be the case.

Acked-by: Yong Wang 

> >>
> >> v9:
> >> * update for 17.05
> >
> > You keep updating this patch in the hope that someone would be
> interested :)
> >
> > Please let's make clear that I am OK to merge it
> > but you asked me to wait for someone supporting its inclusion.
> 
> Right, it is good to mention that I explicitly asked to wait community
> response.
> 
> I keep updating it because I believe this is something useful.
> 
> Meanwhile adding this into repo means maintenance cost, so this should
> not be merged without any usecase or interest from community.
> 
> Patch is waiting for an ACK or NAK from community.
> 
> Thanks,
> ferruh


Re: [dpdk-dev] [RFC 0/8] mbuf: structure reorganization

2017-02-17 Thread Ananyev, Konstantin
Hi guys,

> -Original Message-
> From: Olivier Matz [mailto:olivier.m...@6wind.com]
> Sent: Friday, February 17, 2017 2:17 PM
> To: Jan Blunck 
> Cc: Ananyev, Konstantin ; dev@dpdk.org
> Subject: Re: [dpdk-dev] [RFC 0/8] mbuf: structure reorganization
> 
> Hi Jan,
> 
> On Fri, 17 Feb 2017 14:38:32 +0100, Jan Blunck 
> wrote:
> > On Fri, Feb 17, 2017 at 11:51 AM, Olivier Matz
> >  wrote:
> > > Hi Jan,
> > >
> > > On Thu, 16 Feb 2017 18:26:39 +0100, Jan Blunck
> > >  wrote:
> > >> On Thu, Feb 16, 2017 at 2:48 PM, Olivier Matz
> > >>  wrote:
> > >> > On Mon, 6 Feb 2017 18:41:27 +, "Ananyev, Konstantin"
> > >> >  wrote:
> > >> >> >
> > >> >> > The main changes are:
> > >> >> > - reorder structure to increase vector performance on some
> > >> >> > non-ia platforms.
> > >> >> > - add a 64bits timestamp field in the 1st cache line
> > >> >>
> > >> >> Wonder why it deserves to be in first cache line?
> > >> >> How it differs from seqn below (pure SW stuff right now).
> > >> >
> > >> > In case the timestamp is set from a NIC value, it is set in the
> > >> > Rx path. So that's why I think it deserve to be located in the
> > >> > 1st cache line.
> > >> >
> > >> > As you said, the seqn is a pure sw stuff right: it is set in a
> > >> > lib, not in a PMD rx path.
> > >> >
> > >>
> > >> If we talk about setting the timestamp value in the RX path this
> > >> implicitly means software timestamps. Hardware timestamping usually
> > >> works by letting the hardware inject sync events for coarse time
> > >> tracking and additionally injecting fine granular per-packet ticks
> > >> at a specific offset in the packet. Out of performance reasons I
> > >> don't think it makes sense to extract this during the burst and
> > >> write it into the mbuf again.
> > >
> > > From what I've understand, at least it does not work like this for
> > > mellanox NICs: timestamp is a metadata attached to a rx packet. But
> > > maybe they (and other NIC vendors interrested in the feature) can
> > > confirm or not.
> > >
> >
> > Mellanox NICs use a 48bit cycle counter split into a high and low
> > part. To convert the cycle values into a timestamp you need to
> > initialize and maintainer a timecounter that shifts the cycle count
> > e.g. nanosecs. IIRC Mellanox doesn't generate explicit clock events
> > but the cycle counter is large enough so that the user can easily
> > maintain the timecounter by manually updating it.
> >
> > >>
> > >> The problem with timestamps is to get the abstraction right wrt the
> > >> correction factors and the size of the tick vs. the timestamp in
> > >> the events injected. From my perspective it would be better to
> > >> extract the handling of timestamp data into a library with PMD
> > >> specific implementation of the conversions. That way the
> > >> normalized timestamp values can get extracted if they are present.
> > >> The mbuf itself would only indicate the presence of timestamp
> > >> metadata in that case.
> > >
> > > I agree however that we need to properly define the meaning of this
> > > field. My idea is:
> > >
> > > - the timestamp is in nanosecond
> > > - the reference is always the same for a given path: if the
> > > timestamp is set in a PMD, all the packets for this PMD will have
> > > the same reference, but for 2 different PMDs (or a sw lib), the
> > > reference would not be the same.
> > >
> > > I think it's enough for many use cases.
> > > We can later add helpers to compare timestamps with different
> > > references.
> >
> > My point is that I still doubt that it belongs into the first
> > cacheline. It requires accessing other structures for converting into
> > nanoseconds anyway. Optimally I would like to see this happening on
> > access instead but if that isn't achievable at least in a second step.
> 
> Sorry, I don't really get your point. My comprehension of the timestamp
> usage in a PMD is as following:
> 
> rx_burst(struct rxq *rxq, ...)
> {
>   unsigned long factor = rxq->timestamp_factor;
>   unsigned port = rxq->port;
> 
>   for each hw_desc {
>   m = rte_pktmbuf_alloc(rxq->pool);
>   m->len = hw_desc->len;
>   m->port = port;
>   m->ol_flags =
>   ...
>   m->timestamp = hw_desc->timestamp * factor;
>   }
>   ...
> }
> 
> In that case, I think it deserves to be in the 1st cache line.

So you are saying that:
- for some HW that DPDK supports (mlx?) timestamp information
Is available in HW RX descriptor
- and as soon timestamp field will be available in mbuf, you plan
to populate it using this HW RXD field.
Is that so?
Konstantin


Re: [dpdk-dev] [PATCH v4] eal: Support running as unprivileged user

2017-02-17 Thread Stephen Hemminger
On Tue, 31 Jan 2017 10:44:53 -0700
Ben Walker  wrote:

> + if (physaddr == RTE_BAD_PHYS_ADDR) {
>   RTE_LOG(ERR, EAL,
> - "Cannot open /proc/self/pagemap: %s. "
> - "virt2phys address translation will not work\n",
> + "Cannot obtain physical addresses: %s. "
> + "Only vfio will function.\n",

Please don't split a single error message across multiple lines. It makes
it harder for user to find the source code lines with simple grep.
Better to just have one long line for format string, or better yet be less 
wordy.

Yes, the existing DPDK code has lots of these issues.


Re: [dpdk-dev] [PATCH] eventdev: event device to contain rte device holder

2017-02-17 Thread Stephen Hemminger
On Thu, 16 Feb 2017 16:22:29 +0530
Nipun Gupta  wrote:

> Signed-off-by: Nipun Gupta 
> 
> rte_device is a generic device which is available to the applications
> and EAL. This patch replaces rte_pci_device in 'struct rte_eventdev'
> and in 'struct rte_event_dev_info' with common rte_device.
> ---
>  drivers/event/skeleton/skeleton_eventdev.c | 2 +-
>  lib/librte_eventdev/rte_eventdev.c | 6 +++---
>  lib/librte_eventdev/rte_eventdev.h | 6 +++---
>  3 files changed, 7 insertions(+), 7 deletions(-)
> 
> diff --git a/drivers/event/skeleton/skeleton_eventdev.c 
> b/drivers/event/skeleton/skeleton_eventdev.c
> index dee0faf..770dce3 100644
> --- a/drivers/event/skeleton/skeleton_eventdev.c
> +++ b/drivers/event/skeleton/skeleton_eventdev.c
> @@ -383,7 +383,7 @@
>   if (rte_eal_process_type() != RTE_PROC_PRIMARY)
>   return 0;
>  
> - pci_dev = eventdev->pci_dev;
> + pci_dev = RTE_DEV_TO_PCI(eventdev->dev);

How will this work when there are more than just PCI devices?
For example, upcoming patches will add rte_vmbus_device.  There is no
run time type checking in C.


Re: [dpdk-dev] [PATCH v9] net/kni: add KNI PMD

2017-02-17 Thread Thomas Monjalon
2017-02-17 17:52, Yong Wang:
> From: Ferruh Yigit [mailto:ferruh.yi...@intel.com]
> > On 2/17/2017 1:47 PM, Thomas Monjalon wrote:
> > > 2017-02-17 13:42, Ferruh Yigit:
> > >> Add KNI PMD which wraps librte_kni for ease of use.
> > >>
> > >> KNI PMD can be used as any regular PMD to send / receive packets to the
> > >> Linux networking stack.
> > >>
> > >> Signed-off-by: Ferruh Yigit 
> > >> Reviewed-by: Yong Wang 
> > >> ---
> 
> I have the impression that Reviewed-by is good enough for a change to be 
> accepted, which does not seem to be the case.
> 
> Acked-by: Yong Wang 

Sorry, it is not what I meant.
Your Reviewed-by is really strong to make confident the patch is good.
But Ferruh wanted to make sure more people wants this PMD.



Re: [dpdk-dev] [PATCHv7 00/47] NXP DPAA2 PMD

2017-02-17 Thread Neil Horman
On Fri, Feb 17, 2017 at 01:17:26PM +0100, Vincent JARDIN wrote:
> Le 17/02/2017 à 13:13, Bruce Richardson a écrit :
> > If it builds without an SDK dependency I'd be happy enough to see this
> > merged into DPDK.
> 
> +1, the patch is clean enough to be compiled.
> 
> Boards or CPUs can rely on specific SDK, firmware, etc., it is up to the
> vendors.
> 
I absolutely disagree with this.  Its completely anathema to the reason open
source code is beneficial to the community that maintains it.  By allowing code
like this into the project, you've tacitly agree to do regular maintenence on
their code for them, without the reciprocating benefit of being able to use
their hardware free of additional license terms.  If you want to have a non-open
driver that works with an open source project, thats fine, but keep it out of
tree, and maintain it yourself.  What you've esentially asked for here is for
the dpdk community to be some free labor, when APIS and such change.  No thank
you.

I re-iterate my NAK.

> regards,
>   Vincent
> 


Re: [dpdk-dev] [RFC 0/8] mbuf: structure reorganization

2017-02-17 Thread Andrew Rybchenko

On 02/17/2017 04:51 PM, Jan Blunck wrote:

On Fri, Feb 17, 2017 at 1:49 PM, Nélio Laranjeiro
 wrote:

Hi Olivier, Jan,

On Fri, Feb 17, 2017 at 11:51:53AM +0100, Olivier Matz wrote:

Hi Jan,

On Thu, 16 Feb 2017 18:26:39 +0100, Jan Blunck 
wrote:

If we talk about setting the timestamp value in the RX path this
implicitly means software timestamps. Hardware timestamping usually
works by letting the hardware inject sync events for coarse time
tracking and additionally injecting fine granular per-packet ticks at
a specific offset in the packet. Out of performance reasons I don't
think it makes sense to extract this during the burst and write it
into the mbuf again.

 From what I've understand, at least it does not work like this for
mellanox NICs: timestamp is a metadata attached to a rx packet. But
maybe they (and other NIC vendors interrested in the feature) can
confirm or not.

Olivier is right, this timestamp information is returned by the hardware
as the other fields describing the Rx packet (length, RSS hash, checksum
...).  The PMD only copy it into the Mbuf.


Indeed, for Mellanox the timestamp is stored in the CQ entry.
Solarflares write it relative to the packet header.


Confirmed. We have pseudo-header just before the packet itself and 
timestamp is put to pseudo-header by the HW.