[dpdk-dev] DPDK OVS on Ubuntu 14.04

2015-12-01 Thread Polehn, Mike A
You may need to set up huge pages on the kernel boot line (this is an example; you may need to adjust it): The huge page configuration can be added to the default configuration file /etc/default/grub by adding it to GRUB_CMDLINE_LINUX, and the grub configuration file regenerated to get an updated configurati

[dpdk-dev] Does anybody know OpenDataPlane

2015-12-02 Thread Polehn, Mike A
I don't think you have researched this enough. Asking this question shows that you are just beginning your research or do not understand how this fits into current telco NFV/SDN efforts. Why does this exist: "OpenDataPlane using DPDK for Intel NIC", listed below? Why would competing technologi

[dpdk-dev] Does anybody know OpenDataPlane

2015-12-02 Thread Polehn, Mike A
...@dpdk.org] On Behalf Of Polehn, Mike A Sent: Wednesday, December 2, 2015 8:32 AM To: Kury Nicolas; dev at dpdk.org Subject: Re: [dpdk-dev] Does anybody know OpenDataPlane I don't think you have researched this enough. Asking this question shows that you are just beginning your research or d

[dpdk-dev] rte_prefetch0() is effective?

2016-01-13 Thread Polehn, Mike A
Prefetches make a big difference because a powerful CPU like IA is always trying to find items to prefetch, and the priority of these is not always easy to determine. This is especially a problem across subroutine calls, since the compiler cannot determine what has priority in the other subroutin
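A minimal sketch of the idea, not taken from the thread: issue a prefetch for the next item while working on the current one, so the load has time to complete before the data is needed. The struct pkt type and sum_lengths() function are hypothetical, and the GCC/Clang builtin __builtin_prefetch is used here as a stand-in for rte_prefetch0() so the example compiles without DPDK.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical packet descriptor, standing in for a real buffer type. */
struct pkt {
    uint32_t len;
    uint8_t  data[64];
};

/*
 * Sum the lengths of n packets. While packet i is being processed,
 * a prefetch hint is issued for packet i + 1.
 */
uint64_t sum_lengths(struct pkt *const *pkts, size_t n)
{
    uint64_t total = 0;

    assert(pkts != NULL || n == 0);

    for (size_t i = 0; i < n; i++) {
        if (i + 1 < n) {
            /* rw = 0 (read), locality = 3 (keep in all cache levels).
             * GCC/Clang builtin; assumed here to play the role of
             * rte_prefetch0() in DPDK code. */
            __builtin_prefetch(pkts[i + 1], 0, 3);
        }
        total += pkts[i]->len;
    }
    return total;
}
```

The prefetch is only a hint and does not change the result; whether it helps depends on how much work is done per item and on whether the data is already in cache, so it is worth measuring in each case.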

[dpdk-dev] [PATCH] vhost: remove lockless enqueue to the virtio ring

2016-01-19 Thread Polehn, Mike A
SMP operations can be very expensive and can sometimes cost 100s to 1000s of clock cycles, depending on the circumstances of the synchronization. It is how you arrange the SMP operations within the tasks at hand across the SMP cores that gives methods for top performance. Us

[dpdk-dev] rte_mbuf size for jumbo frame

2016-01-26 Thread Polehn, Mike A
Jumbo frames are generally handled by linked lists (but called something else) of mbufs. Enabling jumbo frames for the device driver should enable the right portion of the driver which handles the linked lists. Don't make the mbufs huge. Mike -----Original Message----- From: dev [mailto:dev-bou
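To illustrate the linked-list handling described above, here is a rough sketch with a hypothetical struct seg type. It only mimics the chaining idea (a next pointer plus a per-segment data length) and is not the real rte_mbuf layout or API. The helper names segs_needed() and chain_pkt_len() are also made up for this example.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical segment type; only mimics the chaining idea of an mbuf. */
struct seg {
    struct seg *next;      /* next segment of the same frame, or NULL */
    uint16_t    data_len;  /* bytes of frame data held in this segment */
};

/* Number of fixed-size segments needed to hold a frame (rounds up). */
uint32_t segs_needed(uint32_t frame_len, uint32_t seg_size)
{
    assert(seg_size > 0);
    return (frame_len + seg_size - 1) / seg_size;
}

/* Total frame length: walk the chain and add up each segment's data. */
uint32_t chain_pkt_len(const struct seg *head)
{
    uint32_t total = 0;

    for (const struct seg *s = head; s != NULL; s = s->next)
        total += s->data_len;
    return total;
}
```

With 2048-byte segments, a 9000-byte frame would occupy five chained segments (four full ones and one holding 808 bytes), so the buffer pool can keep using ordinary-sized buffers.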

[dpdk-dev] [Dpdk-ovs] problem in binding interfaces of virtio-pci on the VM

2015-02-26 Thread Polehn, Mike A
In this example, the control network interface, 00:03.0, is not bound to the UIO driver and stays attached to the Linux device driver (for ssh access with putty); only the target interfaces are bound. Below, it shows all 3 interfaces bound to the uio driver, which are not usable until a task uses the UIO driv

[dpdk-dev] [Patch 1/2] i40e RX Bulk Alloc: Larger list size (33 to 128) throughput optimization

2015-10-27 Thread Polehn, Mike A
Combined 2 subroutines of code into one subroutine with one read operation followed by a buffer allocate and load loop. Eliminated the staging queue and subroutine, which removed extra pointer list movements and reduced the number of active variable cache pages during the call. Reduced queue positi

[dpdk-dev] [Patch] Eth Driver: Optimization for improved NIC processing rates

2015-10-27 Thread Polehn, Mike A
Prefetch of interface access variables while calling into driver RX and TX subroutines. For converging zero loss packet task tests, a small drop in latency for zero loss measurements and small drop in lost packet counts for the lossy measurement points was observed, indicating some savings of

[dpdk-dev] [Patch 2/2] i40e rx Bulk Alloc: Larger list size (33 to 128) throughput optimization

2015-10-27 Thread Polehn, Mike A
Added check of minimum of 2 packet allocation count to eliminate the extra overhead for supporting prefetch for the case of checking for only one packet allocated into the queue at a time. Used some standard variables to help reduce overhead of non-standard variable sizes. Added second level

[dpdk-dev] [Patch 1/2] i40e simple tx: Larger list size (33 to 128) throughput optimization

2015-10-27 Thread Polehn, Mike A
Reduce the 32 packet list size focus for better packet list size range handling. Changed maximum new buffer loop process size to the NIC queue free buffer count per loop. Removed redundant single call check to just one call with focused loop. Remove NIC register update write from per loop to on

[dpdk-dev] [Patch 2/2] i40e simple tx: Larger list size (33 to 128) throughput optimization

2015-10-27 Thread Polehn, Mike A
Added packet memory prefetch for faster access to variables inside packet buffer needed for the free operation. Signed-off-by: Mike A. Polehn diff --git a/drivers/net/i40e/i40e_rxtx.c b/drivers/net/i40e/i40e_rxtx.c index 177fb2e..d9bc30a 100644 --- a/drivers/net/i40e/i40e_rxtx.c +++ b/drivers/

[dpdk-dev] [Patch] Eth Driver: Optimization for improved NIC processing rates

2015-10-28 Thread Polehn, Mike A
, October 28, 2015 3:45 AM To: Polehn, Mike A Cc: dev at dpdk.org Subject: Re: [dpdk-dev] [Patch] Eth Driver: Optimization for improved NIC processing rates On Tue, Oct 27, 2015 at 08:56:31PM +, Polehn, Mike A wrote: > Prefetch of interface access variables while calling into driver R

[dpdk-dev] [Patch v2] Eth driver optimization: Prefetch variable structure

2015-11-03 Thread Polehn, Mike A
Adds Eth driver prefetch variable structure to CPU cache 0 while calling into tx or rx device driver operation. RFC 2544 test of NIC task test measurement points show improvement of lower latency and/or better packet throughput indicating clock cycles saved. Signed-off-by: Mike A. Polehn dif

[dpdk-dev] How can I calculate/estimate pps(packet per seocond) and bps(bit per second) in DPDK pktg

2015-11-03 Thread Polehn, Mike A
I used the following code snippet with the i40e device; with a 1 second sample time it had very high accuracy for IPv4 UDP packets: #define FLOWD_PERF_PACKET_OVERHEAD 24 /* CRC + Preamble + SOF + Interpacket gap */ #define FLOWD_REF_NETWORK_SPEED 10e9 double Ave_Bytes_per_Packet, Data_Rate, Net_R
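The snippet above is cut off in the archive, so the following is only a sketch of the kind of calculation it points at, not the original code. The two #define values come from the message; the struct rate_sample type, the compute_rates() function, and the assumption that the byte counter excludes the 4-byte CRC (since the overhead constant already counts it) are mine.

```c
#include <assert.h>
#include <stdint.h>

#define FLOWD_PERF_PACKET_OVERHEAD 24   /* CRC + Preamble + SOF + Interpacket gap */
#define FLOWD_REF_NETWORK_SPEED    10e9 /* reference link speed, bits per second */

/* Hypothetical result type for one sample interval. */
struct rate_sample {
    double pps;          /* packets per second */
    double data_bps;     /* counted packet bytes only, bits per second */
    double net_bps;      /* including per-packet wire overhead, bits per second */
    double utilization;  /* net_bps as a fraction of the reference speed */
};

/*
 * pkts and bytes are the counter deltas over the sample interval.
 * Assumes the byte counter does not include the CRC.
 */
struct rate_sample compute_rates(uint64_t pkts, uint64_t bytes, double interval_s)
{
    struct rate_sample r;

    assert(interval_s > 0.0);

    r.pps = (double)pkts / interval_s;
    r.data_bps = (double)bytes * 8.0 / interval_s;
    r.net_bps = ((double)bytes +
                 (double)pkts * FLOWD_PERF_PACKET_OVERHEAD) * 8.0 / interval_s;
    r.utilization = r.net_bps / FLOWD_REF_NETWORK_SPEED;
    return r;
}
```

As a sanity check on the constant: a minimum-size frame is 60 bytes without CRC, so each packet takes 84 bytes (672 bits) on the wire, and 10e9 / 672 gives about 14.88 million packets per second at full line rate.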

[dpdk-dev] How can I calculate/estimate pps(packet per seocond) and bps(bit per second) in DPDK pktg

2015-11-03 Thread Polehn, Mike A
t all depends on exact algorithm and end scaling. Being able to do high quality integer processing is a good skill, especially when doing work like signal processing. -----Original Message----- From: Wiles, Keith Sent: Tuesday, November 3, 2015 11:01 AM To: Polehn, Mike A; Van Haaren,

[dpdk-dev] FW: [Patch v2] Eth driver optimization: Prefetch variable structure

2015-11-03 Thread Polehn, Mike A
), to submit patches into dpdk.org user account, with patches signed with my official email address (which is different than the sending email address which is just junk name)? Mike -----Original Message----- From: Polehn, Mike A Sent: Tuesday, November 3, 2015 12:17 PM To: St Leger, Jim

[dpdk-dev] How can I calculate/estimate pps(packet per seocond) and bps(bit per second) in DPDK pktg

2015-11-04 Thread Polehn, Mike A
ery tricky and often is not any faster than floating point math when using multiplies except on the very low performance processors. Mike From: ??? [mailto:pnk...@naver.com] Sent: Tuesday, November 3, 2015 5:45 PM To: Polehn, Mike A; Wiles, Keith; Van Haaren, Harry; dev at dpdk.org Subject: RE: [d

[dpdk-dev] SR-IOV: API to tell VF from PF

2015-11-05 Thread Polehn, Mike A
I can think of a very good reason to want to know if the device is VF or PF. The VF has to go through a layer 2 switch, not allowing it to just receive anything coming across the Ethernet. The PF can receive all the packets, including packets with different NIC addresses. This allows the packet

[dpdk-dev] SR-IOV: API to tell VF from PF

2015-11-05 Thread Polehn, Mike A
: Thursday, November 5, 2015 7:51 AM To: Polehn, Mike A; Shaham Fridenberg Cc: dev at dpdk.org Subject: RE: [dpdk-dev] SR-IOV: API to tell VF from PF > -----Original Message----- > From: Polehn, Mike A > Sent: Thursday, November 5, 2015 3:43 PM > To: Richardson, Bruce ; Shaham Fridenb

[dpdk-dev] [PATCH v2] ethdev: Prefetch driver variable structure

2015-11-10 Thread Polehn, Mike A
Adds ethdev driver prefetch of variable structure to CPU cache 0 while calling into tx or rx device driver operation. RFC 2544 test of NIC task test measurement points show improvement of lower latency and/or better packet throughput indicating clock cycles saved. Signed-off-by: Mike A. Polehn -

[dpdk-dev] [PATCH v2] ethdev: Prefetch driver variable structure

2015-11-11 Thread Polehn, Mike A
It is probably the usual MS operation issues, I'll resubmit. -----Original Message----- From: Stephen Hemminger [mailto:step...@networkplumber.org] Sent: Tuesday, November 10, 2015 9:03 AM To: Polehn, Mike A Cc: dev at dpdk.org Subject: Re: [dpdk-dev] [PATCH v2] ethdev: Prefetch driver var

[dpdk-dev] [PATCH v2] ethdev: Prefetch driver variable structure

2015-11-11 Thread Polehn, Mike A
Adds ethdev driver prefetch of variable structure to CPU cache 0 while calling into tx or rx device driver operation. RFC 2544 test of NIC task test measurement points show improvement of lower latency and/or better packet throughput indicating clock cycles saved. Signed-off-by: Mike A. Polehn -