You may need to set up huge pages on the kernel boot line (this is an example;
you may need to adjust it):
The huge page configuration can be added to the default configuration file
/etc/default/grub by appending it to GRUB_CMDLINE_LINUX, and the grub
configuration file regenerated to get an updated configurati
I don't think you have researched this enough.
Asking this question shows that you are just beginning your research or do not
understand how this fits into current telco NFV/SDN efforts.
Why does this exist: "OpenDataPlane using DPDK for Intel NIC", listed below?
Why would competing technologi
...@dpdk.org] On Behalf Of Polehn, Mike A
Sent: Wednesday, December 2, 2015 8:32 AM
To: Kury Nicolas; dev at dpdk.org
Subject: Re: [dpdk-dev] Does anybody know OpenDataPlane
Prefetches make a big difference because a powerful CPU like IA is always trying
to find items to prefetch and the priority of these is not always easy to
determine. This is especially a problem across subroutine calls since the
compiler cannot determine what is of priority in the other subroutin
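
As a rough illustration of an explicit software prefetch, the sketch below
walks an array of pointers and asks the CPU to start loading an item a few
iterations before it is needed. It relies on the GCC/Clang builtin
__builtin_prefetch; struct item, sum_items and PREFETCH_AHEAD are hypothetical
names made up for this example and are not part of DPDK.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical record type, used only for this illustration. */
struct item {
    uint64_t value;
};

/* Assumed prefetch distance; the best value depends on the workload. */
#define PREFETCH_AHEAD 4

/* Sum all values, prefetching the item needed a few iterations later. */
static uint64_t sum_items(struct item *const *items, size_t n)
{
    uint64_t total = 0;

    assert(items != NULL || n == 0);
    for (size_t i = 0; i < n; i++) {
        /* Read prefetch (0) with high temporal locality (3). */
        if (i + PREFETCH_AHEAD < n)
            __builtin_prefetch(items[i + PREFETCH_AHEAD], 0, 3);
        total += items[i]->value;
    }
    return total;
}
```

The prefetch is only a hint, so the result is the same with or without it;
whether it helps has to be measured on the target CPU.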
SMP operations can be very expensive, sometimes costing 100s to 1000s of clock
cycles depending on the circumstances of the synchronization. It is how you
arrange the SMP operations within the tasks at
hand across the SMP cores that gives methods for top performance. Us
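
One possible way to arrange synchronization more cheaply, sketched here under
assumptions of my own and not taken from the original message, is to count
locally and publish to the shared variable once per burst instead of once per
packet. The example uses C11 atomics; count_per_packet and count_per_burst are
hypothetical names.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stddef.h>
#include <stdint.h>

/* Costly pattern: one atomic read-modify-write on shared data per packet. */
static void count_per_packet(atomic_uint_fast64_t *shared, size_t burst)
{
    assert(shared != NULL);
    for (size_t i = 0; i < burst; i++)
        atomic_fetch_add_explicit(shared, 1, memory_order_relaxed);
}

/* Cheaper pattern: count in a local variable, publish once per burst. */
static void count_per_burst(atomic_uint_fast64_t *shared, size_t burst)
{
    uint_fast64_t local = 0;

    assert(shared != NULL);
    for (size_t i = 0; i < burst; i++)
        local++; /* stands in for the per-packet work */
    atomic_fetch_add_explicit(shared, local, memory_order_relaxed);
}
```

Both functions leave the same total in the shared counter; the second touches
the contended cache line once per burst rather than once per packet.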
Jumbo frames are generally handled by linked lists (but called something else)
of mbufs.
Enabling jumbo frames for the device driver should enable the right portion of
the driver which handles the linked lists.
Don't make the mbufs huge.
Mike
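
To illustrate the chained-buffer idea, here is a minimal sketch using a
hypothetical struct seg as a stand-in for a packet buffer segment (it is not
the real rte_mbuf definition). A jumbo frame is spread over several
normal-sized segments linked through a next pointer, and the frame length is
the sum of the segment lengths.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical stand-in for one buffer segment (not the real rte_mbuf). */
struct seg {
    struct seg *next;     /* next segment of the same frame, or NULL */
    uint16_t    data_len; /* bytes stored in this segment */
};

/* Total frame length: the sum of data_len over the whole chain. */
static uint32_t frame_len(const struct seg *head)
{
    uint32_t total = 0;

    assert(head != NULL);
    for (const struct seg *s = head; s != NULL; s = s->next)
        total += s->data_len;
    return total;
}

/* Number of fixed-size segments needed for a frame (ceiling division). */
static uint32_t segs_needed(uint32_t frame_bytes, uint32_t seg_bytes)
{
    assert(seg_bytes > 0);
    return (frame_bytes + seg_bytes - 1) / seg_bytes;
}
```

With an assumed segment size of 2048 bytes, a 9000 byte frame takes 5
segments, so the buffers themselves can stay small.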
In this example, the control network interface 00:03.0 is not bound to the UIO
driver and remains attached to the Linux device driver (for ssh access with
PuTTY); only the target interfaces are bound.
Below, it shows all 3 interfaces bound to the uio driver, which are not usable
until a task uses the UIO driv
Combined 2 subroutines of code into one subroutine with one read operation
followed by a buffer allocate and load loop.
Eliminated the staging queue and subroutine, which removed extra pointer list
movements and reduced the number of active variable cache pages during the call.
Reduced queue positi
Prefetch of interface access variables while calling into driver RX and TX
subroutines.
For converging zero loss packet task tests, a small drop in latency for zero
loss measurements and a small drop in lost packet counts for the lossy
measurement points were observed,
indicating some savings of
Added a check for a minimum allocation count of 2 packets, to eliminate the
extra overhead of supporting prefetch when only one packet is allocated into
the queue at a time.
Used some standard variables to help reduce overhead of non-standard variable
sizes.
Added second level
Reduced the 32 packet list size focus for better packet list size range handling.
Changed maximum new buffer loop process size to the NIC queue free buffer count
per loop.
Removed redundant single call check to just one call with focused loop.
Remove NIC register update write from per loop to on
Added packet memory prefetch for faster access to variables inside the packet
buffer needed for the free operation.
Signed-off-by: Mike A. Polehn
diff --git a/drivers/net/i40e/i40e_rxtx.c b/drivers/net/i40e/i40e_rxtx.c
index 177fb2e..d9bc30a 100644
--- a/drivers/net/i40e/i40e_rxtx.c
+++ b/drivers/
, October 28, 2015 3:45 AM
To: Polehn, Mike A
Cc: dev at dpdk.org
Subject: Re: [dpdk-dev] [Patch] Eth Driver: Optimization for improved NIC
processing rates
On Tue, Oct 27, 2015 at 08:56:31PM +, Polehn, Mike A wrote:
> Prefetch of interface access variables while calling into driver R
Adds Eth driver prefetch variable structure to CPU cache 0 while calling into
tx or rx device driver operation.
RFC 2544 test of NIC task test measurement points show improvement of lower
latency and/or better packet throughput indicating clock cycles saved.
Signed-off-by: Mike A. Polehn
dif
I used the following code snippet with the i40e device; with a 1 second sample
time it had very high accuracy for IPv4 UDP packets:
#define FLOWD_PERF_PACKET_OVERHEAD 24 /* CRC + Preamble + SOF + Interpacket gap */
#define FLOWD_REF_NETWORK_SPEED 10e9
double Ave_Bytes_per_Packet, Data_Rate, Net_R
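
The snippet above is cut off, so the following is only a sketch of how such a
calculation could look and is not the original code. It takes packet and byte
counts sampled over an interval, adds the 24 byte per-packet overhead, and
compares the result with the 10 Gb/s reference speed. It assumes the byte
counter does not include the 4 byte CRC (which the 24 byte overhead figure
suggests); struct rate_sample and compute_rate are hypothetical names.

```c
#include <assert.h>
#include <stdint.h>

#define FLOWD_PERF_PACKET_OVERHEAD 24   /* CRC + Preamble + SOF + Interpacket gap */
#define FLOWD_REF_NETWORK_SPEED    10e9 /* reference link speed, bits per second */

/* Hypothetical result holder for one sample interval. */
struct rate_sample {
    double avg_bytes_per_packet; /* as counted, without wire overhead */
    double data_rate_bps;        /* counted bytes as bits per second */
    double wire_rate_bps;        /* including per-packet wire overhead */
    double utilization;          /* fraction of the reference speed */
};

/* Derive rates from packet and byte counts taken over `seconds`. */
static struct rate_sample compute_rate(uint64_t packets, uint64_t bytes,
                                       double seconds)
{
    struct rate_sample r = {0};

    assert(seconds > 0.0);
    if (packets == 0)
        return r; /* nothing received: all rates stay zero */

    r.avg_bytes_per_packet = (double)bytes / (double)packets;
    r.data_rate_bps = (double)bytes * 8.0 / seconds;
    r.wire_rate_bps = ((double)bytes +
                       (double)packets * FLOWD_PERF_PACKET_OVERHEAD)
                      * 8.0 / seconds;
    r.utilization = r.wire_rate_bps / FLOWD_REF_NETWORK_SPEED;
    return r;
}
```

For example, one million 60 byte packets in one second gives 480 Mb/s of
counted data and 672 Mb/s on the wire, or about 6.7% of a 10 Gb/s link.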
t all depends on exact algorithm and end scaling.
Being able to do high-quality integer processing is a good skill, especially
when doing work like signal processing.
-Original Message-
From: Wiles, Keith
Sent: Tuesday, November 3, 2015 11:01 AM
To: Polehn, Mike A; Van Haaren,
), to submit
patches into dpdk.org user account, with patches signed with my official email
address (which is different than the sending email address which is just junk
name)?
Mike
-Original Message-
From: Polehn, Mike A
Sent: Tuesday, November 3, 2015 12:17 PM
To: St Leger, Jim
ery tricky and often is not any faster than floating point
math when using multiplies except on the very low performance processors.
Mike
From: ??? [mailto:pnk...@naver.com]
Sent: Tuesday, November 3, 2015 5:45 PM
To: Polehn, Mike A; Wiles, Keith; Van Haaren, Harry; dev at dpdk.org
Subject: RE: [d
I can think of a very good reason to want to know if the device is VF or PF.
The VF has to go through a layer 2 switch, which does not allow it to just
receive anything coming across the Ethernet.
The PF can receive all the packets, including packets with different NIC
addresses. This allows the packet
: Thursday, November 5, 2015 7:51 AM
To: Polehn, Mike A; Shaham Fridenberg
Cc: dev at dpdk.org
Subject: RE: [dpdk-dev] SR-IOV: API to tell VF from PF
> -Original Message-
> From: Polehn, Mike A
> Sent: Thursday, November 5, 2015 3:43 PM
> To: Richardson, Bruce ; Shaham Fridenb
Adds ethdev driver prefetch of variable structure to CPU cache 0
while calling into tx or rx device driver operation.
RFC 2544 test of NIC task test measurement points show improvement
of lower latency and/or better packet throughput indicating clock
cycles saved.
Signed-off-by: Mike A. Polehn
-
It is probably the usual MS operation issues, I'll resubmit.
-Original Message-
From: Stephen Hemminger [mailto:step...@networkplumber.org]
Sent: Tuesday, November 10, 2015 9:03 AM
To: Polehn, Mike A
Cc: dev at dpdk.org
Subject: Re: [dpdk-dev] [PATCH v2] ethdev: Prefetch driver var