Hi,
> -----Original Message----- > From: wei.guo.si...@gmail.com [mailto:wei.guo.si...@gmail.com] > Sent: Saturday, January 13, 2018 10:35 AM > To: Lu, Wenzhuo <wenzhuo...@intel.com> > Cc: dev@dpdk.org; Thomas Monjalon <tho...@monjalon.net>; Simon Guo > <wei.guo.si...@gmail.com> > Subject: [PATCH v5] app/testpmd: add option ring-bind-lcpu to bind Q with > CPU > > From: Simon Guo <wei.guo.si...@gmail.com> > > Currently the rx/tx queue is allocated from the buffer pool on socket of: > - port's socket if --port-numa-config specified > - or ring-numa-config setting per port > > All the above will "bind" queue to single socket per port configuration. > But it can actually archieve better performance if one port's queue can be > spread across multiple NUMA nodes, and the rx/tx queue is allocated per > lcpu socket. > > This patch adds a new option "--ring-bind-lcpu"(no parameter). With this, > testpmd can utilize the PCI-e bus bandwidth on another NUMA nodes. > > When --port-numa-config or --ring-numa-config option is specified, this -- > ring-bind-lcpu option will be suppressed. > > Test result: > 64bytes package, running in PowerPC with Mellanox > CX-4 card, single port(100G), with 8 cores, fw mode: > - Without this patch: 52.5Mpps throughput > - With this patch: 66Mpps throughput > ~25% improvement > > Signed-off-by: Simon Guo <wei.guo.si...@gmail.com> Acked-by: Wenzhuo Lu <wenzhuo...@intel.com>