From: Robert Hoo <robert...@intel.com> It's hard to benchmark 40G+ network bandwidth using ordinary tools like iperf, netperf (see reference 1). Pktgen, packet generator from Kernel sapce, shall be a candidate. I derived this NUMA awared irq affinity sample script from multi-queue sample02, successfully benchmarked 40G link. I think this can also be useful for 100G reference, though I haven't got device to test yet.
This script simply does: Detect $DEV's NUMA node belonging. Bind each thread (processor of NUMA locality) with each $DEV queue's irq affinity, 1:1 mapping. How many '-t' threads input determines how many queues will be utilized. If '-f' designates first cpu id, then offset in the NUMA node's cpu list. Tested with Intel XL710 NIC with Cisco 3172 switch. Referrences: https://people.netfilter.org/hawk/presentations/LCA2015/net_stack_challenges_100G_LCA2015.pdf http://www.intel.cn/content/dam/www/public/us/en/documents/reference-guides/xl710-x710-performance-tuning-linux-guide.pdf Change log v2: Rebased to https://github.com/netoptimizer/network-testing/tree/master/pktgen Move helper functions to functions.sh More concise shell grammar usage Take '-f' parameter into consideration. If the first CPU is designaed, offset in the NUMA-aware CPU list. Use err(), info() helper functions for such outputs. Robert Hoo (2): Add some helper functions Add pktgen script: pktgen_sample06_numa_awared_queue_irq_affinity.sh pktgen/functions.sh | 44 ++++++++++ ...tgen_sample06_numa_awared_queue_irq_affinity.sh | 97 ++++++++++++++++++++++ 2 files changed, 141 insertions(+) create mode 100755 pktgen/pktgen_sample06_numa_awared_queue_irq_affinity.sh -- 1.8.3.1