> Controlling fanout more is a good idea but not sure what this patch > is trying to do with it.
I am maintaining a DPDK application which should run on a large number of setups. Unfortunately, I do not have a lot of control over the environment the application runs on (e.g kernel). The problem I was trying to solve is that the Af_Packet PMD seems to fail to initialize properly on some setups. I managed to reproduce the issue with testpmd. sudo ./x86_64-native-linuxapp-gcc/app/dpdk-testpmd \ -l 0-3 \ -m 1024 \ --no-huge \ --no-shconf \ --no-pci \ --vdev=net_af_packet0,iface=eth1,blocksz=4096,framesz=2048,framecnt=512,qpairs=1,qdisc_bypass=0 \ --vdev=net_af_packet1,iface=eth2,blocksz=4096,framesz=2048,framecnt=512,qpairs=1,qdisc_bypass=0 \ -- \ -i Error: Unknown device type. EAL: Detected CPU lcores: 8 EAL: Detected NUMA nodes: 1 EAL: Detected static linkage of DPDK EAL: Selected IOVA mode 'VA' AFPACKET: rte_pmd_init_internals(): net_af_packet0: could not set PACKET_FANOUT on AF_PACKET socket for eth1:Invalid argument VDEV_BUS: vdev_probe(): failed to initialize net_af_packet0 device AFPACKET: rte_pmd_init_internals(): net_af_packet1: could not set PACKET_FANOUT on AF_PACKET socket for eth1:Invalid argument VDEV_BUS: vdev_probe(): failed to initialize net_af_packet1 device EAL: Bus (vdev) probe failed. testpmd: No probed ethernet devices It seems that the call to PACKET_FANOUT setsockopt failed with EINVAL. I debugged the issue further. It seems that the root cause is that when the interface is down (e.g eth1), after the bind operation succeeds, the PACKET_FANOUT setsockopt will fail. This is a behavior of AF_Packet in Linux which I'm not sure is that well documented. It seems to be fixed in a recent Linux Kernel commit. I re-compiled the kernel with the change, and did not see the issue afterwards. Commit 2cee3e6e2e4b74bec96694169f01cd3feec1f264 af_packet: allow fanout_add when socket is not RUNNING [1] https://github.com/torvalds/linux/commit/2cee3e6e2e4b74bec96694169f01cd3feec1f264 I was trying to find a solution for my application to run on setups which don't have the kernel patch. Introducing a devarg to control when packet_fanout is enabled seemed like a possible way around the issue. As a second solution around the issue, I was also thinking of using the PACKET_FANOUT setsockopt only when nb_queues > 1. Conceptually, I think it would make some sense, because fanout doesn't really help when having a single socket. We would need more sockets in order to load balance incoming packets. And this would still allow me to control the behavior from the existing 'qpairs' devarg. The idea would be something along these lines if (nb_queues > 1) { rc = setsockopt(qsockfd, SOL_PACKET, PACKET_FANOUT, &fanout_arg, sizeof(fanout_arg)); /* ... */ } Would a patch using this idea be considered more suitable ? > - DPDK minimum kernel version is now 4.19 so no point in worrying about > backward compatibility. According to man page for packet, fanout > was added in 3.1 kernel. > > - It would be useful to allow application to control fanout in more detail. This is a great idea. Would introducing a new devarg, (e.g 'fanout_mode') be the proper way to allow the application to customize fanout in more detail ? --vdev=net_af_packet0,iface=eth1,blocksz=4096,framesz=2048,framecnt=512,qpairs=1,fanout_mode=[fanout_hash|fanout_cpu|fanout_rnd|fanout_qm]