On 10/25/2019 2:48 AM, Di, ChenxuX wrote: > Hi, Ferruh > >> -----Original Message----- >> From: Yigit, Ferruh >> Sent: Friday, October 25, 2019 1:24 AM >> To: Di, ChenxuX <chenxux...@intel.com>; dev@dpdk.org >> Cc: Lu, Wenzhuo <wenzhuo...@intel.com>; Yang, Qiming >> <qiming.y...@intel.com> >> Subject: Re: [dpdk-dev] [PATCH] app/testpmd: fix device hotplug remove >> >> On 10/24/2019 2:03 AM, Di ChenxuX wrote: >>> Hotplug remove cause infinite loops. Fix by canceling port_close >>> before port_detach function when rmv_port_callback. >> >> Can you please give more details/backtrace of how loop happens? >> How can trigger it? > > Here is the test case > > Environment > Os: Ubuntu 18.04 > Device: X710 nic > Software: qemu > > 1. Bind pf0 to vfio-pci > > [root@xxxxxxxxx dpdk]# modprobe vfio-pci > [root@ xxxxxxxxx dpdk]# usertools/dpdk-devbind.py --force > --bind=vfio-pci 0000:81:00.0 > 2. Passthrough PF and start qemu > > [root@ xxxxxxxxx dpdk]# taskset -c 0-7 qemu-system-x86_64 -enable-kvm > -pidfile /tmp/.vm0.pid -m 10240 -cpu host -smp 8 -name vm0 -monitor > unix:/tmp/vm0_monitor.sock,server,nowait -chardev > socket,path=/tmp/vm0_qga0.sock,server,nowait,id=vm0_qga0 -device > virtio-serial -device > virtserialport,chardev=vm0_qga0,name=org.qemu.guest_agent.0 -device > e1000,netdev=nttsip1 -netdev user,id=nttsip1,hostfwd=tcp: xxxxxxxxx:6000-:22 > -monitor stdio -drive file=/home/image/test_vfio.img -vnc :5 -device > vfio-pci,host=0000:81:00.0,id=dev1 > 3. Log in VM, bind passthrough port 0 to vfio-pci > > virtdut. xxxxxxxxx:6000: modprobe -r vfio_iommu_type1 > virtdut. xxxxxxxxx:6000: modprobe -r vfio > virtdut. xxxxxxxxx:6000: modprobe vfio enable_unsafe_noiommu_mode=1 > virtdut. xxxxxxxxx:6000: modprobe vfio-pci > > virtdut. xxxxxxxxx:6000: ./usertools/dpdk-devbind.py -b vfio-pci > 0000:00:05.0 > 4. Start testpmd with "--hot-plug" enable > > virtdut. xxxxxxxxx:6000: ./x86_64-native-linuxapp-gcc/app/testpmd -l > 0,1,2,3,4,5,6,7 -n 1 -w 0000:00:05.0 --file-prefix=dpdk_24610_20191014100036 > -- -i --hot-plug > 5. Remove device from qemu interface > > (qemu) device_del dev1 > > 6.before change > Removing a device... > EAL: Driver cannot detach the device (0000:00:05.0) > EAL: Failed to detach device on primary process > testpmd: Failed to detach device 0000:00:05.0 > EAL: can not get port by device 0000:00:05.0! > EAL: can not get port by device 0000:00:05.0!
'close()' is failing is a problem and should be fix but we keep getting "RTE_DEV_EVENT_REMOVE" event? Did you able to get the backtrace of the issue? > ... > EAL: can not get port by device 0000:00:05.0! > ... > ... > after change: > Removing a device... > EAL: Error disabling MSI-X interrupts for fd 47 > EAL: Releasing pci mapped resource for 0000:00:05.0 > EAL: Calling pci_unmap_resource for 0000:00:05.0 at 0x1100800000 > EAL: Calling pci_unmap_resource for 0000:00:05.0 at 0x1101000000 > Device of port 0 is detached > Now total ports is 0 > Done > Invalid port_id=0 > EAL: Cannot find device (0000:00:05.0) on bus (pci) > > >> >>> >>> Fixes: ac89d46096d5 ("net/i40e: release port upon close") >>> >>> Signed-off-by: Di ChenxuX <chenxux...@intel.com> >>> --- >>> app/test-pmd/testpmd.c | 1 - >>> 1 file changed, 1 deletion(-) >>> >>> diff --git a/app/test-pmd/testpmd.c b/app/test-pmd/testpmd.c index >>> 5701f3141..a264644a1 100644 >>> --- a/app/test-pmd/testpmd.c >>> +++ b/app/test-pmd/testpmd.c >>> @@ -2708,7 +2708,6 @@ rmv_port_callback(void *arg) >>> no_link_check = 1; >>> stop_port(port_id); >>> no_link_check = org_no_link_check; >>> - close_port(port_id); >>> detach_port_device(port_id); >>> if (need_to_start) >>> start_packet_forwarding(0); >>> >