hi matan

On 11/6/2018 2:22 PM, Matan Azrad wrote:
Hi Jeff

Can you detail more in the commit log that we can understand the deadlock 
scenario. And how does this commit fix it?


Before i add more detail in the commit log of next version, i would explain to you here at first here.

When the device be hot-unplugged,  the hot-unplug handler will be invoked and the device will be detached, at this time if the interrupt still not disable soon and the second

remove event come again(kernel will sent pci remove event after sent uio remove event) , the bus->find_device will return null and return, at this place lack of an unlock.

Without this unlock, it will block the next remove or add event detection. So it definitely need an unlock here to avoid dead lock.


-----Original Message-----
From: Jeff Guo <jia....@intel.com>
Sent: Tuesday, November 6, 2018 8:07 AM
To: konstantin.anan...@intel.com; anatoly.bura...@intel.com; Thomas
Monjalon <tho...@monjalon.net>; bernard.iremon...@intel.com;
jingjing...@intel.com; wenzhuo...@intel.com
Cc: ferruh.yi...@intel.com; dev@dpdk.org; jia....@intel.com;
helin.zh...@intel.com; Matan Azrad <ma...@mellanox.com>;
shaopeng...@intel.com
Subject: [PATCH 1/3] eal: fix lock issue for hot-unplug

This patch will add missing unlock for hot-unplug handler, without this patch
potential dead lock will occur when device be hotplug-in after device be hot-
unplugged.

Fixes: 0fc54536b14a ("eal: add failure handling for hot-unplug")
Signed-off-by: Jeff Guo <jia....@intel.com>
---
  lib/librte_eal/linuxapp/eal/eal_dev.c | 12 ++++++++----
  1 file changed, 8 insertions(+), 4 deletions(-)

diff --git a/lib/librte_eal/linuxapp/eal/eal_dev.c
b/lib/librte_eal/linuxapp/eal/eal_dev.c
index d589c69..2830c86 100644
--- a/lib/librte_eal/linuxapp/eal/eal_dev.c
+++ b/lib/librte_eal/linuxapp/eal/eal_dev.c
@@ -258,7 +258,7 @@ dev_uev_handler(__rte_unused void *param)
                        if (bus == NULL) {
                                RTE_LOG(ERR, EAL, "Cannot find bus (%s)\n",
                                        busname);
-                               return;
+                               goto failure_handle_err;
                        }

                        dev = bus->find_device(NULL, cmp_dev_name, @@
-266,19 +266,23 @@ dev_uev_handler(__rte_unused void *param)
                        if (dev == NULL) {
                                RTE_LOG(ERR, EAL, "Cannot find device (%s)
on "
                                        "bus (%s)\n", uevent.devname,
busname);
-                               return;
+                               goto failure_handle_err;
                        }

                        ret = bus->hot_unplug_handler(dev);
-                       rte_spinlock_unlock(&failure_handle_lock);
                        if (ret) {
                                RTE_LOG(ERR, EAL, "Can not handle hot-
unplug "
                                        "for device (%s)\n", dev->name);
-                               return;
                        }
+                       rte_spinlock_unlock(&failure_handle_lock);
                }
                rte_dev_event_callback_process(uevent.devname,
uevent.type);
        }
+
+       return;
+
+failure_handle_err:
+       rte_spinlock_unlock(&failure_handle_lock);
  }

  int __rte_experimental
--
2.7.4

Reply via email to