Hi Peng & Fei, > -----Original Message----- > From: dev <dev-boun...@dpdk.org> On Behalf Of Peng He > Sent: Friday, January 29, 2021 3:36 PM > To: dev@dpdk.org > Cc: maxime.coque...@redhat.com > Subject: [dpdk-dev] [PATCH] lib/librte_vhost: fix vid allocation race
Fix the title to 'vhost: XXXXX' > > From: "chenwei.0515" <chenwei.0...@bytedance.com> This should not be here.. you could just delete it as the author is already in commit message. > > vhost_new_devcie might be called in different threads at the same time. s/devcie/device > thread 1(config thread) > rte_vhost_driver_start > ->vhost_user_start_client > ->vhost_user_add_connection > -> vhost_new_device > > thread 2(vhost-events) > vhost_user_read_cb > ->vhost_user_msg_handler (return value < 0) > -> vhost_user_start_client > -> vhost_new_device > > So there could be a case that a same vid has been allocated twice, or > some vid might be lost in DPDK lib however still held by the upper > applications. Good catch! I checked the code and find there exists cases that different threads may allocate vhost slot. And I also find that other functions which use the global vhost_devices[] may also meet the same problem. For example, vhost_destroy_device() could be called by different thread. So I suggest to fix the problem completely in all related functions like vhost_destroy_device() and get_device(). What do you think? If you agree with above, note that the title should also be changed. Besides, please also add 'Fixes:$COMMID_ID' and cc to sta...@dpdk.org so it could be fixed in LTS. You can check other commit for reference. > > Reported-by: Peng He <hepeng.0...@bytedance.com> > Signed-off-by: Fei Chen <chenwei.0...@bytedance.com> > Reviewed-by: Zhihong Wang <wangzhihong....@bytedance.com> > --- > lib/librte_vhost/vhost.c | 6 ++++++ > 1 file changed, 6 insertions(+) > > diff --git a/lib/librte_vhost/vhost.c b/lib/librte_vhost/vhost.c > index efb136edd1..db11d293d2 100644 > --- a/lib/librte_vhost/vhost.c > +++ b/lib/librte_vhost/vhost.c > @@ -26,6 +26,7 @@ > #include "vhost_user.h" > > struct virtio_net *vhost_devices[MAX_VHOST_DEVICE]; > +pthread_mutex_t vhost_dev_lock = PTHREAD_MUTEX_INITIALIZER; There's a duplicate space between 'pthread_mutex_t' and 'vhost_dev_lock', Let's just leave one 😊 Thanks, Chenbo > > /* Called with iotlb_lock read-locked */ > uint64_t > @@ -645,6 +646,7 @@ vhost_new_device(void) > struct virtio_net *dev; > int i; > > + pthread_mutex_lock(&vhost_dev_lock); > for (i = 0; i < MAX_VHOST_DEVICE; i++) { > if (vhost_devices[i] == NULL) > break; > @@ -653,6 +655,7 @@ vhost_new_device(void) > if (i == MAX_VHOST_DEVICE) { > VHOST_LOG_CONFIG(ERR, > "Failed to find a free slot for new device.\n"); > + pthread_mutex_unlock(&vhost_dev_lock); > return -1; > } > > @@ -660,10 +663,13 @@ vhost_new_device(void) > if (dev == NULL) { > VHOST_LOG_CONFIG(ERR, > "Failed to allocate memory for new dev.\n"); > + pthread_mutex_unlock(&vhost_dev_lock); > return -1; > } > > vhost_devices[i] = dev; > + pthread_mutex_unlock(&vhost_dev_lock); > + > dev->vid = i; > dev->flags = VIRTIO_DEV_BUILTIN_VIRTIO_NET; > dev->slave_req_fd = -1; > -- > 2.23.0