From: Naman Jain <namj...@linux.microsoft.com> Sent: Thursday, March 27, 2025 
10:28 PM
> 
> On regular bootup, devices get registered to VMBus first, so when
> uio_hv_generic driver for a particular device type is probed,
> the device is already initialized and added, so sysfs creation in
> uio_hv_generic probe works fine. However, when device is removed
> and brought back, the channel rescinds and device again gets
> registered to VMBus. However this time, the uio_hv_generic driver is
> already registered to probe for that device and in this case sysfs
> creation is tried before the device's kobject gets initialized
> completely.
> 
> Fix this by moving the core logic of sysfs creation for ring buffer,
> from uio_hv_generic to HyperV's VMBus driver, where rest of the sysfs
> attributes for the channels are defined. While doing that, make use
> of attribute groups and macros, instead of creating sysfs directly,
> to ensure better error handling and code flow.
> 
> Problem path:
> vmbus_process_offer (new offer comes for the VMBus device)
>   vmbus_add_channel_work
>     vmbus_device_register
>       |-> device_register
>       |     |...
>       |     |-> hv_uio_probe
>       |           |...
>       |           |-> sysfs_create_bin_file (leads to a warning as
>       |                 primary channel's kobject, which is used to
>       |                 create sysfs is not yet initialized)
>       |-> kset_create_and_add
>       |-> vmbus_add_channel_kobj (initialization of primary channel's
>                                   kobject happens later)
> 
> Above code flow is sequential and the warning is always reproducible in
> this path.
> 
> Fixes: 9ab877a6ccf8 ("uio_hv_generic: make ring buffer attribute for primary 
> channel")
> Cc: sta...@kernel.org
> Suggested-by: Saurabh Sengar <ssen...@linux.microsoft.com>
> Suggested-by: Michael Kelley <mhkli...@outlook.com>
> Signed-off-by: Naman Jain <namj...@linux.microsoft.com>
> ---
>  drivers/hv/hyperv_vmbus.h    |   6 ++
>  drivers/hv/vmbus_drv.c       | 110 ++++++++++++++++++++++++++++++++++-
>  drivers/uio/uio_hv_generic.c |  33 +++++------
>  include/linux/hyperv.h       |   6 ++
>  4 files changed, 134 insertions(+), 21 deletions(-)
> 

[snip]

> +/**
> + * hv_create_ring_sysfs() - create "ring" sysfs entry corresponding to ring 
> buffers for a channel.
> + * @channel: Pointer to vmbus_channel structure
> + * @hv_mmap_ring_buffer: function pointer for initializing the function to 
> be called on mmap of
> + *                       channel's "ring" sysfs node, which is for the ring 
> buffer of that channel.
> + *                       Function pointer is of below type:
> + *                       int (*hv_mmap_ring_buffer)(struct vmbus_channel 
> *channel,
> + *                                                  struct vm_area_struct 
> *vma))
> + *                       This has a pointer to the channel and a pointer to 
> vm_area_struct,
> + *                       used for mmap, as arguments.
> + *
> + * Sysfs node for ring buffer of a channel is created along with other 
> fields, however its
> + * visibility is disabled by default. Sysfs creation needs to be controlled 
> when the use-case
> + * is running.
> + * For example, HV_NIC device is used either by uio_hv_generic or hv_netvsc 
> at any given point of
> + * time, and "ring" sysfs is needed only when uio_hv_generic is bound to 
> that device. To avoid
> + * exposing the ring buffer by default, this function is reponsible to 
> enable visibility of
> + * ring for userspace to use.
> + * Note: Race conditions can happen with userspace and it is not encouraged 
> to create new
> + * use-cases for this. This was added to maintain backward compatibility, 
> while solving
> + * one of the race conditions in uio_hv_generic while creating sysfs.
> + *
> + * Returns 0 on success or error code on failure.
> + */
> +int hv_create_ring_sysfs(struct vmbus_channel *channel,
> +                      int (*hv_mmap_ring_buffer)(struct vmbus_channel 
> *channel,
> +                                                 struct vm_area_struct *vma))
> +{
> +     struct kobject *kobj = &channel->kobj;
> +     struct vmbus_channel *primary_channel = channel->primary_channel ?
> +             channel->primary_channel : channel;
> +
> +     channel->mmap_ring_buffer = hv_mmap_ring_buffer;
> +     channel->ring_sysfs_visible = true;
> +
> +     /*
> +      * Skip updating the sysfs group if the primary channel is not yet 
> initialized and sysfs
> +      * group is not yet created. In those cases, the 'ring' will be created 
> later in
> +      * vmbus_device_register() -> vmbus_add_channel_kobj().
> +      */
> +     if  (!primary_channel->device_obj->channels_kset)
> +             return 0;

This test doesn't accomplish what you want. It tests if the "channels" directory
has been created, but not if the numbered subdirectory for this channel has been
created. sysfs_update_group() operates on the numbered subdirectory and
could still fail because it hasn't been created yet.

My recommendation is to not try to do a test, and just let sysfs_update_group()
fail in that case (and ignore the error).

Michael

> +
> +     return sysfs_update_group(kobj, &vmbus_chan_group);
> +}
> +EXPORT_SYMBOL_GPL(hv_create_ring_sysfs);

Reply via email to