On Thu, Sep 11, 2025 at 10:22:24PM -0400, Brian Song wrote:
> 
> 
> On 9/10/25 9:14 AM, Stefan Hajnoczi wrote:
> > On Tue, Sep 09, 2025 at 04:51:12PM -0400, Brian Song wrote:
> > > 
> > > 
> > > On 9/9/25 3:38 PM, Stefan Hajnoczi wrote:
> > > > On Fri, Aug 29, 2025 at 10:50:25PM -0400, Brian Song wrote:
> > > > > To test FUSE-over-io_uring, set the environment variable
> > > > > FUSE_OVER_IO_URING=1. This applies only when using the
> > > > > 'fuse' protocol.
> > > > > 
> > > > > $ FUSE_OVER_IO_URING=1 ./check -fuse
> > > > > 
> > > > > Suggested-by: Kevin Wolf <kw...@redhat.com>
> > > > > Suggested-by: Stefan Hajnoczi <stefa...@redhat.com>
> > > > > Signed-off-by: Brian Song <hibrians...@gmail.com>
> > > > > ---
> > > > >    tests/qemu-iotests/check     |  2 ++
> > > > >    tests/qemu-iotests/common.rc | 45 
> > > > > +++++++++++++++++++++++++++---------
> > > > >    2 files changed, 36 insertions(+), 11 deletions(-)
> > > > > 
> > > > > diff --git a/tests/qemu-iotests/check b/tests/qemu-iotests/check
> > > > > index 545f9ec7bd..c6fa0f9e3d 100755
> > > > > --- a/tests/qemu-iotests/check
> > > > > +++ b/tests/qemu-iotests/check
> > > > > @@ -94,6 +94,8 @@ def make_argparser() -> argparse.ArgumentParser:
> > > > >            mg.add_argument('-' + fmt, dest='imgfmt', 
> > > > > action='store_const',
> > > > >                            const=fmt, help=f'test {fmt}')
> > > > > +    # To test FUSE-over-io_uring, set the environment variable
> > > > > +    # FUSE_OVER_IO_URING=1. This applies only when using the 'fuse' 
> > > > > protocol
> > > > >        protocol_list = ['file', 'rbd', 'nbd', 'ssh', 'nfs', 'fuse']
> > > > >        g_prt = p.add_argument_group(
> > > > >            '  image protocol options',
> > > > > diff --git a/tests/qemu-iotests/common.rc 
> > > > > b/tests/qemu-iotests/common.rc
> > > > > index e977cb4eb6..f8b79c3810 100644
> > > > > --- a/tests/qemu-iotests/common.rc
> > > > > +++ b/tests/qemu-iotests/common.rc
> > > > > @@ -539,17 +539,38 @@ _make_test_img()
> > > > >            touch "$export_mp"
> > > > >            rm -f "$SOCK_DIR/fuse-output"
> > > > > -        # Usually, users would export formatted nodes.  But we 
> > > > > present fuse as a
> > > > > -        # protocol-level driver here, so we have to leave the format 
> > > > > to the
> > > > > -        # client.
> > > > > -        # Switch off allow-other, because in general we do not need 
> > > > > it for
> > > > > -        # iotests.  The default allow-other=auto has the downside of 
> > > > > printing a
> > > > > -        # fusermount error on its first attempt if allow_other is not
> > > > > -        # permissible, which we would need to filter.
> > > > 
> > > > This comment applies to both branches of the if statement. I think
> > > > keeping it here is slightly better.
> > > > 
> > > > > -        QSD_NEED_PID=y $QSD \
> > > > > -              --blockdev 
> > > > > file,node-name=export-node,filename=$img_name,discard=unmap \
> > > > > -              --export 
> > > > > fuse,id=fuse-export,node-name=export-node,mountpoint="$export_mp",writable=on,growable=on,allow-other=off
> > > > >  \
> > > > > -              &
> > > > > +        if [ -n "$FUSE_OVER_IO_URING" ]; then
> > > > > +            nr_cpu=$(nproc 2>/dev/null || echo 1)
> > > > > +            nr_iothreads=$((nr_cpu / 2))
> > > > > +            if [ $nr_iothreads -lt 1 ]; then
> > > > > +                nr_iothreads=1
> > > > > +            fi
> > > > 
> > > > Please add a comment explaining that the purpose of this configuration
> > > > based on the number of CPUs is to test multiple IOThreads when the host
> > > > allows it, since that is a more interesting case then just 1 IOThread.
> > > > Many other configurations are possible as well, but not all of them can
> > > > be tested because the test matrix would be large.
> > > > 
> > > > > +
> > > > > +            iothread_args=""
> > > > > +            iothread_export_args=""
> > > > > +            for ((i=0; i<$nr_iothreads; i++)); do
> > > > > +                iothread_args="$iothread_args --object 
> > > > > iothread,id=iothread$i"
> > > > > +                
> > > > > iothread_export_args="$iothread_export_args,iothread.$i=iothread$i"
> > > > > +            done
> > > > > +
> > > > > +            QSD_NEED_PID=y $QSD \
> > > > > +                    $iothread_args \
> > > > > +                    --blockdev 
> > > > > file,node-name=export-node,filename=$img_name,discard=unmap \
> > > > > +                    --export 
> > > > > fuse,id=fuse-export,node-name=export-node,mountpoint="$export_mp",writable=on,growable=on,allow-other=off,io-uring=on$iothread_export_args
> > > > >  \
> > > > > +                &
> > > > > +        else
> > > > > +            # Usually, users would export formatted nodes.  But we 
> > > > > present fuse as a
> > > > > +            # protocol-level driver here, so we have to leave the 
> > > > > format to the
> > > > > +            # client.
> > > > > +            # Switch off allow-other, because in general we do not 
> > > > > need it for
> > > > > +            # iotests.  The default allow-other=auto has the 
> > > > > downside of printing a
> > > > > +            # fusermount error on its first attempt if allow_other 
> > > > > is not
> > > > > +            # permissible, which we would need to filter.
> > > > > +            QSD_NEED_PID=y $QSD \
> > > > > +                --blockdev 
> > > > > file,node-name=export-node,filename=$img_name,discard=unmap \
> > > > > +                --export 
> > > > > fuse,id=fuse-export,node-name=export-node,mountpoint="$export_mp",writable=on,growable=on,allow-other=off
> > > > >  \
> > > > > +                &
> > > > > +        fi
> > > > >            pidfile="$QEMU_TEST_DIR/qemu-storage-daemon.pid"
> > > > > @@ -592,6 +613,8 @@ _rm_test_img()
> > > > >            kill "${FUSE_PIDS[index]}"
> > > > > +        sleep 1
> > > > > +
> > > > 
> > > > What is the purpose of this sleep command?
> > > > 
> > > 
> > > I don’t exactly remember why. It might get stuck if there’s no sleep 
> > > here. I
> > > remember we discussed this problem in earlier emails.
> > 
> > The purpose needs to be understood. Otherwise there is a good chance
> > that the test will fail randomly in a continuous integration environment
> > where things sometimes take a long time due to CPU contention.
> > 
> > Stefan
> 
> I think the issue lies in our current approach of using df to check whether
> the FUSE mount has been unmounted.
> 
> When we traced df with strace, we found that its logic for checking the
> mount point is:
> => Call mount to read the system's mount information
> => Use statfs() to get the filesystem statistics
> 
> But our current test code exits with the following sequence:
> => Kill the FUSE process
> => The kernel starts cleaning up the FUSE mount point
> => df calls statfs(), which requires communication with the FUSE process But
> the FUSE process might still be cleaning up, causing the communication to
> fail
> => df then returns an error or stale information
> => Our detection logic misinterprets this and immediately deletes the
> mounted image
> 
> Since we only need to check the system's mount information, we can just call
> mount and grep "$img" to verify whether the image has been successfully
> unmounted.
> 
> Does it make sense?

It's worth trying. Hanna wrote the existing code that uses df(1), so
maybe she has thoughts on this too.

I looked at waiting for FUSE_PIDS[] or using the QMP monitor to shut
down cleanly. Those approaches have their own issues. Sending a
`block-export-del` QMP command and waiting for it to return, followed by
a `quit` QMP command should work well. But it's more complex than
adjusting the existing loop and still needs a timeout. So I think the
mount(8) approach is worth a shot.

Stefan

Attachment: signature.asc
Description: PGP signature

Reply via email to