Il 20/09/2013 11:54, Kevin Wolf ha scritto:
> Am 19.09.2013 um 18:48 hat Paolo Bonzini geschrieben:
>> The following sequence happens:
>> - the SeaBIOS virtio-blk driver does not support the WCE feature, which
>> causes QEMU to disable writeback caching
>>
>> - the Linux virtio-blk driver resets the device, finds WCE is available
>> but writeback caching is disabled; tells block layer to not send cache
>> flush commands
>>
>> - the Linux virtio-blk driver sets the DRIVER_OK bit, which causes
>> writeback caching to be re-enabled, but the Linux virtio-blk driver does
>> not know of this side effect and cache flushes remain disabled
>>
>> The bug is at the third step.  If the guest does know about CONFIG_WCE,
>> QEMU should ignore the WCE feature's state.  The guest will control the
>> cache mode solely using configuration space.  This change makes Linux
>> do flushes correctly, but Linux will keep SeaBIOS's writethrough mode.
> 
> This sounds fishy. The solutions happens to make recent Linux kernels do
> the right thing, but wouldn't drivers that don't know CONFIG_WCE still
> fall into the same trap?

No, drivers that don't know CONFIG_WCE will do the following:

1) -drive cache=writethrough case, WCE supported

    When the driver resets the device, QEMU disables the write cache
    (virtio_blk_reset).  Thus VIRTIO_BLK_F_WCE is not advertised.
    The Linux virtio-blk driver tells the block layer to not send
    cache flush commands, which is correct because they are useless.
    VIRTIO_BLK_F_WCE is obviously not negotiated, and
    virtio_blk_set_status confirms the disk in writethrough mode.

2) -drive cache=writeback case, WCE supported

    When the driver resets the device, QEMU disables the write cache
    (virtio_blk_reset).  Thus VIRTIO_BLK_F_WCE is advertised by the
    device and negotiated by the driver.  The Linux virtio-blk driver
    recognizes that VIRTIO_BLK_F_WCE is negotiated and tells the block
    layer to send cache flush commands.  virtio_blk_set_status confirms
    the disk in writeback mode.

3) -drive cache=writethrough case, WCE not supported

    When the driver resets the device, QEMU disables the write cache
    (virtio_blk_reset).  Thus VIRTIO_BLK_F_WCE is not advertised.
    The virtio-blk driver doesn't do anything.
    virtio_blk_set_status confirms the disk in writethrough mode.

4) -drive cache=writeback case, WCE not supported

    When the driver resets the device, QEMU disables the write cache
    (virtio_blk_reset).  Thus VIRTIO_BLK_F_WCE is advertised by the
    device, but not negotiated by the driver.
    The virtio-blk driver doesn't do anything.
    virtio_blk_set_status places the disk in writethrough mode.

> I guess making a host feature flag dynamic was
> a bad idea to start with.

I disagree, it is very useful.  The bug was unfortunate indeed, and
probably happened due to testing the two patches (CONFIG_WCE and
no-WCE-implies-writethrough) independently rather than together.

> Perhaps we should restrict the magic to disabling WCE in case the guest
> doesn't have VIRTIO_BLK_F_WCE, but never allow it to enable WCE even
> though we've already advertised that the host doesn't have WCE.

That's already what happens, because (thanks to the new
"bdrv_set_enable_write_cache(s->bs, s->original_wce);" at reset time)
VIRTIO_BLK_F_WCE is never exposed in writethrough mode.

Paolo

Reply via email to