Am 17.05.2019 um 10:42 hat Klaus Birkelund Jensen geschrieben: > Hi, > > This series of patches contains a number of refactorings to the emulated > nvme device, adds additional features, such as support for metadata and > scatter gather lists, and bumps the supported NVMe version to 1.3. > Lastly, it contains a new 'ocssd' device. > > The motivation for the first seven patches is to set everything up for > the final patch that adds a new 'ocssd' device and associated block > driver that implements the OpenChannel 2.0 specification[1]. Many of us > in the OpenChannel comunity have used a qemu fork[2] for emulation of > OpenChannel devices. The fork is itself based on Keith's qemu-nvme > tree[3] and we recently merged mainline qemu into it, but the result is > still a "hybrid" nvme device that supports both conventional nvme and > the OCSSD 2.0 spec through a 'dialect' mechanism. Merging instead of > rebasing also created a pretty messy commit history and my efforts to > try and rebase our work onto mainline was getting hairy to say the > least. And I was never really happy with the dialect approach anyway. > > I have instead prepared this series of fresh patches that incrementally > adds additional features to the nvme device to bring it into shape for > finally introducing a new (and separate) 'ocssd' device that emulates an > OpenChannel 2.0 device by reusing core functionality from the nvme > device. Providing a separate ocssd device ensures that no ocssd specific > stuff creeps into the nvme device. > > The ocssd device is backed by a new 'ocssd' block driver that holds > internal meta data and keeps state permanent across power cycles. In the > future I think we could use the same approach for the nvme device to > keep internal metadata such as utilization and deallocated blocks.
A backend driver that is specific for a guest device model (i.e. the device model requires this driver, and the backend is useless without the device) sounds like a very questionable design. Metadata like OcssdFormatHeader that is considered part of the image data, which means that the _actual_ image content without metadata isn't directly accessible any more feels like a bad idea, too. Simple things like what a resize operation means (change only the actual disk size as usual, or is the new size disk + metadata?) become confusing. Attaching an image to a different device becomes impossible. The block format driver doesn't seem to actually add much functionality to a specially crafted raw image: It provides a convenient way to create such special images and it dumps some values in 'qemu-img info', but the actual interpretation of the data is left to the device model. Looking at the options it does provide, my impression is that these should really be qdev properties, and the place to store them persistently is something like the libvirt XML. The device doesn't change any of the values, so there is nothing that QEMU actually needs to store. What you invented is a one-off way to pass a config file to a device, but only for one specific device type. I think this needs to use a much more standard approach to be mergable. Markus (CCed) as the maintainer for the configuration mechanisms may have an opinion on this, too. > For now, the nvme device does not support the Deallocated and > Unwritten Logical Block Error (DULBE) feature or the Data Set > Management command as this would require such support. Doesn't bdrv_co_block_status() provide all the information you need for that? Kevin