Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy

Avi Kivity Thu, 24 Feb 2011 00:54:40 -0800

On 02/23/2011 10:18 PM, Anthony Liguori wrote:

Then the management stack has to worry about yet another way ofinteracting via qemu.
{ 'StateItem': { 'key': 'str', 'value': 'str' } }
{ 'StateSection': { 'kind': 'str', 'name': 'str', 'items': ['StateItem' ] } }
{ 'StateInfo': { 'sections': [ 'StateSection' ] } }

{ 'query-state', {}, {}, 'StateInfo' }
A management tool never need to worry about anything other than thiscommand if it so chooses. If we have the pre-machine init mode for0.16, then this can even be used to inspect state without running aguest.

So we have yet another information tree. If we store the cd-rom ejectstate here, then we need to make an association between the device pathof the cd-rom, and the StateItem key.

Far better to store it in the device itself. For example, we could makea layered block format driver that stores the eject state and a "backingfile" containing the actual media. Eject and media change would berecorded in the block format driver's state. You could then hot-unpluga USB cd-writer and hot-plug it back into a different guest,implementing a virtual sneakernet.

The fact that the state is visible in the filesystem is animplementation detail.

A detail that has to be catered for by the management stack - it has toprovide a safe place for it, back it up, etc.

  I'd like to limit it to the monitor.
Doesn't the stateful non-config file becomes a failure point? Ithas to be on shared and redundant storage?
It depends on what your availability model is and how frequentlyyour management tool backs up the config. As of right now, we havea pretty glaring reliability hole here so adding a stateful"non-config" can only improve things.
I think the solutions I pointed out close the hole with the existinginterfaces.
It doesn't work for eject unless you interpose an acknowledged event.Ultimately, this is a simple problem. If you want reliability, weeither need symmetric RPCs so that the device model can call (andwait) to the management layer to acknowledge a change or QEMU can postan event to the management layer, and maintain the state in a reliablefashion.


I don't see why it doesn't work.  Please explain.

You still have the race condition around guest initiated events likeeject. Unless you have an acknowledged event from a management tool(which we can't do in QMP today) whereas you don't complete theguest initiated eject operation until management ack's it, we needto store that state ourself.
I don't see why.
If management crashes, it queries the eject state when it reconnectsto qemu.If qemu crashes, the eject state is lost, but that is fine. MyCD-ROM drive tray pulls itself in when the machine is started.
Pick any of a number of possible events that change the machine'sstate. We can wave our hands at some things saying they don't matterand do one off solutions for others, or we can just have a robust wayof handling this consistently.

Both block live copy and cd-rom eject state can be solved with layeredblock format drivers. I don't think a central place for random datamakes sense. State belongs near the device that maintains it, esp. ifthe device is hot-pluggable, so it's easy to associate the state withthe device.

You're introducing the need for additional code in the managementlayer, the care and feeding for the stateful non-config file.
If a management layer ignores the stateful non-config file, as youlike to call it, it'll get the same semantics it has today. I thinkmanaging a single thing is a whole lot easier than managing an NVRAMfile, a block migration layering file, and all of the future thingswe're going to add once we decide they are important too.

I disagree. Storing NVRAM as a disk image is a simple extension ofexisting management tools. Block live-copy and cd-rom eject state alsomake sense as per-image state if you take hotunplug and hotplug intoaccount.

If qemu crashes, these events are meaningless. If managementcrashes, it has to query qemu for all state that it wants to keeptrack of via events.
Think power failure, not qemu crash. In the event of a powerfailure, any hardware change initiated by the guest ought to beconsistent with when the guest has restarted. If you eject theCDROM tray and then lose power, its still ejected after the powercomes back on.
Not on all machines.
Let's list guest state which is independent of power. That would bewither NVRAM of various types, or physical alterations. CD-ROM ejectis one. Are there others?
Any indirect qemu state. Block migration is an example, but otherexamples would be VNC server information (like current password), WCEsetting (depending on whether we modelled eeprom for the drivers), andpersisted device settings (lots of devices have eeprom these days).


Device settings should be stored with the devices, not with qemu.

Suppose we take the cold-plug on startup via the monitor approach. Sowe start with a bare machine, cold plug stuff into it. Now qemu has toreconcile the stateful non-config file with the hardware. What ifsomething has changed? A device moved into a different slot?

If a network card has eeprom, we can specify it with -devicertl8139,eeprom=id, where id specifies a disk image for the eeprom.

I think my solution (multiplexing block format driver) fits therequirements for live-copy perfectly. In fact it has a name - it's aRAID-1 driver started in degraded mode. It could be useful other usecases.
It feels a bit awkward to me to be honest.


Not to me.

--
error compiling committee.c: too many arguments to function

Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy

Reply via email to