Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy

Anthony Liguori Wed, 23 Feb 2011 08:05:30 -0800

On 02/23/2011 09:31 AM, Avi Kivity wrote:

On 02/23/2011 04:35 PM, Anthony Liguori wrote:
On 02/23/2011 07:01 AM, Avi Kivity wrote:
On 02/23/2011 01:14 AM, Anthony Liguori wrote:
-drive already ties into the qemuopts infrastructure and we havereadconfig and writeconfig. I don't think we're missing any majorpieces to do this in a more proper fashion.
The problem with qemu config files is that it splits theauthoritative source of where images are stored into two. Is it inthe management tool's database or is it in qemu's config file?
I like to use the phrase "stateful config file". To me, it's just adatabase for QEMU to persist data about the VM. It's the only wayfor QEMU to make certain transactions atomic in the face of QEMUcrashing.
The user visible config file is a totally different concept. Amanagement tool launches QEMU and tells it where to keep it's statedatabase. The management application may prepopulate the statedatabase or it may just use an empty file.
In that case the word 'config' is misleading. To me, it implies thatthe user configures something, and qemu reads it, not something mostlyinternal to qemu.


Understood.

Qemu does keep state. Currently only images, but in theory also theon-board NVRAM.

Yeah, this is a good example of an area where a "stateful config file"would be useful. I like the idea of storing this sort of thing in atext file with a config structure because a user certainly wants to beable to specify the boot order. Being able to tweak this kind of stuffadds a lot of interesting capabilities.

QEMU uses the state database to store information that is createddynamically. For instance, devices added through device_add. Adevice added via -device wouldn't necessary get added to the statedatabase.
Practically speaking, it let's you invoke QEMU with a fixed commandline, while still using the monitor to make changes that wouldotherwise require the command line being updated.
Then the invoker quickly loses track of what the actual state is. Itcan't just remember which commands it issued (presumably in responseto the user updating user visible state). It has to parse thestateful config file qemu outputs.

Well specifically, it has to ask QEMU and QEMU can tell it the currentstate via a nice structured data format over QMP. It's a hell of a loteasier than the management tool trying to do this outside of QEMU.

  But at which points should it parse it?

I was thinking that we should post events whenever we change thestateful config. That would let the management tool have a mechanismfor determining when settings have been changed. Of course, if themanagement tool crashes, it should re-read at startup.

I don't think it's reasonable to have three different ways to interactwith qemu, all needed: the command line, reading and writing thestateful config file, and the monitor. I'd rather push for startingqemu with a blank guest and assembling (cold-plugging) all thehardware via the monitor before starting the guest.

Yes. I view the command line as optional. To me, this is the idealinteraction:


1) start qemu with an empty stateful config file

2) issue monitor commands to create all devices and backends

3) the stateful config file totally captures the state of all of theissued QMP commands. The management tool can relaunch the guest just bypassing the stateful config file to QEMU.

4) when the management tool needs to "extract" a config file, it canread the stateful config (through the monitor) and generate it's own config.

5) the management tool should treat the stateful config file as more orless opaque. It shouldn't be visible to end user.

In the non-managed case, users should interact directly with the configfile.

For the problem at hand, one solution is to make qemu stop after thecopy, and then management can issue an additional command torearrange the disk and resume the guest. A drawback here is that ifmanagement dies, the guest is stopped until it restarts. We alsomake management latency guest visible, even if it doesn't die at aninconvenient place.
An alternative approach is to have the copy be performed by a newlayered block format driver:
- create a new image, type = live-copy, containing three pieces ofinformation
   - source image
   - destination image
   - copy state (initially nothing is copied)
- tell qemu switch to the new image
- qemu starts copying, updates copy state as needed
- copy finishes, event is emitted; reads and writes still serviced
- management receives event, switches qemu to destination image
- management removes live-copy image
If management dies while this is happening, it can simply query thestate of the copy. Similarly, if qemu dies, the copy state ispersistent (could be 0/1 or real range of blocks).
This is a more elegant solution to the problem than the commitproblem but it's also a one-off. I think we have a generic problemhere and we ought to try to solve it generically (within reason).
Can you give more examples?
I think I demonstrated that hot-plug can be solved via the existinginterfaces.

Sure. CMOS settings right now are not persisted across reboot. Guestinitiated activities like IDE or PCI eject are tricky to persistcorrectly within a management tool.

We could add events for all of this things but it's all racy sinceevents are posted. If we have a stateful config file, we can make allof these things non-racy and post an event that the config has changed.If there's a crash, the management tool can read the config on startupto catch up on missed events.

I think the nature of a posted event management interface is such thatwe need a stateful config that persists across QEMU invocations.


Regards,

Anthony Liguori

Re: [Qemu-devel] Re: [patch 2/3] Add support for live block copy

Reply via email to