Re: [Qemu-devel] Re: [RFC][PATCH] performance improvement for windows guests, running on top of virtio block device

Anthony Liguori Thu, 25 Feb 2010 11:55:27 -0800

On 02/25/2010 11:33 AM, Avi Kivity wrote:

On 02/25/2010 07:15 PM, Anthony Liguori wrote:
I agree. Further, once we fine-grain device threading, the iothreadessentially disappears and is replaced by device-specific threads.There's no "idle" anymore.
That's a nice idea, but how is io dispatch handled? Is everythingsynchronous or do we continue to program asynchronously?
Simple stuff can be kept asynchronous, complex stuff (like qcow2)ought to be made synchronous (it uses threads anyway, so we don't loseanything). Stuff like vnc can go either way.

We've discussed this before and I still contend that threads do not makeqcow2 any simpler.

It's very difficult to mix concepts.
We're complicated enough to have conflicting requirements and a largecode base with its own inertia, so no choice really.
I personally don't anticipate per-device threading but ratheranticipate re-entrant device models. I would expect all I/O to bedispatched within the I/O thread and the VCPU threads to be able toexecute device models simultaneously with the I/O thread.
That means long-running operations on the iothread can lock out othercompletions.
Candidates for own threads are:
- live migration
- block format drivers (except linux-aio, perhaps have a thread forthe aio completion handler)
- vnc
- sdl
- sound?
- hotplug, esp. memory
Each such thread could run the same loop as the iothread. Anypollable fd or timer would be associated with a thread, so thingscontinue as normal more or less. Unassociated objects continue withthe main iothread.

Is the point latency or increasing available CPU resources? If thedevice models are re-entrant, that reduces a ton of the demand on theqemu_mutex which means that IO thread can run uncontended. While wehave evidence that the VCPU threads and IO threads are competing witheach other today, I don't think we have any evidence to suggest that theIO thread is self-starving itself with long running events.

With the device model, I'd like to see us move toward a very welldefined API for each device to use. Part of the reason for this is tolimit the scope of the devices in such a way that we can enforce this atcompile time. Then we can introduce locking within devices with somelevel of guarantee that we've covered the API devices are actuallyconsuming.

For host services though, it's much more difficult to isolate them likethis. I'm not necessarily claiming that this will never be the rightthing to do, but I don't think we really have the evidence today tosuggest that we should focus on this in the short term.


Regards,

Anthony Liguori

Re: [Qemu-devel] Re: [RFC][PATCH] performance improvement for windows guests, running on top of virtio block device

Reply via email to