On 01/11/2010 08:46 AM, Avi Kivity wrote:
On 01/11/2010 04:37 PM, Anthony Liguori wrote:
That has the downside of bouncing a cache line on unrelated exits.
The read and write sides of the ring are widely separated in physical
memory specifically to avoid cache line bouncing.
I meant, exits on random vcpus will cause the cache line containing the
notification disable flag to bounce around. As it is, we read it on
the vcpu that owns the queue and write it on that vcpu or the I/O thread.
Bottom halves are always run from the IO thread.
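For reference, the notification disable flag in question is the used ring's
flags word (VRING_USED_F_NO_NOTIFY). The legacy ring layout, roughly as in
linux/virtio_ring.h, looks like this; the guest-written avail ring and the
host-written used ring are kept well apart (the used ring starts on its own
page), which is the "widely separated" property mentioned above:

#include <stdint.h>

/* Legacy vring layout, roughly as in linux/virtio_ring.h. */

#define VRING_AVAIL_F_NO_INTERRUPT 1   /* guest -> host: don't interrupt me */
#define VRING_USED_F_NO_NOTIFY     1   /* host -> guest: don't kick me */

struct vring_avail {            /* guest-written side */
    uint16_t flags;
    uint16_t idx;
    uint16_t ring[];
};

struct vring_used_elem {
    uint32_t id;                /* descriptor chain head */
    uint32_t len;               /* bytes written into the buffer */
};

struct vring_used {             /* host-written side */
    uint16_t flags;             /* VRING_USED_F_NO_NOTIFY lives here */
    uint16_t idx;
    struct vring_used_elem ring[];
};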
It probably doesn't matter with qemu as it is now, since it will
bounce qemu_mutex, but it will hurt with large guests (especially if
they have many rings).
IMO we should get things to work well without riding on unrelated
exits, especially as we're trying to reduce those exits.
A block I/O request can potentially be very, very long lived. By
serializing requests like this, there's a high likelihood that it's
going to kill performance with anything capable of processing
multiple requests.
Right, that's why I suggested having a queue depth at which disabling
notification kicks in. The patch hardcodes this depth to 1, in unpatched
qemu it is effectively infinite, and a good value is probably spindle count + VAT.
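A minimal sketch of what depth-triggered disabling might look like (the
in-flight counter and threshold field below are invented names, not existing
code; virtio_queue_set_notification() is the existing qemu helper):

/* Sketch only: disable guest->host notifications once enough requests
 * are already in flight, instead of always (depth 1, as in the patch)
 * or never (unpatched qemu).  s->rq_in_flight and s->notify_threshold
 * are hypothetical fields. */
static void virtio_blk_update_notification(VirtIOBlock *s, VirtQueue *vq)
{
    if (s->rq_in_flight >= s->notify_threshold) {
        virtio_queue_set_notification(vq, 0);   /* deep enough: poll the ring */
    } else {
        virtio_queue_set_notification(vq, 1);   /* shallow: take kicks again */
    }
}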
That means we would need a user-visible option, which is quite unfortunate.
Also, that logic only really makes sense with cache=off. With
cache=writethrough, you can get pathological cases where an uncached access
is followed by cached accesses. In fact, with read-ahead, this is probably
not an uncommon scenario.
OTOH, if we aggressively poll the ring when we have an opportunity
to, there's very little downside and it addresses the serialization
problem.
But we can't guarantee that we'll get those opportunities, so it
doesn't address the problem in a general way. A guest that doesn't
use hpet and only has a single virtio-blk device will not have any
reason to exit to qemu.
We can mitigate this with a timer, but honestly we need to do perf
measurements to see. My feeling is that we will need some more
aggressive form of polling than just waiting for IO completion. I don't
think queue depth is enough, because it assumes that all requests are
equal. When dealing with cache=off, or even just storage with its own
cache, that's simply not the case.
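To make the timer idea concrete, something along these lines (purely a
sketch: the interval, the callback name and the extra state are invented;
qemu_mod_timer()/qemu_get_clock() and virtio_queue_set_notification() are
the existing interfaces):

/* Sketch only: while notifications are disabled, re-poll the ring from
 * a vm_clock timer so progress doesn't depend on unrelated exits.
 * s->poll_timer, s->rq_in_flight, s->notify_threshold and
 * POLL_INTERVAL_NS are hypothetical. */
#define POLL_INTERVAL_NS 100000          /* 100us; would need tuning from perf data */

static void virtio_blk_poll_timer(void *opaque)
{
    VirtIOBlock *s = opaque;

    virtio_blk_handle_output(&s->vdev, s->vq);   /* drain newly queued requests */

    if (s->rq_in_flight >= s->notify_threshold) {
        /* still busy, keep polling */
        qemu_mod_timer(s->poll_timer,
                       qemu_get_clock(vm_clock) + POLL_INTERVAL_NS);
    } else {
        /* back to notification-driven operation */
        virtio_queue_set_notification(s->vq, 1);
    }
}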
Regards,
Anthony Liguori