On Thu, May 14, 2015 at 02:46:44PM +0100, Andrew Jones wrote: > On Thu, May 14, 2015 at 01:05:09PM +0200, Christoffer Dall wrote: > > On Wed, May 13, 2015 at 01:31:52PM +0200, Andrew Jones wrote: > > > Provide a method to change normal, cacheable memory to non-cacheable. > > > KVM will make use of this to keep emulated device memory regions > > > coherent with the guest. > > > > > > Signed-off-by: Andrew Jones <drjo...@redhat.com> > > > > Reviewed-by: Christoffer Dall <christoffer.d...@linaro.org> > > > > But you obviously need Russell and Will/Catalin to ack/merge this. > > I guess this patch is going to go away in the next round. You've > pointed out that I screwed stuff up royally with my over eagerness > to reuse code. I need to reimplement change_memory_common, but a > version that takes an mm, which is more or less what I did in the > last version of this series, back when I was pinning pages.
I kept wondering what this patch and the next one are doing with set_memory_nc(). Basically you were trying to set the cache attributes for the kernel linear mapping or kmap address (in the 32-bit arm case, which is removed anyway on kunmap). What you need is changing the attributes of the user mapping as accessed by Qemu but I don't think simply re-implementing change_memory_common() would work, you probably need to pin the pages in memory as well. Otherwise, the kernel may remove such pages and, when bringing them back, would set the default cacheability attributes. Another way would be to split the vma containing the non-cacheable memory so that you get a single vma with the vm_page_prot as Non-cacheable. Yet another approach could be for KVM to mmap the necessary memory for Qemu via a file_operations.mmap call (but that's only for ranges outside the guest "RAM"). I didn't have time to follow these threads in details, but just to recap my understanding, we have two main use-cases: 1. Qemu handling guest I/O to device (e.g. PCIe BARs) 2. Qemu emulating device DMA For (1), I guess Qemu uses an anonymous mmap() and then tells KVM about this memory slot. The memory attributes in this case could be Device because that's how the guest would normally map it. The file_operations.mmap trick would work in this case but this means expanding the KVM ABI beyond just an ioctl(). For (2), since Qemu is writing to the guest "RAM" (e.g. video framebuffer allocated by the guest), I still think the simplest is to tell the guest (via DT) that such device is cache coherent rather than trying to remap the Qemu mapping as non-cacheable. -- Catalin