Hi Alex Williamson, I am able (with some hacks :)) to directly assign the e1000 PCI device to KVM guest using VFIO on Freescale device. One of the problem I am facing is about the DMA mapping in IOMMU for PCI device BARs. On Freescale devices, the mmaped PCI device BARs are not required to be mapped in IOMMU. Typically the flow of in/out transaction (from CPU) is:
Incoming flow: -----| |----------| |---------------| |-------------| CORE |<----<------<-----<--| IOMMU |<---<---<| PCI-Controller|<------<-----<----<| PCI device | -----| |----------| |---------------| |-------------| Outgoing Flow: IOMMU is bypassed for out transactions -----| |----------| |---------------| |-------------| CORE |>---->------>----| | IOMMU | ->-->| PCI-Controller|>------>----->---->| PCI device | -----| | |----------| ^ |---------------| |-------------| | | |------------------| Also because of some hardware limitations on our IOMMU it is difficult to map these BAR regions with RAM (DDR) regions. So on Freescale device we want the VFIO_IOMMU_MAP_DMA ioctl to be called for RAM regions (DDR) only and _not_ for these mmaped ram regions of PCI device bars. I can understand that we need to add these mmaped PCI bars as RAM type in qemu memory_region_*(). So for that I tried to skip these regions in VFIO memory_listeners. Below changes which works for me. I am not sure whether this is correct approach, please suggest. ------------- diff --git a/hw/vfio_pci.c b/hw/vfio_pci.c index c51ae67..63728d8 100644 --- a/hw/vfio_pci.c +++ b/hw/vfio_pci.c @@ -1115,9 +1115,35 @@ static int vfio_dma_map(VFIOContainer *container, hwaddr iova, return -errno; } -static bool vfio_listener_skipped_section(MemoryRegionSection *section) +static int memory_region_is_mmap_bars(VFIOContainer *container, + MemoryRegionSection *section) { - return !memory_region_is_ram(section->mr); + VFIOGroup *group; + VFIODevice *vdev; + int i; + + QLIST_FOREACH(group, &container->group_list, next) { + QLIST_FOREACH(vdev, &group->device_list, next) { + if (vdev->msix->mmap_mem.ram_addr == section->mr->ram_addr) + return 1; + for (i = 0; i < PCI_ROM_SLOT; i++) { + VFIOBAR *bar = &vdev->bars[i]; + if (bar->mmap_mem.ram_addr == section->mr->ram_addr) + return 1; + } + } + } + + return 0; +} + +static bool vfio_listener_skipped_section(VFIOContainer *container, + MemoryRegionSection *section) +{ + if (!memory_region_is_ram(section->mr)) + return 1; + + return memory_region_is_mmap_bars(container, section); } static void vfio_listener_region_add(MemoryListener *listener, @@ -1129,7 +1155,7 @@ static void vfio_listener_region_add(MemoryListener *listener, void *vaddr; int ret; - if (vfio_listener_skipped_section(section)) { + if (vfio_listener_skipped_section(container, section)) { DPRINTF("vfio: SKIPPING region_add %"HWADDR_PRIx" - %"PRIx64"\n", section->offset_within_address_space, section->offset_within_address_space + section->size - 1); @@ -1173,7 +1199,7 @@ static void vfio_listener_region_del(MemoryListener *listener, hwaddr iova, end; int ret; - if (vfio_listener_skipped_section(section)) { + if (vfio_listener_skipped_section(container, section)) { DPRINTF("vfio: SKIPPING region_del %"HWADDR_PRIx" - %"PRIx64"\n", section->offset_within_address_space, section->offset_within_address_space + section->size - 1); ----------------- Thanks -Bharat