On Thu, Dec 16, 2021 at 11:34:25AM -0500, Lance Richardson wrote:
> On Thu, Dec 16, 2021 at 11:20 AM Bruce Richardson
> <bruce.richard...@intel.com> wrote:
> >
> > On Thu, Dec 16, 2021 at 11:04:54AM -0500, Lance Richardson wrote:
> > > Hi Bruce,
> > >
> > > I've been looking into using the IOAT PMD, initially with dma_autotest
> > > and the dpdk-dma example application. These seem to work fine on
> > > SKX with the current main branch, but when I try the same procedure
> > > on ICX (binding all 8 devices to vfio-pci in both cases), I get the 
> > > following
> > > output for each device when probed. Is something different needed when
> > > using IOAT on ICX vs. SKX?
> > >
> > > Thanks,
> > >      Lance
> > >
> > > EAL: Probe PCI driver: dmadev_ioat (8086:b00) device: 0000:80:01.0 
> > > (socket 2)
> > > IOAT: ioat_dmadev_probe(): Init 0000:80:01.0 on NUMA node 2
> > > IOAT: ioat_dmadev_create(): ioat_dmadev_create: Channel count == 255
> > >
> > > IOAT: ioat_dmadev_create(): ioat_dmadev_create: Channel appears locked
> > >
> > > IOAT: ioat_dmadev_create(): ioat_dmadev_create: cannot reset device.
> > > CHANCMD=0xff, CHANSTS=0xffffffffffffffff, CHANERR=0xffffffff
> > >
> > > EAL: Releasing PCI mapped resource for 0000:80:01.0
> > > EAL: Calling pci_unmap_resource for 0000:80:01.0 at 0x4102430000
> > > EAL: Requested device 0000:80:01.0 cannot be used
> >
> > That is strange, the same PMD should work ok on both platforms. This is all
> > on latest branch, right? Let me attempt to reproduce and get back to you.
> 
> Hi Bruce,
> 
> That's correct, I'm using the current tip of the main branch, which
> seems to be identical to 21.11.0.
> >
> > /Bruce
> >
> > PS: Is this a 4-socket system you are running on, since I see "socket 2"
> > being described as the socket number for device 80:01.0?
> >
> It is a two-socket system with sub-NUMA enabled, so it appears as four
> NUMA nodes. I'm only binding the devices on the second socket.
> 

Ok, [not that that should affect anything to do with ioat, AFAIK]

Tried quickly reproducing the issue on some of our systems and failed to do
so. Does this error appear consistently, especially after a reboot?

Thanks,
/Bruce

Reply via email to