On Tue, 4 Jul 2023 19:20:00 +0530
Ani Sinha <anisi...@redhat.com> wrote:

> > On 04-Jul-2023, at 6:18 PM, Igor Mammedov <imamm...@redhat.com> wrote:
> > 
> > On Tue, 4 Jul 2023 21:02:09 +0900
> > Akihiko Odaki <akihiko.od...@daynix.com> wrote:
> >   
> >> On 2023/07/04 20:59, Ani Sinha wrote:  
> >>> 
> >>>   
> >>>> On 04-Jul-2023, at 5:24 PM, Akihiko Odaki <akihiko.od...@daynix.com> 
> >>>> wrote:
> >>>> 
> >>>> On 2023/07/04 20:25, Ani Sinha wrote:    
> >>>>> PCI Express ports only have one slot, so PCI Express devices can only be
> >>>>> plugged into slot 0 on a PCIE port. Add a warning to let users know 
> >>>>> when the
> >>>>> invalid configuration is used. We may enforce this more strongly later 
> >>>>> on once
> >>>>> we get more clarity on whether we are introducing a bad regression for 
> >>>>> users
> >>>>> currenly using the wrong configuration.
> >>>>> The change has been tested to not break or alter behaviors of ARI 
> >>>>> capable
> >>>>> devices by instantiating seven vfs on an emulated igb device (the 
> >>>>> maximum
> >>>>> number of vfs the linux igb driver supports). The vfs instantiated 
> >>>>> correctly
> >>>>> and are seen to have non-zero device/slot numbers in the conventional 
> >>>>> PCI BDF
> >>>>> representation.
> >>>>> CC: jus...@redhat.com
> >>>>> CC: imamm...@redhat.com
> >>>>> CC: m...@redhat.com
> >>>>> CC: akihiko.od...@daynix.com
> >>>>> Resolves: https://bugzilla.redhat.com/show_bug.cgi?id=2128929
> >>>>> Signed-off-by: Ani Sinha <anisi...@redhat.com>
> >>>>> Reviewed-by: Julia Suvorova <jus...@redhat.com>
> >>>>> ---
> >>>>>  hw/pci/pci.c | 15 +++++++++++++++
> >>>>>  1 file changed, 15 insertions(+)
> >>>>> diff --git a/hw/pci/pci.c b/hw/pci/pci.c
> >>>>> index e2eb4c3b4a..47517ba3db 100644
> >>>>> --- a/hw/pci/pci.c
> >>>>> +++ b/hw/pci/pci.c
> >>>>> @@ -65,6 +65,7 @@ bool pci_available = true;
> >>>>>  static char *pcibus_get_dev_path(DeviceState *dev);
> >>>>>  static char *pcibus_get_fw_dev_path(DeviceState *dev);
> >>>>>  static void pcibus_reset(BusState *qbus);
> >>>>> +static bool pcie_has_upstream_port(PCIDevice *dev);
> >>>>>    static Property pci_props[] = {
> >>>>>      DEFINE_PROP_PCI_DEVFN("addr", PCIDevice, devfn, -1),
> >>>>> @@ -2121,6 +2122,20 @@ static void pci_qdev_realize(DeviceState *qdev, 
> >>>>> Error **errp)
> >>>>>          }
> >>>>>      }
> >>>>>  +    /*
> >>>>> +     * With SRIOV and ARI, vfs can have non-zero slot in the 
> >>>>> conventional
> >>>>> +     * PCI interpretation as all five bits reserved for slot addresses 
> >>>>> are
> >>>>> +     * also used for function bits for the various vfs. Ignore that 
> >>>>> case.    
> >>>> 
> >>>> You don't have to mention SR/IOV; it affects all ARI-capable devices. A 
> >>>> PF can also have non-zero slot number in the conventional interpretation 
> >>>> so you shouldn't call it vf either.    
> >>> 
> >>> Can you please help write a comment that explains this properly for all 
> >>> cases - ARI/non-ARI, PFs and VFs? Once everyone agrees that its clear and 
> >>> correct, I will re-spin.    
> >> 
> >> Simply, you can say:
> >> With ARI, the slot number field in the conventional PCI interpretation 
> >> can have a non-zero value as the field bits are reused to extend the 
> >> function number bits. Ignore that case.  
> > 
> > mentioning 'conventional PCI interpretation' in comment and then immediately
> > checking 'pci_is_express(pci_dev)' is confusing. Since comment belongs
> > only to PCIE branch it would be better to talk in only about PCIe stuff
> > and referring to relevant portions of spec.  
> 
> Ok so how about this?
> 
>    * With ARI, devices can have non-zero slot in the traditional BDF          
>                                                                         
>      * representation as all five bits reserved for slot addresses are        
>                                                                           
>      * also used for function bits. Ignore that case.  

you still refer to traditional (which I misread as 'conventional'),
steal the linux comment and argument it with ARI if necessary,
something like this (probably needs some more massaging):


         /*                                                                     
  
         * A PCIe Downstream Port normally leads to a Link with only Device     
 
         * 0 on it (PCIe spec r3.1, sec 7.3.1). 
          However PCI_SLOT() is broken if ARI is enabled, hence work around it
          by skipping check if the later cap is present.                        
          
         */
                     
> 
> 
> > (for example see how it's done in kernel code: only_one_child(...)
> > 
> > PS:
> > kernel can be forced  to scan for !0 device numbers, but that's rather
> > a hack, so we shouldn't really care about that.
> >   
> >>   
> >>>   
> >>>>   
> >>>>> +     */
> >>>>> +    if (pci_is_express(pci_dev) &&
> >>>>> +        !pcie_find_capability(pci_dev, PCI_EXT_CAP_ID_ARI) &&
> >>>>> +        pcie_has_upstream_port(pci_dev) &&
> >>>>> +        PCI_SLOT(pci_dev->devfn)) {
> >>>>> +        warn_report("PCI: slot %d is not valid for %s,"
> >>>>> +                    " parent device only allows plugging into slot 0.",
> >>>>> +                    PCI_SLOT(pci_dev->devfn), pci_dev->name);
> >>>>> +    }
> >>>>> +
> >>>>>      if (pci_dev->failover_pair_id) {
> >>>>>          if (!pci_bus_is_express(pci_get_bus(pci_dev))) {
> >>>>>              error_setg(errp, "failover primary device must be on "    
> 


Reply via email to