mips sys

Nathan Whitehorn Wed, 20 Jul 2016 08:45:48 -0700


On 07/20/16 04:28, Michal Meloun wrote:

Dne 19.07.2016 v 17:06 Nathan Whitehorn napsal(a):



On 07/19/16 04:13, Michal Meloun wrote:

Dne 19.07.2016 v 2:11 Nathan Whitehorn napsal(a):
Hi Nathan,
I’m afraid that skra is on vacation, for next 2 weeks (at minimum), so
please don’t expect quick response.

Could you please describe what this change is in more detail?

Short description is appended.

It breaks a lot of encapsulations we have worked very hard tomaintain,
moves ARM code into MI parts of the kernel, and the OFW parts violate
IEEE 1275 (the Open Firmware standard). In particular, there is no
guarantee that the interrupts for a newbus (or OF) device areencoded in
a property called "interrupts" (or, indeed, in any property at all) on
that node and there are many, many device trees where that is not the
case (e.g. ones with interrupt maps, as well as Apple hardware). By
putting that knowledge into the OF root bus device, which we havetriedto keep it out of, this enforces a standard that doesn't actuallyexist.

Imho, this patch doesn’t change anything in this area. Only handling of
“interrupts” property is changed, all other cases are unchanged (I
hope).  Also, INTRNG code is currently shared by ARM, ARM64 and MIPS.

But "interrupts" isn't a generic part of OF. This makes it one,incorrectly.

How? Can you be little more exact ?

Because it puts knowledge into ofwbus that expects that children atarbitrary levels of nesting have interrupts defined by an "interrupts"property. You could patch this through on sub-devices, of course, butthat's already done correctly by the existing ofw_bus_map_intr() code ina much more robust way that doesn't involve trying to guess howsub-buses and devices have chosen to allocate resources. Why reinventthe wheel all the way through the bus hierarchy?

I'm hesitant to ask for reversion on something that landed 6 weeks ago
without me noticing, but this needs a lot more architectural workbefore
any parts of the kernel should use it.
-Nathan
I think that it’s too late.  This patch series consist of r301451
(https://reviews.freebsd.org/D6632),
r301453, r301539 and 301543. And new GPIO interrupts are currentlyused
(by in tree drivers or in development trees).
Well, then we need in-place rearchitecture.
The root of problem is that standard way of delivering interrupt
resource to consumer driver doesn’t works in OFW world.

So we have some fact:
- the format of interrupt property is dependent of interrupt
   controller and only interrupt controller can parse it.
- the interrupt property can have more data than just interrupt
   number.
- single interrupt controller must be able to handle multiple
   format of interrupt description.

In pre-patchset era, simplebus enumerates children and attempts to set
memory and interrupts to resource list for them. But the interrupt
controllers are not yet populated so nobody can parse interrupt
property. Moreover, in all cases (parsed or not), we cannot store
complete interrupt description into resource list.
We have done this for many years on PowerPC and sparc64 with delayedconfiguration of interrupts and a look-up table. This handlescomplicated bus configurations better than this code and requires nochanges outside of a few MD files. That is why the (now partiallyduplicated) OFW_BUS_MAP_INTR() function exists. That one also has thebenefit of still working when used in conjunction with, e.g., deviceswith an interrupt-map-mask property.
The patch simply postpones reading of interrupt property to
bus_alloc_resource() (called by consumer driver) time.

Due to this, we can:
- parse  interrupt property. The interrupt driver must exist
   at this time.
This only works with some types of interrupt properties, not all, andbreaks if the interrupt driver hasn't attached yet (which it can't beguaranteed to -- some PPC systems have interrupt drivers that live onthe PCI bus, for example).
How you can allocate (and reserve it in rman) interrupt if is notmapped (so you have not real IRQ number for it). Just for notice -multiple virtual IRQs can be mapped into single real IRQ.

The core idea is to think of the full interrupt specifier -- theinterrupt parent and the full byte string in the device tree -- as theIRQ rather than the interrupt pin on some chip (which is usually, butnot always, the first word in that byte string). The "virtual" IRQnumber is just a compression of that longer piece of data, which usuallycan't fit in an rman resource.

There is no need to actually activate those interrupts before interruptsare enabled, so you can just cache them in a table until the end ofdevice probing, which lets you break circular dependency loops betweenbus and interrupt topology.

So long as you keep track of your mapping and the same (parent,interrupt specifier) parent always gives the same virtual IRQ, there isno way in this system to map multiple active IRQs onto a singleinterrupt pin on the PIC unless your device tree is broken and specifiestwo devices with incompatible modes (active high and edge downgoing orsomething) on the same pin. In this case, nothing you can do will saveyou -- unless your PIC supports interrupts for different kinds ofevents, in which case this system will work perfectly by treating themas different interrupts to the kernel for which the fact they are on thesame pin is immaterial.

I should note that ARM and MIPS have an almost complete implementationof this already: maybe some more intr_machdep.c logic is needed for somecases, but all the rest of the plumbing is there.

- bus_alloc_resource() returns resource, so we can attach parsed
   interrupt data to it. By this, the resource itself can be used
   for delivering configuration data to subsequent call to
   bus_setup_intr() (or to all related  bus_<foo>() calls).
The patched code still accepts delivering of interrupts in resourcelist.
Michal
Given that other code depends on this, fixing it will likely requiresome complex work. I wish I had known about it when it went in.
There are three main problems:
1. It doesn't work for interrupts defined by other mechanisms (e.g.interrupt-map properties)
I aggree, but missing ' interrupt-map' functioanlity is not caused bythis patch.


It is in that the standard system already implements it completely.

2. It partially duplicates the functionality of OFW_BUS_MAP_INTR(),but is both problematically more general and less flexible (it hasrequirements on timing of PIC attachment vs. driver resource allocation)
OFW_BUS_MAP_INTR() can parse only OFW based data and expect thatparsed data are magicaly stored within the call.The new method, bus_map_intr(), can parse data from multiple sources(OFW, UEFI / ACPI, synthetic[gpio device + pin number]). It alsoreturns parsed data back to caller.

That is not true. It works as long as you can specify the interruptstate as a 32-bit key of some kind for the PIC and a string of arbitrarydata, which works with all of those. You could even make the interruptdata be a pointer to exactly the structs you have chosen to define here.

And no, it  doesn't  add any additional timing requirements .

As far as I can tell, it requires the interrupt controller to beattached before you can allocate interrupts. Is that not true?

3. It is not fully transparent to end code. Since it happens atbus_alloc_resource() time, it is complicated to get the appropriatevalues for IRQs constructed by composite techniques (interrupt-mapvs. interrupts vs. hand allocation vs. PCI routing, for example).
I don't see any limitation - can you be more exact? Why is nottransparent? Why is more complicated ?

Suppose that a PCI device adds more IRQs to its resource list ormodifies the ordering. How is whatever bus layer supposed to dosomething sensible at allocation time? It requires that RID numbers meansomething to the parent bus after assignment, which is not guaranteed byanything and is, in more than handful of cases I think of, not true inpractice.

It is much easier to do this correctly at bus attach time when theresource lists are made (how PPC does it).
I don't agree. I don't agree. Making this at bus attach time leadsinto complicated 'virtual' IRQ infrastructure, with many unresolvedcorner cases.

Which unresolved corner cases? This has been working correctly on anumber of platforms in both FreeBSD and Linux for many years.

(1) is easy to fix without API changes, but (2) and (3) arefundamental architectural problems that will bite us immediately downthe road and cause a permanent schism between OF support on differentplatforms.
Let me describe how this is handled on PowerPC (Linux on PPC solvesthe problem the same way). When constructing a resource list, busdrivers that construct them from OF properties callofw_bus_map_intr() with the interrupt parent phandle and the array ofcells corresponding to the interrupt. This thunks immediately tonexus, which connects to code in intr_machdep.c. Code there assigns aunique made-up virtual IRQ and returns it, caching the interruptparent ID and opaque interrupt data (if the same string of datareappears later, you get back the same virtual IRQ of course).
When PIC drivers attach and register themselves with the interrupthandling layer, all the interrupts for that PIC are passed to italong with the virtual IRQ. The PIC driver is supposed to know whatits interrupt data mean, which can be safely guaranteed, and itpresents the assigned virtual IRQ number to the kernel whendispatching interrupts. (IRQs configured after PIC attachment arepassed through immediately).
This accomplishes the following things:
1. Parsing interrupt data is moved to the PIC driver, which is theonly place it can be done safely.
I don't see anything different comparing with INTRNG.

What I am advocating *is* INTRNG, at least as originally conceived andimplemented.

2. There is no ordering requirement on PIC attachment vs. theattachment of anything else.
I think thats is not a true - PIC must exist beforebus_alloc_resource() / bus_setup_intr() is called.

It does not with the IRQ mapping infrastructure. Interrupts are set upat PIC attachment, whenever that occurs.

3. Changes are extremely minimal relative to the "standard" interruptflow: you only have to patch code that is already directly dealingwith OF interrupts.
I don't see anything different comparing with INTRNG.

Again, this was the original INTRNG architecture and is alreadyimplemented. As such, there are *no* changes required on ARM to get it.bus_map_intr() adds a bunch of new code, in parallel with the old codethat also solves the problem, to no purpose.

4. It happens at bus enumeration time, when results can be guaranteedself-consistent.
Where do you see any potential source of inconsistency in INTRNG?

See the example above about modified interrupt lists. There is also noobvious way for a child device to construct an interrupt not assigned toit by the parent device from device tree properties without knowing insome detail what kind of interrupt needs to be built.

5. It combines naturally with ofw_bus_lookup_imap() and friends inthe interrupt-map case (e.g. for PCI).
Again, I don't see anything different. Proper parsing of interruptproperty is not a problem of INTRNG (but must be fixed, of course).

But it is *already* fixed by the standard code that already exists. Youare introducing a less-functional parallel code path here.

I'm not sure what the right path forward is, but this code needs tobe fixed. The PowerPC code is fully MI, and was the template for theoriginal INTRNG, so it shouldn't be too bad to replace.
-Nathan
So, new INTRNG:
- Introduces new more general bus method that can parse interruptconfiguration
 data from any source. Is this step backward?

Yes, since it is more general in some sense, while simultaneouslyhandling fewer cases than code that already exists and is implemented.

- Old INTRNG and PPC code stores unparsed and/or parsed interrupt data in
INTRNG and each consumer must query for them. This data sharing alsocausessignificant locking issues. New INTRNG stores interruptconfiguration data into
  given resource, so each relevant bus method can access it immediately.
  Is this step backward?


Which locking issues? And yes, it is.

- New INTRNG is not OFW centric, it can works with virtually unlimitednumber
   of configuration data sources.  Is this step backward?

Also yes, because it makes the interrupt handles less opaque, whichmakes the infrastructure less flexible.

- New INTRNG correctly uses standard system infrastructure. Real IRQnumber
   is reserved in rman within bus_alloc_resource() call, interrupt HW is
configured (only!) within bus_setup_intr() call. Is this stepbackward?

The "real" IRQ number is not well defined always, so requiring that is astep backwards, yes.

- New INTRNG completely eliminates huge and not always working virtual
  IRQ concept.

When does it "not always work"? It seems to, in fact, always work onmultiple platforms and have for a long time in the face of all kinds oftotally bizarre topologies and system architectures.

Don’t take me bad, I’m open to any change. But no, at this time, I’mnot ready to completely revert someone else's work – although I am aco-author.

I would urge, in the strongest possible terms, that this be backed outfrom stable/11 at least. We can add the new API back for 11.1 if we wantit, but we totally lose the ability to change it later in the stable/11cycle if it stays in now.

-Nathan


Michal


_______________________________________________
svn-src-head@freebsd.org mailing list
https://lists.freebsd.org/mailman/listinfo/svn-src-head
To unsubscribe, send any mail to "svn-src-head-unsubscr...@freebsd.org"

Re: svn commit: r301453 - in head/sys: arm/arm arm64/arm64 dev/fdt dev/gpio dev/iicbus dev/ofw dev/pci dev/vnic kern mips/mips sys

Reply via email to