Well yeah, in your particular case you're allocating from a heavily
over-contended address space, so much of the time it is genuinely full.
Plus you're primarily churning one or two sizes of IOVA, so there's a
high chance that you will either allocate immediately from the cached
node (after a previous free), or search the whole space and fail. In
case it was missed, searching only some arbitrary subset of the space
before giving up is not a good behaviour for an allocator to have in
general.
So since the retry means that we search through the complete pfn
range most of the time (due to the poor success rate), we should be
able to do a better job of maintaining an accurate max alloc size, by
calculating it from the range search itself rather than relying on
the max-alloc-failed update or resetting it frequently. Hopefully
that would mean that we're smarter about not attempting allocations
which are doomed to fail.
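Roughly along these lines, as a sketch only (the gap bookkeeping is
the illustrative part; max32_alloc_size and the pfn fields follow the
existing iova code, and alignment is ignored for simplicity):

	/*
	 * Illustrative sketch, not the actual change: a failed
	 * allocation has walked every node anyway, so record the
	 * largest free gap seen, instead of resetting the hint on
	 * every free.
	 */
	static void iova_update_max_gap(struct iova_domain *iovad,
					unsigned long limit_pfn)
	{
		unsigned long high_pfn = limit_pfn, max_gap = 0;
		struct rb_node *curr;

		/* skip the anchor node terminating the tree at the top */
		for (curr = rb_prev(&iovad->anchor.node); curr;
		     curr = rb_prev(curr)) {
			struct iova *curr_iova = to_iova(curr);

			/* free gap between this node and the one above */
			if (high_pfn > curr_iova->pfn_hi + 1)
				max_gap = max(max_gap,
					      high_pfn - (curr_iova->pfn_hi + 1));
			high_pfn = min(high_pfn, curr_iova->pfn_lo);
		}
		/* remaining space down to the bottom of the range */
		if (high_pfn > iovad->start_pfn)
			max_gap = max(max_gap, high_pfn - iovad->start_pfn);

		/* matches the existing "size >= max32_alloc_size" bail-out */
		iovad->max32_alloc_size = max_gap + 1;
	}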
So I tried that out, and we seem to be able to claw back an
appreciable amount of performance. Maybe 80% of original, along with
another change, below.
TBH if you really want to make allocation more efficient I think there
are more radical changes that would be worth experimenting with, like
using some form of augmented rbtree to also encode the amount of free
space under each branch, or representing the free space in its own
parallel tree, or whether some other structure entirely might be a
better bet these days.
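For a flavour of the augmented-rbtree idea, something like the
kernel's rbtree_augmented.h machinery could propagate a per-subtree
maximum free gap, letting the search skip whole subtrees that cannot
fit a request. A loose sketch with invented names (gap_to_prev would
need maintaining on insert/delete, which is most of the work):

	#include <linux/rbtree_augmented.h>

	struct iova_gap_node {
		struct rb_node node;
		unsigned long pfn_lo, pfn_hi;
		unsigned long gap_to_prev;	/* free pfns just below pfn_lo */
		unsigned long max_gap;		/* largest gap_to_prev in subtree */
	};

	static unsigned long iova_gap_compute(struct iova_gap_node *n)
	{
		return n->gap_to_prev;
	}

	RB_DECLARE_CALLBACKS_MAX(static, iova_gap_cb, struct iova_gap_node,
				 node, unsigned long, max_gap, iova_gap_compute);

	/* O(log n) search for any gap that can hold @size, instead of a
	 * linear walk over the allocated nodes */
	static struct iova_gap_node *iova_find_gap(struct rb_root *root,
						   unsigned long size)
	{
		struct rb_node *rb = root->rb_node;

		while (rb) {
			struct iova_gap_node *n =
				rb_entry(rb, struct iova_gap_node, node);

			if (n->gap_to_prev >= size)
				return n;
			/* descend towards whichever subtree can still fit */
			if (rb->rb_left &&
			    rb_entry(rb->rb_left, struct iova_gap_node,
				     node)->max_gap >= size)
				rb = rb->rb_left;
			else
				rb = rb->rb_right;
		}
		return NULL;
	}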
And if you just want to make your thing acceptably fast, now I'm going
to say stick a quirk somewhere to force the "forcedac" option on your
platform ;)
Easier said than done :)
But still, I'd like to just be able to cache all IOVA sizes for my
DMA engine, so that we should rarely have to go near the RB tree.
I have put together a series to allow the upper limit of the rcache
range to be increased per domain. Naturally that gives better
performance than we originally had.
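Purely to illustrate the sort of shape such a knob might take, and
not the actual series (all of these names are invented here; today
the ceiling is the compile-time IOVA_RANGE_CACHE_MAX_SIZE):

	/* let a driver raise the rcache size ceiling for its domain so
	 * all of its IOVA sizes stay on the per-CPU cache fast path */
	int iova_domain_set_rcache_max(struct iova_domain *iovad,
				       unsigned int order)
	{
		if (order > IOVA_RANGE_CACHE_HARD_LIMIT)  /* invented cap */
			return -EINVAL;
		/* consulted by the rcache get/put paths */
		iovad->rcache_max_order = order;
		return 0;
	}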
I don't want to prejudice the solution by saying what I think of it now,
so will send it out...
[...]
@@ -219,7 +256,7 @@ static int __alloc_and_insert_iova_range(struct iova_domain *iovad,
 	if (low_pfn == iovad->start_pfn && retry_pfn < limit_pfn) {
 		high_pfn = limit_pfn;
 		low_pfn = retry_pfn;
-		curr = &iovad->anchor.node;
+		curr = iova_find_limit(iovad, limit_pfn);
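Conceptually the new starting-point lookup is a lower-bound search
on the tree; a simplified sketch of what it achieves (not the merged
iova_find_limit() itself, which has further refinements):

	/* locate the lowest node whose range starts at or above
	 * @limit_pfn, falling back to the anchor, so the retry starts
	 * right at the limit instead of walking in from the top */
	static struct rb_node *iova_find_limit_sketch(struct iova_domain *iovad,
						      unsigned long limit_pfn)
	{
		struct rb_node *node = iovad->rbroot.rb_node;
		struct rb_node *best = &iovad->anchor.node;

		while (node) {
			struct iova *iova = to_iova(node);

			if (iova->pfn_lo >= limit_pfn) {
				/* candidate: remember it, try lower */
				best = node;
				node = node->rb_left;
			} else {
				node = node->rb_right;
			}
		}
		return best;
	}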
I see that it is now applied. However, could we alternatively just
add a zero-length 32-bit boundary marker node at the 32-bit pfn
restart point?
That would need special cases all over the place to prevent the marker
getting merged into reservations or hit by lookups, and at worst break
the ordering of the tree if a legitimate node straddles the boundary. I
did consider having the insert/delete routines keep track of yet another
cached node for whatever's currently the first thing above the 32-bit
boundary, but I was worried that might be a bit too invasive.
Yeah, I did think of that. I don't think that it would have too much
overhead.
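As a rough sketch of that bookkeeping (cached32_start is an invented
field; the invasive part would be wiring these into every insert and
delete path):

	/* on insert: a new entry above the boundary may become the
	 * new first-above-boundary node */
	static void iova_insert_update32(struct iova_domain *iovad,
					 struct iova *new)
	{
		if (new->pfn_lo > iovad->dma_32bit_pfn &&
		    new->pfn_lo < to_iova(iovad->cached32_start)->pfn_lo)
			iovad->cached32_start = &new->node;
	}

	/* on delete (called before rb_erase()): the successor, which is
	 * ultimately the anchor, is by construction the next
	 * first-above-boundary node, so no search is needed */
	static void iova_delete_update32(struct iova_domain *iovad,
					 struct iova *free)
	{
		if (&free->node == iovad->cached32_start)
			iovad->cached32_start = rb_next(&free->node);
	}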
FWIW I'm currently planning to come back to this again when I have a bit
more time, since the optimum thing to do (modulo replacing the entire
algorithm...) is actually to make the second part of the search
*upwards* from the cached node to the limit. Furthermore, to revive my
arch/arm conversion I think we're realistically going to need a
compatibility option for bottom-up allocation to avoid too many nasty
surprises, so I'd like to generalise things to tackle both concerns at
once.
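To illustrate the upwards idea (purely a sketch; alignment handling
is elided and iova_search_up and its exact shape are invented): after
failing below the cached node, walk up looking at the gap above each
node until the limit, which also naturally generalises to bottom-up
allocation from start_pfn.

	/* the anchor node (pfn_lo == ~0UL) always sorts last, so it
	 * terminates the walk and rb_next() below never returns NULL */
	static unsigned long iova_search_up(struct iova_domain *iovad,
					    unsigned long size,
					    unsigned long limit_pfn,
					    struct rb_node *curr)
	{
		while (curr != &iovad->anchor.node) {
			struct iova *below = to_iova(curr);
			struct iova *above = to_iova(rb_next(curr));
			unsigned long lo = below->pfn_hi + 1;
			unsigned long hi = min(above->pfn_lo, limit_pfn + 1);

			if (lo + size <= hi)
				return lo;	/* lowest fit above curr */
			if (above->pfn_lo > limit_pfn)
				break;
			curr = rb_next(curr);
		}
		return 0;	/* no space within the limit */
	}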
Thanks,
John