Now that stage 1 has reopened, I’d like to revisit the discussion of the
technology and the experience we have gained from the Ranger project I
brought up last year: https://gcc.gnu.org/ml/gcc/2018-05/msg00288.html .
(The original wiki pages are now out of date; I will work on updating
them soon.)
The Ranger is designed to evaluate ranges on demand rather than through
a top-down approach. This means you can ask for a range from anywhere,
and it walks back through the IL, satisfying any preconditions and doing
the required calculations. It uses a cache to avoid redoing work. If
ranges are processed in forward dominator order, it’s not much different
from what we do today. Due to its on-demand nature, the order in which
you process things has minimal impact on the overall time: you can do it
in reverse dominator order and get similar times.
It requires no outside preconditions (such as dominators) to work, and
has a very simple API: simply query the range of an ssa_name at any
point in the IL, and all the details are taken care of.
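For example, a query from a client pass might look something like this
(a sketch only; the header, class, and method names here are
illustrative and may not match the branch exactly):

  // Sketch only: "gimple-range.h", gimple_ranger and range_of_expr are
  // illustrative names, not necessarily the exact branch API.
  #include "gimple-range.h"

  static void
  query_example (gimple *stmt, tree name)
  {
    gimple_ranger ranger;   // no dominators or other setup required
    value_range r;

    // Ask for the range of 'name' as it reaches 'stmt'.  The ranger
    // walks back through the IL on demand and caches what it computes.
    if (ranger.range_of_expr (r, name, stmt) && !r.varying_p ())
      {
        // 'r' now holds the range and can be used to simplify 'stmt',
        // issue a warning, fold a condition, and so on.
      }
  }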
We have spent much of the past 6 months refining the prototype (branch
“ssa-range”) and adjusting it to share as much code with VRP as
possible. The two now use a common code base for extracting ranges from
statements, as well as for simplifying statements.
The Ranger deals with just ranges. The other aspects of VRP are
intended to be follow-on work that integrates tightly with it, but they
are also independent and would be available for other passes to use.
These include:
- Equivalency tracking
- Relational processing
- Bitmask tracking
We have implemented a VRP pass that duplicates the functionality of EVRP
(other than the bits mentioned above), and have converted a few other
passes to use the interface. I do not anticipate those missing bits
having a significant impact on the results.
The prototype branch is quite stable and can successfully build and test
an entire Fedora distribution (9174 packages). There is an issue with
switches, which I will discuss later, whereby the constant range of a
switch edge is not readily available and is exponentially expensive to
calculate. We have a design to address that problem, and in the common
case we are about 20% faster than EVRP.
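To illustrate what “the constant range of a switch edge” refers to, here
is a small source-level example (the ranges in the comments are what a
query on each outgoing edge has to produce):

  // Illustration only: the range of 'code' implied by each outgoing
  // edge of the switch.
  int
  classify (int code)
  {
    switch (code)
      {
      case 3:
      case 4:
      case 5:
        return 1;   // on this edge 'code' is known to be [3, 5]
      case 10:
        return 2;   // on this edge 'code' is known to be [10, 10]
      default:
        return 0;   // on this edge 'code' is anything but those values
      }
  }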
When used in passes that only require ranges for a small number of
ssa-names, we see significant improvements. The sprintf warning pass,
for instance, no longer needs to compute dominators or follow the walk
order they force on it. We see a 95% speedup (yes, 1/20th of the
overall time!). This is primarily because there is no additional
overhead and only the few things that are actually needed get
calculated; see the sketch below. The walloca and wrestrict passes
follow a similar model, but as they have not been converted to use EVRP
ranges yet, we don’t see similar speedups there.
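As a made-up illustration of that usage pattern: the sprintf warning
pass only cares about the range of one argument at one particular call,
so an on-demand query can answer exactly that question without
processing the rest of the function:

  #include <stdio.h>

  // Hypothetical input: the warning pass only needs the range of 'n'
  // at this single call site; with an on-demand query there is no need
  // to compute dominators or ranges for anything else in the function.
  void
  fill (char buf[4], int n)
  {
    if (n >= 0 && n < 100)
      sprintf (buf, "%d", n);   // 'n' is in [0, 99], so "%d" fits in buf
  }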
That is the executive summary. I will go into more detail on each of the
major items mentioned above in follow-on notes, so that comments and
discussion can focus on one thing at a time.
We think this approach is very solid and has many significant benefits
for GCC. We’d like to address any concerns you may have, and work toward
integrating this model into the code base during this stage 1.
Comments and feedback always welcome!
Thanks
Andrew