On-Demand range technology [6/5] - Integration

Andrew MacLeod Wed, 05 Jun 2019 13:56:22 -0700

After the various discussions, I've evaluated how I think everything canfit together, so this is my proposal for integration with trunk.

The complete Ranger prototype consists of 5 major components, one ofwhich is missing/un-implemented as yet :-)

1 - irange - This is the non-symbolic range implementation we'vebeen using which represents ranges as groups of ordered sub-ranges.2 - range-ops - This is the component which extracts ranges fromstatements, and so performs the functionality of extract_range_from_*, except it operates on the irange API and also allows for solving ofoperands other than just the LHS of the expression.3 - GORI-computes - This is the component which utilizes range-ops tocompute a range on an outgoing edge for any ssa-name in the definitionchain of the branch

      a_3 = b_6 * 2
      d_8 = a_3 - 20
     if (d_8 < 30)
   the GORI-compute component can generate ranges for d_8, a_3 and b_6.

4 - GORI-Cache and the Ranger. Working together, this provides theon-demand range functionality to resolve ranges5 - relational/equivalency tracker - This is the sketched out butunimplemented bit which tracks the symbolic relationships, and removethe need for ranges to support symbolics. ( <,<=, >, >=, ==, != and none).

The consensus appears to be that range-ops and gori-computes are goodcandidates to replace aspects of vr-values and assert generation.

A)

Until I get to (5) (relational tracker), using (1) (irange) is anon-starter since it doesn't handle symbolics.

To eliminate the range issue from the equation, Aldy is currentlyworking on unifying the irange and value_range APIs. This will allowthe rest of the ranger code base to use the value_range implementationtransparently. We can talk about irange or some alternateimplementation of ranges at some later point, but there'll be an APIthat works for all clients.

The existing value_range API gets a few tweaks/cleanups, but mostlythere is an additional set of calls to query sub-ranges which the rangerand range-ops require. These routines basically translate the variousvalue ranges formats into discrete sub-ranges. Thru these rotuines,ANTI_RANGE will appear as 2 sub-ranges, VARYING as a [MIN, MAX] range,and UNDEFINED as an empty range []. These additions should allowvalue_range to function as the range implementation for both the rangerand VRP.

I suspect he will have patches coming shortly that will help to unifythe 2 range implementations, we can discuss details over those patches..

B)

A Unified range API then allows us to work on integrating the range-opsand GORI-computes component into the code base. Range ops wouldreplace the various extract_range_from_*_ routines in vr_values forstatement level ranges. GORI-computes would then replace the assertbuilding code for calculating outgoing ranges on edges. In theory EVRPthen simply calls range_on_edge() from gori_compute instead ofregister_edge_assert() .

The range ops code is designed to perform all evaluations assuming anarbitrary number of sub-ranges. Aldy spent a lot of time last yearunifying the VRP code and the range-ops code to get the identicalresults, and they frequently share a common base. He has gone thruexcruciating care to ensure the calculations are identical and verifiesit by calculating everything using both code bases, comparing them, andaborting if the results ever get diverge.

We will need to adjust the range-ops code to work with symbolics incertain place. This means PLUS, MINUS, all the relations (<,>, etc), andcopy. Probably something else as it is encountered. This is un-sized asyet, but I'm hoping won't be too bad assuming we can utilize some of theexisting code for those bits.. More details when we actually startdoing this and find the lurking dragons.

we'll worry about bitmasks and equivalencies when we get closer tofunctioning, but I don't foresee too much problem since value_range_baseis still being used.

C) That will keep us busy for a while, and should result in the coreintegration. Meanwhile, we'll try to figure out the relational codedesign. I'll go back to my original design, adjust that, then we canfigure out how best to proceed to address the various needs.

D) Finally, the GORI-cache and on-demand ranger are blocked until theabove work is finished.

One additional thing I would like to do eventually is tweak EVRPslightly to align with the ranger model.

The ranger API is basically just 5 entry points which the ranger uses todetermine ranges.

    range_of_expr  - range of a use on a statement

range_of_stmt - range of the result of the statement, (calculatedby range-ops).

    range_on_edge - range on an edge - (provided by gori_computes)
    range_on_entry - range on entry to a block (provided by gori-cache)
    range_on_exit - range after the last statement in a block

Abstracted and simplified, I believe EVRP functions more or less likethis? :

- EVRP starts a block with it's "current range" vector initialized tothe range on entry values. (provided as you step into the block),- It then walks the IL for the block, evaluating each statement,possibly simplifying, and updating this current range vector.- when it reaches the bottom of the block, it calculates outgoing rangeson each edge and updates those to provide a current range at the starteach successor block.

If one considers EVRP without making any IL changes, it can be viewed asanother range calculator. If that is mapped that to the ranger API:

range_of_expr - This is always the current value in thecurrent-range vector as you walk the IL. range_of_stmt - range of the result of the statement (to beprovided by range_ops). so this is identical range_on_edge - this is also identical, to be provided bygori_computes. range_on_entry - Provided at the start of block processing, neverneeded again since it is updated on the fly. range_on_exit - range after the last statement in a block. whenEVRP reaches the end of the block, current value is this as well.

EVRP and the ranger have now become alternate implementations of thesame model, the ranger is on-demand and EVRP just requires processingeverything linearly through a basic block.

We can tweak the interface's to align a bit better, and then adjust thesimplification code so it works with that API. That should make EVRP andthe on-demand Ranger interchangeable during a forward walk.

This would allow a much more accurate comparison of how the 2implementations behave and what kinds of results they can get.

It also opens up the unexplored possibility of some sort of hybridversion which can resolve some of the drawbacks to each approach.


===================

   Short summary:

a) we'll unify value_range and the irange API, confirm there are no newbugs nor performance issue. This would considered complete when theranger is able to fully run using value_range instead of irange.

b) we'll then port the required parts of range-ops and gori-computes tohandle basic symbolics in value_range so that the VRPs' will see thesame results with range-ops as it does with the extract and assertcode. Then we will integrate it with vr_values and EVRP. This would beconsidered complete when results are identical and there is nounacceptable performance impact.


c) meanwhile we'll work on the relational tracking mechanism.

d) Then we can revisit the on-demand engine vs the EVRP walk mechanismand any other things dreamed up between now and then.



Does this seem reasonable?

Andrew

On-Demand range technology [6/5] - Integration

Reply via email to