On 5/27/19 9:02 AM, Richard Biener wrote:
On Fri, May 24, 2019 at 5:50 PM Andrew MacLeod <amacl...@redhat.com> wrote:

The above suggests that iff this is done at all, it is not in GORI, because
those are not conditional stmts or ranges feeding them.  The
machinery doing the use-def walking from stmt context also cannot
come along these, so I have the suspicion that Ranger cannot handle
telling us that for the stmt following the above, for example

   if (_5 != 0)

that _5 is not zero?

Can you clarify?
So there are two aspects to this.  The range-ops code for DIV_EXPR, if
asked for the range of op2, would return ~[0,0] for _5.
But you are also correct that the walk backwards would not find this.

This is similar functionality to how null_derefs are currently handled,
and in fact could probably be done simultaneously using the same code
base.   I didn't bring null derefs up, but this is a good time :-)

There is a separate class used by the gori-cache which tracks the
non-nullness property at the block level.  It has a single API:
non_null_deref_p (name, bb), which determines whether there is a
dereference of NAME anywhere in BB, indicating whether the range has an
implicit ~[0,0] in that basic block or not.
So when we then have

  _1 = *_2; // after this _2 is non-NULL
  _3 = _1 + 1; // _3 is non-NULL
  _4 = *_3;
...

when an on-demand user asks whether _3 is non-NULL at the
point of _4 = *_3, we don't have this information?  Since the
per-BB caching will only say _1 is non-NULL after the BB.
I'm also not sure whether _3 ever gets non-NULL during
non-NULL processing of the block since walking immediate uses
doesn't really help here?
Presumably _3 is globally non-null due to the definition being (pointer + x)  ... i.e., _3 has a global range of ~[0,0]?

So this seems to be a fundamental limitation [to the caching scheme],
not sure if it is bad in practice.

Or am I missing something?


Not missing anything.  The non-nullness property is maintained globally at the basic-block level: both _1 and _3 are flagged as being non-null in the block, so upon exit it's a bit check.  If the global information does not have the non-nullness property, then when a request is made for non-nullness and the def and the use are both within the same block, and the name is flagged as non-null in that block, the request falls back to a quick walk between the def and the use to see whether any non-nullness is introduced in between.  Yes, that makes it a linear walk, but it's infrequent, and often short.  To the best of our knowledge at this point, anyway :-)
Yes, the compile-time complexity claims come from empirical speed timings
and from reasoning about the algorithms, plus the fact that the on-entry
cache prevents multiple passes over the same statements.

We have not done a memory analysis yet, nor anything to see if we
can improve it.
It makes very heavy use of bitmaps, which are typically quite sparse.
The on-entry cache is a vector of pointers to a range, initially null, and
we allocate ranges as needed.  There will be an on-entry range entry
for any ssa-name which has been queried between the query point and the
definition.
So that's similar to LIVE-on-entry thus a SSA name with a range
will have an on-entry range entry on each BB it dominates?
That makes storage requirement quadratic in the worst case.
Yes, assuming a request has been made for all ssa-names everywhere they are live.



  My belief is this is typically not large, but we have done
no analysis as yet.  This does mean that once an ssa-name goes
"out of scope", i.e. has no more uses in the program, it will not be in the
cache for any of those blocks, simply because it's never queried past its
end of life.
Not sure where to put it so let me put it here:  I'd like to see the SSA-based
VRP pass die (in fact I'd like all ssa-propagator-based
passes to die...).
EVRP is the future here, and I can see EVRP making use of the
reverse part of Range-Ops to replace the ad-hoc pattern matching
in register_edge_assert_for (a badly named function that now gets you
back a list of assertions that hold on an edge).  So I hope to see VRP
Does register_edge_assert cache anything?  Or walk an assert chain?  Or is
it basically just a "range-on-this-edge" call, with the result combined
with what is known at that point?
register_edge_assert pattern-matches the stmts feeding a conditional
stmt on an edge.  It doesn't cache anything (so does quite some redundant
work when processing all outgoing edges).  It's not intended to be called
multiple times but rather when the DOM walker enters a block and we
look at the controlling statement to derive conditional ranges.  For
example for

  _1 = (unsigned) _2;
  if (_1 > 1)
   ...

it derives a vector of two relations on the true edge:

  _1 > 1 == true
  (unsigned) _2 > 1 == true

this is actually what we end up building ASSERT_EXPRs for in traditional
VRP, and what EVRP extracts ranges from, valid only in the dominated context.

And that gets mapped to the actual range [2, MAX] when?  That is what range_on_edge will produce for both _1 and _2 on this edge.

If we were to go to the effort of handling symbolics, and the expression was
  if (_1 > _2)
then we'd get ranges _1 = [_2 + 1, MAX] and _2 = [MIN, _1 - 1] from range_on_edge.


Andrew
