Re: complete_unrolli / complete_unroll

Albert Cohen Wed, 19 Aug 2009 18:20:11 -0700

Richard Guenther wrote:

gfortran.dg/reassoc_4.f, the hottest loop from calculix.


Thanks.

This example is slightly different. Graphite should be able to handle itwith loop fusion rather than pre-unrolling + cse. But I agree that theunrolling + cse approach also makes sense (and does not depend on thesame legality constraints as loop fusion).

This makes me think of a simple, general criterion to detect cases wherepre-unrolling of inner loop helps further cse and loop optimizations.The idea is to unroll only when we can see some evidence of arrayreferences that are not presently loop-invariant that would be made(outer)-loop invariant via full unrolling of some inner loop.This can be implemented by complementing the current heuristic (or itscomplementary enhancements by Honza) with an additional condition, onlyenabled when running it with the "i" (inner) flag (which should probablybe renamed if we do implement this...).

The simplest, weakest condition I can think of would be to traverse allarray references in the region enclosed by the loop-to-be-unrolled,compute the SCEV for each one, instanciate it in the loop's context, andchecking if it only depends on the loop counter, as well as outer loopcounters or parameters.

This condition would a priori pass on the tramp3d and reassoc_4 cases.Yet it is probably too weak and will still pass on many codes whereunrolling would probably not help at all... and probably harm.If this is the case, we should consider multiple loops to be unrolled,and the combined effect of unrolling ALL of these, resulting in completeinstanciation of the array subscripts with constants. This is a veryspecial case, again satisfied by our two motivating examples. Maybe itwill be too specific and we'll have performance regressions... Itremained to be investigated if we have to go through a strictercondition than the first, weak one I proposed.


If this is not clear, I can write some pseudo-code to clarify :-).

Albert

Re: complete_unrolli / complete_unroll

Reply via email to