SoC Project: Propagating array data dependencies from Tree-SSA to RTL

Alexander Monakov Fri, 23 Mar 2007 10:56:32 -0800

Hello,

I would like to submit the following project for Google Summer of Code:


Propagating array data dependence information from Tree-SSA to RTL

Synopsis:

The RTL array data dependence analyzer was written specifically for the swing modulo scheduling (SMS) implementation in GCC. It is overly conservative, because it uses RTL alias analysis to find intra- and inter-loop memory dependencies. It also assumes that the distance of an inter-loop memory dependence is equal to one.

I propose to improve the quality of data dependence analysis on RTL by propagating information from the Tree-SSA dependence analyzer. The saved information will be used in the construction of the data dependence graph for SMS. It can also be used by other optimizations, e.g. the scheduler.


Rationale:

In GCC, there are two analyses of array data dependencies, which run at the Tree-SSA and RTL levels, respectively. The Tree-SSA data dependence analysis is located in the tree-data-ref.[ch] files and also uses parts of tree-chrec.c. For a given loop, the analysis builds a vector of data references (represented as struct data_reference) and a vector of dependence relations (represented as struct data_dependence_relation). A data reference contains links to a memory reference and its containing statement, the first accessed location, a base object, and other memory attributes. A dependence relation contains the data references it links, its type, distance vector, direction vector, and subscript information.
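
To make the two records concrete, here is a minimal C sketch. It is not the definition from tree-data-ref.h: the names sketch_data_ref, sketch_ddr and sketch_compute_ddr are hypothetical, the loop nest is reduced to a single loop, subscripts are restricted to the form i + offset on a unit-stride induction variable, and arrays with different names are assumed not to overlap.

```c
#include <assert.h>
#include <stdbool.h>
#include <string.h>

/* Hypothetical, simplified data reference: an access base[i + offset]
   inside a single loop over i.  */
struct sketch_data_ref
{
  const char *base;   /* name of the accessed array (the base object) */
  long offset;        /* constant added to the induction variable */
  bool is_read;       /* true for a load, false for a store */
};

enum sketch_dep_kind { SKETCH_NO_DEP, SKETCH_DEP_KNOWN };

/* Hypothetical, simplified dependence relation between two references.  */
struct sketch_ddr
{
  const struct sketch_data_ref *a;
  const struct sketch_data_ref *b;
  enum sketch_dep_kind kind;
  long distance;      /* iterations from A's access to B's access of the
                         same element; meaningful for SKETCH_DEP_KNOWN */
};

/* A touches element i_a + a->offset and B touches i_b + b->offset, so
   they meet when i_b - i_a == a->offset - b->offset.  */
struct sketch_ddr
sketch_compute_ddr (const struct sketch_data_ref *a,
                    const struct sketch_data_ref *b)
{
  struct sketch_ddr ddr = { a, b, SKETCH_NO_DEP, 0 };

  assert (a != NULL && b != NULL);

  /* Two loads never constrain each other.  */
  if (a->is_read && b->is_read)
    return ddr;

  /* Assumption of this sketch: distinct array names never overlap.  */
  if (strcmp (a->base, b->base) != 0)
    return ddr;

  ddr.kind = SKETCH_DEP_KNOWN;
  ddr.distance = a->offset - b->offset;
  return ddr;
}
```

For a loop body such as a[i + 4] = a[i] + 1, the store and the load touch the same element four iterations apart, so the sketch reports a distance of 4.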

The RTL array data dependence analyzer is located in the ddg.[ch] files and was written specifically for the swing modulo scheduling (SMS) implementation in GCC. The analyzer builds a data dependence graph (DDG) for a given basic block. The DDG is represented as a vector of nodes. Each DDG node contains vectors of incoming and outgoing dependence edges, sets of successors and predecessors of the node in the DDG, and the containing instruction. Each DDG edge, analogously to the Tree-SSA analysis, contains the source and destination nodes of the edge, a dependence type, an edge latency, and a distance. Additionally, the edges going to or from the same node form a linked list, analogously to control flow edges. The analyzer uses scheduler dependence analysis (located in sched-deps.c) to build intra-loop dependencies and the data flow engine (located in df-*.c) to build inter-loop dependencies.
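
The shape of such a graph can be sketched in C as follows. This is an illustration only, not the declarations from ddg.h: all names are hypothetical, the instruction is replaced by an integer id, and the successor and predecessor sets are left out so that only the per-node linked lists of edges remain.

```c
#include <assert.h>
#include <stddef.h>

enum sketch_dep_type { SKETCH_TRUE_DEP, SKETCH_ANTI_DEP, SKETCH_OUTPUT_DEP };

struct sketch_ddg_node;

/* Hypothetical DDG edge: endpoints, type, latency and distance, plus
   links that chain it with other edges of the same node.  */
struct sketch_ddg_edge
{
  struct sketch_ddg_node *src;
  struct sketch_ddg_node *dest;
  enum sketch_dep_type type;
  int latency;
  int distance;                      /* 0 for intra-loop dependencies */
  struct sketch_ddg_edge *next_in;   /* next edge entering DEST */
  struct sketch_ddg_edge *next_out;  /* next edge leaving SRC */
};

/* Hypothetical DDG node.  */
struct sketch_ddg_node
{
  int insn_uid;                      /* stands in for the instruction */
  struct sketch_ddg_edge *in;        /* head of the incoming-edge list */
  struct sketch_ddg_edge *out;       /* head of the outgoing-edge list */
};

/* Push E onto the outgoing list of its source and the incoming list
   of its destination.  */
void
sketch_add_edge (struct sketch_ddg_edge *e)
{
  assert (e != NULL && e->src != NULL && e->dest != NULL);
  e->next_out = e->src->out;
  e->src->out = e;
  e->next_in = e->dest->in;
  e->dest->in = e;
}

/* Count the edges entering N.  */
int
sketch_count_in (const struct sketch_ddg_node *n)
{
  int count = 0;
  for (const struct sketch_ddg_edge *e = n->in; e != NULL; e = e->next_in)
    count++;
  return count;
}

/* Count the edges leaving N.  */
int
sketch_count_out (const struct sketch_ddg_node *n)
{
  int count = 0;
  for (const struct sketch_ddg_edge *e = n->out; e != NULL; e = e->next_out)
    count++;
  return count;
}
```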


The RTL analyzer has the following deficiencies:

* The DDG is built only for single-basic-block loops. This is because the current SMS implementation only supports such loops. This limitation not only puts an additional constraint on SMS, but also prevents using the dependence information in other passes.

* The distance of inter-loop dependencies is not calculated and is conservatively set to one. This limits the SMS implementation in interleaving instructions from successive iterations.

* Intra-loop dependencies are calculated using RTL alias analysis, which is weaker than the Tree-SSA analysis (e.g. it is not able to disambiguate array references on architectures that lack a base+offset addressing mode).
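
The cost of the conservative distance can be illustrated with the recurrence bound commonly used in modulo scheduling: a dependence cycle with total latency L and total distance D forces the initiation interval to be at least ceil(L / D). The C sketch below computes that bound. The function name and the example numbers are assumptions made for illustration and are not taken from ddg.c.

```c
#include <assert.h>

/* Lower bound on the initiation interval imposed by one dependence
   cycle: ceil (cycle_latency / cycle_distance).  */
int
sketch_rec_mii (int cycle_latency, int cycle_distance)
{
  assert (cycle_latency >= 0 && cycle_distance > 0);
  return (cycle_latency + cycle_distance - 1) / cycle_distance;
}
```

With a cycle latency of 8, a distance conservatively set to 1 gives a bound of 8 cycles per iteration, while a true distance of 4 lowers the bound to 2, leaving room to overlap more iterations.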

I propose to improve the quality of data dependence analysis on RTL by propagating information from the Tree-SSA dependence analyzer. The project will consist of the following steps:

* Export the Tree-SSA data dependence graph as a global data structure (or a field in struct function). The information will be collected before ivopts, because ivopts turns array references into pointer references, which hampers data dependence analysis.

* Create the mapping between RTX MEMs and the original trees. A part of the existing patch [1] for propagating alias information to RTL could be used. The patch saves links to the original trees in the MEM's attributes, analogously to MEM_EXPRs.

* Implement a verifier for the consistency of the saved information, to check that it stays intact throughout the RTL pipeline.

* Use the saved information when constructing the data dependence graph in ddg.c. When two memory references are found to be dependent, a check will be performed to see whether the MEMs contain the original trees and whether the trees are array references. If the trees are indeed ARRAY_REFs and the information about their dependence relation can be found in the exported graph, then it is possible either to avoid creating a spurious DDG edge, in case the data references are independent, or to assign the correct distance value to the edge, in case this information is present in the exported graph.


* Provide new test cases and test the patch for correctness and speedups.
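
The lookup described in the ddg.c step above could look roughly like the following C sketch. Everything in it is an assumption made for illustration: original trees are replaced by integer ids, the exported graph is a flat table of ordered pairs, and none of the names exist in GCC.

```c
#include <assert.h>
#include <stddef.h>

enum sketch_dep_kind { SKETCH_INDEPENDENT, SKETCH_DISTANCE_KNOWN };

/* Hypothetical entry of the exported Tree-SSA dependence graph.  */
struct sketch_exported_ddr
{
  int tree_a;                 /* id of the first original tree */
  int tree_b;                 /* id of the second original tree */
  enum sketch_dep_kind kind;
  int distance;               /* valid for SKETCH_DISTANCE_KNOWN */
};

/* Hypothetical view of a MEM with its saved attributes.  */
struct sketch_mem
{
  int orig_tree;              /* id of the saved tree, or -1 if none */
  int is_array_ref;           /* nonzero if the tree is an array reference */
};

enum sketch_action
{
  SKETCH_KEEP_CONSERVATIVE,   /* no better information: distance 1 */
  SKETCH_DROP_EDGE,           /* references proven independent */
  SKETCH_SET_DISTANCE         /* use the exported distance */
};

/* Decide what to do with a DDG edge between M1 and M2 that the RTL
   analysis considers dependent.  *DISTANCE is set to the conservative
   value 1 unless the table supplies a better one.  */
enum sketch_action
sketch_refine_edge (const struct sketch_mem *m1, const struct sketch_mem *m2,
                    const struct sketch_exported_ddr *table, size_t n,
                    int *distance)
{
  assert (m1 != NULL && m2 != NULL && distance != NULL);
  *distance = 1;

  /* Both MEMs must carry an original tree that is an array reference.  */
  if (m1->orig_tree < 0 || m2->orig_tree < 0
      || !m1->is_array_ref || !m2->is_array_ref)
    return SKETCH_KEEP_CONSERVATIVE;

  for (size_t i = 0; i < n; i++)
    if (table[i].tree_a == m1->orig_tree && table[i].tree_b == m2->orig_tree)
      {
        if (table[i].kind == SKETCH_INDEPENDENT)
          return SKETCH_DROP_EDGE;
        *distance = table[i].distance;
        return SKETCH_SET_DISTANCE;
      }

  /* The pair was not analyzed at the tree level.  */
  return SKETCH_KEEP_CONSERVATIVE;
}
```

Falling back to the conservative answer whenever any piece of information is missing keeps the sketch safe: the exported graph can only remove or refine edges that it has positively analyzed.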

I would be pleased to have Ayal Zaks as my mentor, because the proposed improvement is primarily aimed at improving modulo scheduling. In case this is not possible, I will seek guidance from Maxim Kuvyrkov.


Please feel free to share your thoughts and suggestions.

[1] Alias export patch by Dmitry Melnik: http://gcc.gnu.org/ml/gcc-patches/2005-11/msg01518.html


--
Alexander Monakov
