- compile with the loop unrolled 1x, 2x, 4x, 8x, 16x, and 32x, and
measure the time the benchmark takes for each factor
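A minimal sketch of that experiment in C, assuming a toy array-sum kernel unrolled by hand. The kernel, the function names (sum_unroll1, sum_unroll2, sum_unroll4, time_kernel), and the use of clock() are illustrative assumptions, not code from the thread. Only factors 1, 2, and 4 are shown; higher factors would follow the same pattern.

```c
#include <stddef.h>
#include <time.h>

/* Hypothetical kernel, unroll factor 1: one element per iteration. */
long sum_unroll1(const int *a, size_t n)
{
    long s = 0;
    for (size_t i = 0; i < n; i++)
        s += a[i];
    return s;
}

/* Unroll factor 2: two elements per iteration, then a remainder loop. */
long sum_unroll2(const int *a, size_t n)
{
    long s = 0;
    size_t i = 0;
    for (; i + 2 <= n; i += 2) {
        s += a[i];
        s += a[i + 1];
    }
    for (; i < n; i++)      /* leftover element when n is odd */
        s += a[i];
    return s;
}

/* Unroll factor 4: four elements per iteration, then a remainder loop. */
long sum_unroll4(const int *a, size_t n)
{
    long s = 0;
    size_t i = 0;
    for (; i + 4 <= n; i += 4) {
        s += a[i];
        s += a[i + 1];
        s += a[i + 2];
        s += a[i + 3];
    }
    for (; i < n; i++)      /* up to three leftover elements */
        s += a[i];
    return s;
}

typedef long (*kernel_fn)(const int *, size_t);

/* Run `fn` over the array `reps` times and return the CPU time in seconds
   as reported by clock(). The results go into a volatile accumulator so
   that the calls are not optimized away. */
double time_kernel(kernel_fn fn, const int *a, size_t n, int reps)
{
    volatile long sink = 0;
    clock_t start = clock();
    for (int r = 0; r < reps; r++)
        sink += fn(a, n);
    clock_t end = clock();
    (void)sink;
    return (double)(end - start) / CLOCKS_PER_SEC;
}
```

Calling time_kernel once per variant on the same input gives one timing per unroll factor. Note that the compiler may unroll or vectorize these loops again on its own at higher optimization levels, so the measured differences depend on the flags used.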
The optimal unrolling factor may not be a power of two, depending on
icache size (11 times the loop body size?), iteration count (13*n for
some unknown n?), and whether there are actions perf
Hello,
> Of course, instead of clock(), I'd like to use a non-intrusive
> mechanism. However, my research on this topic didn't lead to anything
> but perfsuite, which doesn't work very well for me (should it?).
>
> So here are the questions
>
> - how can I actually insert the code (I need to do
Thank you, everybody, for the helpful input.
I ran into a small problem. I have some features ready (well, not
everything I wanted, but still good enough). I'm now trying to
instrument the resulting code to time each loop independently.
Essentially, what I want is to programmatically modify loops f
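One way to picture timing each loop independently is a pair of calls placed around every loop, accumulating CPU time per loop id. The sketch below is only an assumption about what such instrumentation could look like at the source level; the names loop_timer_begin, loop_timer_end, loop_timer_seconds, MAX_LOOPS, and instrumented_sum are hypothetical and do not come from GCC or from the thread.

```c
#include <stddef.h>
#include <time.h>

#define MAX_LOOPS 64   /* hypothetical upper bound on instrumented loops */

static clock_t loop_start[MAX_LOOPS];   /* start stamp of the current run */
static clock_t loop_total[MAX_LOOPS];   /* accumulated ticks per loop id  */

/* Record the start time for the loop with the given id. */
void loop_timer_begin(int id)
{
    loop_start[id] = clock();
}

/* Add the time elapsed since loop_timer_begin(id) to that loop's total. */
void loop_timer_end(int id)
{
    loop_total[id] += clock() - loop_start[id];
}

/* Accumulated CPU time, in seconds, spent in the loop with the given id. */
double loop_timer_seconds(int id)
{
    return (double)loop_total[id] / CLOCKS_PER_SEC;
}

/* Example of an instrumented loop: the timer calls bracket loop 0. */
long instrumented_sum(const int *a, size_t n)
{
    long s = 0;
    loop_timer_begin(0);
    for (size_t i = 0; i < n; i++)
        s += a[i];
    loop_timer_end(0);
    return s;
}
```

Because clock() has coarse resolution, short loops would need to run many times before their totals become meaningful, and the timer calls themselves add overhead to the measurement.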
==
Grigori Fursin, PhD
Research Fellow, INRIA Futurs, France
http://fursin.net/research
Re: machine learning for loop unrolling
From: Kenneth Hoste
To: stefan dot ciobaca+gcc at gmail dot com
Cc: GCC
Date: Fri, 8 Jun 2007 21:04:05 +0200
Subject: Re: machine learning for loop unrol
Zdenek Dvorak wrote:
...
The number of parallel "computations" in loop.
The estimated latency of the critical path of loop.
The estimated cycle length of loop body.
The max. dependence height of computations.
The max. height of memory dependencies of computations.
The max. height of control dep
Hello,
> The number of floating point ops. in loop body.
> The number of memory ops. in loop body.
> The number of operands in loop body.
> The number of implicit instructions in loop body.
> The number of unique predicates in loop body.
> The number of indirect references in loop body.
> The numb
On 08 Jun 2007, at 16:31, Stefan Ciobaca wrote:
Hello everyone,
For my bachelor thesis I'm modifying gcc to use machine learning to
predict the optimal unroll factor for different loops (inspired by
this paper: http://www.lcs.mit.edu/publications/pubs/pdf/MIT-LCS-TR-938.pdf).
Interesting
Hello everyone,
For my bachelor thesis I'm modifying gcc to use machine learning to
predict the optimal unroll factor for different loops (inspired by
this paper: http://www.lcs.mit.edu/publications/pubs/pdf/MIT-LCS-TR-938.pdf).
I've compiled gcc 4.1.2 on my machine and I've located the
loop-u