On 12/9/19 1:14 PM, Martin Liška wrote:
Hello.
Based on a presentation that Sriraman Tallam gave at an LLVM conference:
https://www.youtube.com/watch?v=DySuXFGmB40
I made a heatmap based on executed instruction addresses. I used
$ perf record -F max -- ./cc1plus -fpreprocessed /home/marxin/Programming/tramp3d/tramp3d-v4.ii
and
$ perf script -F time,ip,dso
I'm sending a link to the heatmaps for my system GCC 9 (PGO+lean LTO
bootstrap) and for GCC 10 before and after my reorder patch (also PGO+lean
LTO bootstrap).
One can see quite significant clustering starting from 5s till the end of
compilation.
Link: https://drive.google.com/open?id=1M0YlxvQPyiVguy5VWRC8dG52UArwAuKS
Martin
For completeness, the heatmap was generated with the following script:
https://github.com/marxin/script-misc/blob/master/binary-heatmap.py
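
For readers who do not want to open the link, here is a minimal sketch of the
general idea. It is not the linked script. It assumes each line of
`perf script -F time,ip,dso` output looks like `<time>: <hex ip> (<dso>)`,
and the names `parse_samples`, `bin_samples` and the `cc1plus` suffix filter
are hypothetical choices made for this illustration.

```python
import re
from collections import Counter

# Assumed line shape: "  1234.567890:   a1b2c3 (/path/to/cc1plus)"
SAMPLE_RE = re.compile(r"^\s*(\d+\.\d+):\s+([0-9a-fA-F]+)\s+\((.*)\)\s*$")


def parse_samples(lines, dso_suffix="cc1plus"):
    """Return (time_in_seconds, address) pairs for samples in the given DSO."""
    samples = []
    for line in lines:
        m = SAMPLE_RE.match(line)
        if m is None:
            continue  # skip lines that do not match the assumed shape
        time_s, ip_hex, dso = m.groups()
        if not dso.endswith(dso_suffix):
            continue  # ignore samples from the kernel and shared libraries
        samples.append((float(time_s), int(ip_hex, 16)))
    return samples


def bin_samples(samples, time_bins=100, addr_bins=100):
    """Count samples per (time bin, address bin) cell of the heatmap."""
    if not samples:
        return Counter()
    t0 = min(t for t, _ in samples)
    t1 = max(t for t, _ in samples)
    a0 = min(a for _, a in samples)
    a1 = max(a for _, a in samples)
    t_span = (t1 - t0) or 1.0  # avoid division by zero for a single sample
    a_span = (a1 - a0) or 1
    grid = Counter()
    for t, a in samples:
        x = min(int((t - t0) / t_span * time_bins), time_bins - 1)
        y = min((a - a0) * addr_bins // a_span, addr_bins - 1)
        grid[(x, y)] += 1
    return grid
```

The resulting counter maps (time bin, address bin) to a sample count, so it
can be handed to any plotting tool. Clustering would then show up as the
samples concentrating in fewer address bins over time.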
Martin