Hello.

Based on a presentation that Sriraman Tallam gave at an LLVM conference:
https://www.youtube.com/watch?v=DySuXFGmB40

I made a heatmap based on executed instruction addresses. I used:

$ perf record -F max -- ./cc1plus -fpreprocessed /home/marxin/Programming/tramp3d/tramp3d-v4.ii

and

$ perf script -F time,ip,dso

I'm sending a link to the results for my system GCC 9 (PGO+lean LTO bootstrap) and for GCC 10 before and after my reorder patch (also PGO+lean LTO bootstrap). One can see quite significant clustering starting at 5s and lasting until the end of compilation.

Link: https://drive.google.com/open?id=1M0YlxvQPyiVguy5VWRC8dG52UArwAuKS

Martin
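
P.S. A minimal sketch of how such a heatmap could be binned from the `perf script -F time,ip,dso` output. This is an illustration only, not necessarily the script behind the linked images. It assumes each sample line looks like "  1234.567890:  55d1c8a3b2f0 (/path/to/cc1plus)"; the exact layout can differ between perf versions, so the regular expression may need adjusting. The names parse_samples and build_heatmap are made up for this example.

```python
import re

# Assumed line shape for `perf script -F time,ip,dso` (may vary by perf version):
#   "  1234.567890:     55d1c8a3b2f0 (/path/to/cc1plus)"
SAMPLE_RE = re.compile(r"^\s*(\d+\.\d+):\s+([0-9a-fA-F]+)\s+\((.*)\)\s*$")


def parse_samples(lines, dso_substring="cc1plus"):
    """Return (time_in_seconds, instruction_address) pairs for one DSO.

    Lines that do not match the assumed shape, or that belong to another
    DSO (libc, kernel, ...), are skipped.
    """
    samples = []
    for line in lines:
        match = SAMPLE_RE.match(line)
        if match is None:
            continue
        if dso_substring not in match.group(3):
            continue
        samples.append((float(match.group(1)), int(match.group(2), 16)))
    return samples


def build_heatmap(samples, time_bins=100, addr_bins=100):
    """Count samples in a grid: rows are address buckets, columns are time buckets."""
    grid = [[0] * time_bins for _ in range(addr_bins)]
    if not samples:
        return grid

    t_min = min(t for t, _ in samples)
    t_max = max(t for t, _ in samples)
    a_min = min(a for _, a in samples)
    a_max = max(a for _, a in samples)
    # Avoid division by zero when all samples share one time or one address.
    t_span = (t_max - t_min) or 1.0
    a_span = (a_max - a_min) or 1

    for t, a in samples:
        # Clamp so the maximum value lands in the last bucket.
        col = min(int((t - t_min) / t_span * time_bins), time_bins - 1)
        row = min((a - a_min) * addr_bins // a_span, addr_bins - 1)
        grid[row][col] += 1
    return grid
```

To use it, save the perf script output to a file, pass the open file to parse_samples, and feed the resulting grid to any plotting tool. Clustering shows up as a few dense horizontal bands instead of samples spread over the whole address range.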