https://llvm.org/bugs/show_bug.cgi?id=26106
James Molloy <james.mol...@arm.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |FIXED

--- Comment #5 from James Molloy <james.mol...@arm.com> ---
Hi,

I can't comment on microarchitectural details of ARM CPUs. However the Cortex-A5 is a part that strongly favours efficiency over performance. It would be interesting if you ran this on other ARM parts (for example a Cortex-A15).

It is obvious from the numbers that the performance hits a bottleneck/is saturated after vectorization. Unrolling doesn't give much extra gain. This could be due to a number of factors that I'm not keen on speculating on! This is just the theoretical peak performance of that system (because it's not just the CPU, it's the memory bandwidth too) on that benchmark.

I'm going to mark this resolved, but feel free to reopen it.

James

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
llvm-bugs mailing list
llvm-bugs@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-bugs