https://llvm.org/bugs/show_bug.cgi?id=26106

James Molloy <james.mol...@arm.com> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
             Status|NEW                         |RESOLVED
         Resolution|---                         |FIXED

--- Comment #5 from James Molloy <james.mol...@arm.com> ---
Hi,

I can't comment on the microarchitectural details of ARM CPUs. However, the
Cortex-A5 is a part that strongly favours efficiency over performance. It would
be interesting to run this on other ARM parts (for example a Cortex-A15).

It is clear from the numbers that performance hits a bottleneck (is saturated)
after vectorization, so unrolling doesn't give much extra gain. This could be
due to a number of factors that I'm not keen on speculating on! It is most
likely just the peak performance that system can reach on that benchmark
(because it's not just the CPU, it's the memory bandwidth too).
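
As a rough illustration of that saturation argument, here is a minimal
roofline-style sketch. Every number in it is hypothetical (chosen only to show
the shape of the effect) and is not a measurement from the Cortex-A5 or from
this bug; the function name is likewise made up for the example.

```python
def attainable_gflops(peak_gflops, bandwidth_gb_s, flops_per_byte):
    """Roofline-style bound: throughput is capped either by what the core
    can compute or by what the memory system can feed it."""
    return min(peak_gflops, bandwidth_gb_s * flops_per_byte)


# Hypothetical system and kernel parameters (assumptions, not measurements).
BANDWIDTH_GB_S = 2.0   # sustained memory bandwidth
FLOPS_PER_BYTE = 0.25  # arithmetic intensity of the loop

# Hypothetical compute peaks for three builds of the same loop.
scalar = attainable_gflops(0.3, BANDWIDTH_GB_S, FLOPS_PER_BYTE)
vectorized = attainable_gflops(1.2, BANDWIDTH_GB_S, FLOPS_PER_BYTE)
unrolled = attainable_gflops(1.5, BANDWIDTH_GB_S, FLOPS_PER_BYTE)

print(scalar, vectorized, unrolled)
```

Under these assumed numbers the scalar build is compute-bound, the vectorized
build reaches the memory ceiling, and unrolling on top of vectorization cannot
raise the result any further.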

I'm going to mark this resolved, but feel free to reopen it.

James

-- 
You are receiving this mail because:
You are on the CC list for the bug.
_______________________________________________
llvm-bugs mailing list
llvm-bugs@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-bugs
