The following couple patches add options to better control the loop prefetch pass.
With the current settings the pass tends to be very aggressive, issuing a lot of prefetch hints. Most of these don't translate to better performance. Some of these issued hints may even cause more cache evictions. Luis Machado (2): Introduce prefetch-minimum stride option Introduce prefetch-dynamic-strides option. gcc/config/aarch64/aarch64-protos.h | 6 ++++++ gcc/config/aarch64/aarch64.c | 24 +++++++++++++++++++++++- gcc/doc/invoke.texi | 25 +++++++++++++++++++++++++ gcc/params.def | 18 ++++++++++++++++++ gcc/params.h | 4 ++++ gcc/tree-ssa-loop-prefetch.c | 26 ++++++++++++++++++++++++++ 6 files changed, 102 insertions(+), 1 deletion(-) -- 2.7.4