Hi,

As commented at https://gcc.gnu.org/ml/gcc-patches/2014-09/msg00684.html, this is a simple patch that enables NEON memset inlining on cortex-a53/cortex-a57 in AArch32 mode.
Tested on arm-none-linux-gnueabihf/--with-cpu=cortex-a57/--with-fpu=crypto-neon-fp-armv8/--with-float=hard. I will collect further benchmark data to see whether there is any regression. Is it OK if the benchmark results are good?

2014-11-13  Bin Cheng  <bin.ch...@arm.com>

	* config/arm/arm.c (arm_cortex_a53_tune, arm_cortex_a57_tune): Prefer
	neon for stringops on cortex-a53/a57 in AArch32 mode.
Index: gcc/config/arm/arm.c
===================================================================
--- gcc/config/arm/arm.c	(revision 215108)
+++ gcc/config/arm/arm.c	(working copy)
@@ -1893,7 +1893,7 @@ const struct tune_params arm_cortex_a53_tune =
   &arm_default_vec_cost,			/* Vectorizer costs.  */
   false,					/* Prefer Neon for 64-bits bitops.  */
   false, false,					/* Prefer 32-bit encodings.  */
-  false,					/* Prefer Neon for stringops.  */
+  true,						/* Prefer Neon for stringops.  */
   8						/* Maximum insns to inline memset.  */
 };
 
@@ -1912,7 +1912,7 @@ const struct tune_params arm_cortex_a57_tune =
   &arm_default_vec_cost,			/* Vectorizer costs.  */
   false,					/* Prefer Neon for 64-bits bitops.  */
   true, true,					/* Prefer 32-bit encodings.  */
-  false,					/* Prefer Neon for stringops.  */
+  true,						/* Prefer Neon for stringops.  */
   8						/* Maximum insns to inline memset.  */
 };
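
For readers not familiar with the tuning table, here is a rough sketch of how a "prefer Neon for stringops" flag and a "maximum insns to inline memset" limit might interact. It is a toy model only, not the code in arm.c: the struct, the field names, the function name and the store widths (16 bytes with NEON, 4 bytes without) are all assumptions made up for illustration.

```c
#include <stdbool.h>
#include <stddef.h>

/* Hypothetical, simplified stand-in for the two tuning fields touched
   by the patch.  The field names are illustrative, not GCC's.  */
struct toy_tune
{
  bool prefer_neon_for_stringops;
  int max_insns_inline_memset;
};

/* Return the number of store insns an inline memset of LEN bytes would
   need under TUNE, or -1 if that exceeds the limit and a library call
   would be used instead.  Assumption: one store writes 16 bytes when
   NEON is preferred and 4 bytes otherwise.  */
static int
toy_memset_inline_insns (const struct toy_tune *tune, size_t len)
{
  size_t width = tune->prefer_neon_for_stringops ? 16 : 4;
  size_t insns = (len + width - 1) / width;	/* Round up.  */

  if (insns > (size_t) tune->max_insns_inline_memset)
    return -1;
  return (int) insns;
}
```

Under these assumptions, with the limit left at 8, flipping the flag to true raises the largest block this toy model would inline from 32 bytes to 128 bytes. Whether that pays off on real hardware is what the benchmark data should tell us.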