On 11/17/2017 08:11 AM, Richard Sandiford wrote:
> Fully-masked loops can be profitable even if the iteration
> count is smaller than the vectorisation factor. In this case
> we're effectively doing a complete unroll followed by SLP.
>
> The documentation for min-vect-loop-bound says that the
> default value is 0, but actually the default and minimum
> were 1. We need it to be 0 for this case since the parameter
> counts a whole number of vector iterations.
>
> Tested on aarch64-linux-gnu (with and without SVE), x86_64-linux-gnu
> and powerpc64le-linux-gnu. OK to install?
>
> Richard
>
>
> 2017-11-17 Richard Sandiford <richard.sandif...@linaro.org>
> Alan Hayward <alan.hayw...@arm.com>
> David Sherwood <david.sherw...@arm.com>
>
> gcc/
> * doc/sourcebuild.texi (vect_fully_masked): Document.
> * params.def (PARAM_MIN_VECT_LOOP_BOUND): Change minimum and
> default value to 0.
> * tree-vect-loop.c (vect_analyze_loop_costing): New function,
> split out from...
> (vect_analyze_loop_2): ...here. Don't check the vectorization
> factor against the number of loop iterations if the loop is
> fully-masked.
>
> gcc/testsuite/
> * lib/target-supports.exp (check_effective_target_vect_fully_masked):
> New proc.
> * gcc.dg/vect/slp-3.c: Expect all loops to be vectorized if
> vect_fully_masked.
> * gcc.target/aarch64/sve_loop_add_4.c: New test.
> * gcc.target/aarch64/sve_loop_add_4_run.c: Likewise.
> * gcc.target/aarch64/sve_loop_add_5.c: Likewise.
> * gcc.target/aarch64/sve_loop_add_5_run.c: Likewise.
> * gcc.target/aarch64/sve_miniloop_1.c: Likewise.
> * gcc.target/aarch64/sve_miniloop_2.c: Likewise.
OK.
Jeff