Re: Allow the number of iterations to be smaller than VF

James Greenhalgh Sun, 07 Jan 2018 12:53:03 -0800

On Mon, Nov 20, 2017 at 12:12:38AM +0000, Jeff Law wrote:
> On 11/17/2017 08:11 AM, Richard Sandiford wrote:
> > Fully-masked loops can be profitable even if the iteration
> > count is smaller than the vectorisation factor.  In this case
> > we're effectively doing a complete unroll followed by SLP.
> > 
> > The documentation for min-vect-loop-bound says that the
> > default value is 0, but actually the default and minimum
> > were 1.  We need it to be 0 for this case since the parameter
> > counts a whole number of vector iterations.
> > 
> > Tested on aarch64-linux-gnu (with and without SVE), x86_64-linux-gnu
> > and powerpc64le-linux-gnu.  OK to install?
> > 
> > Richard
> > 
> > 
> > 2017-11-17  Richard Sandiford  <richard.sandif...@linaro.org>
> >         Alan Hayward  <alan.hayw...@arm.com>
> >         David Sherwood  <david.sherw...@arm.com>
> > 
> > gcc/
> >     * doc/sourcebuild.texi (vect_fully_masked): Document.
> >     * params.def (PARAM_MIN_VECT_LOOP_BOUND): Change minimum and
> >     default value to 0.
> >     * tree-vect-loop.c (vect_analyze_loop_costing): New function,
> >     split out from...
> >     (vect_analyze_loop_2): ...here. Don't check the vectorization
> >     factor against the number of loop iterations if the loop is
> >     fully-masked.
> > 
> > gcc/testsuite/
> >     * lib/target-supports.exp (check_effective_target_vect_fully_masked):
> >     New proc.
> >     * gcc.dg/vect/slp-3.c: Expect all loops to be vectorized if
> >     vect_fully_masked.
> >     * gcc.target/aarch64/sve_loop_add_4.c: New test.
> >     * gcc.target/aarch64/sve_loop_add_4_run.c: Likewise.
> >     * gcc.target/aarch64/sve_loop_add_5.c: Likewise.
> >     * gcc.target/aarch64/sve_loop_add_5_run.c: Likewise.
> >     * gcc.target/aarch64/sve_miniloop_1.c: Likewise.
> >     * gcc.target/aarch64/sve_miniloop_2.c: Likewise.
> OK.
> Jeff


The AArch64 tests are OK.

James
