Hi,
In this patch series I address PR 88915 in the first patch, by enabling
vectorization of epilogues when doing loop versioning and turn on
--param vect-epilogues-nomask=1 on by default in the second patch.
I benchmarked intrate SPEC2017 for both aarch64 and x86_64 (AVX512).
This patch gives aarch64 a 7% up for x264_r on SPEC2017, with all other
benchmarks in intrate staying the same. On a x86_64 with AVX512 I do see
a 3% drop on exchange2_r and a 1% drop on xz_r. Other benchmarks either
go up a little or stay the same. Again x264_r showing the highest gain
with a 16% improvement and the intrate geomean goes up by 1%.
Andre Vieira (2):
[PATCH 1/2][vect]PR 88915: Vectorize epilogues when versioning loops
[PATCH 2/2][vect]Make vect-epilogues-nomask=1 default