On Tue, Nov 17, 2020 at 12:29 PM Richard Sandiford via Gcc-patches
wrote:
>
> On arm* and aarch64* targets, we can vectorise the second of the main
> loops using SLP, not just the third. As the comments say, whether this
> is supported depends on a very specific permutation, so it seemed better
> to use direct target selectors.
> Tested on aarch64-linux-gnu (with and without SVE),