Re: [AArch64] Improve SVE dup intrinsics codegen

2022-05-17 Thread Richard Sandiford via Gcc-patches
"Andre Vieira (lists)" writes:
> Hi,
>
> This patch teaches the aarch64 backend to improve codegen when using dup
> with NEON vectors with repeating patterns. It will attempt to use a
> smaller NEON vector (or element) to limit the number of instructions
> needed to construct the input vector.
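
As a rough illustration of the "repeating patterns" idea described above, and not the code from the patch itself, the sketch below finds the smallest power-of-two prefix of a vector's elements that repeats to fill the whole vector. The function name smallest_repeating_period and the use of int32_t lanes are assumptions made for the example; the real backend works on RTL constants rather than plain arrays.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper, not taken from the patch: return the smallest
   power-of-two period P such that elts[i] == elts[i - P] for every
   i in [P, n).  N must be a nonzero power of two, e.g. the lane count
   of a 128-bit NEON vector.  */
static size_t
smallest_repeating_period (const int32_t *elts, size_t n)
{
  assert (n > 0 && (n & (n - 1)) == 0);

  for (size_t period = 1; period < n; period *= 2)
    {
      size_t i;
      /* Every element must match the one a full period before it.  */
      for (i = period; i < n; i++)
        if (elts[i] != elts[i - period])
          break;
      if (i == n)
        return period;
    }

  /* No shorter repetition exists; the whole vector is the pattern.  */
  return n;
}
```

A period of 1 would mean a single element can be duplicated, while a period of 2 in a four-lane vector would mean only a two-element half needs to be constructed before the dup.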

[AArch64] Improve SVE dup intrinsics codegen

2022-05-17 Thread Andre Vieira (lists) via Gcc-patches
Hi,

This patch teaches the aarch64 backend to improve codegen when using dup with NEON vectors with repeating patterns. It will attempt to use a smaller NEON vector (or element) to limit the number of instructions needed to construct the input vector.

Bootstrapped and regression tested aarc