Hi,

As subject, this patch implements saturating right-shift and narrow high
Neon intrinsic RTL patterns using a vec_concat of a register_operand
and a VQSHRN_N unspec - instead of just a VQSHRN_N unspec. This
more relaxed pattern allows for more aggressive combinations and
ultimately better code generation - which will be confirmed by a new
set of tests in gcc.target/aarch64/narrow_high_combine.c (patch 5/5 in
this series.)

Regression tested and bootstrapped on aarch64-none-linux-gnu - no
issues.

Ok for master?

Thanks,
Jonathan

---

gcc/ChangeLog:

2021-03-04  Jonathan Wright  <jonathan.wri...@arm.com>

        * config/aarch64/aarch64-simd.md (aarch64_<sur>q<r>shr<u>n2_n<mode>):
        Implement as an expand emitting a big/little endian
        instruction pattern.
        (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_le): Define.
        (aarch64_<sur>q<r>shr<u>n2_n<mode>_insn_be): Define.

Attachment: rb14251.patch
Description: rb14251.patch

Reply via email to