Hi,

As subject, this patch implements v[r]addhn2 and v[r]subhn2 Neon intrinsic
RTL patterns using a vec_concat of a register_operand and an ADDSUBHN
unspec - instead of just an ADDSUBHN2 unspec. This more relaxed pattern
allows for more aggressive combinations and ultimately better code
generation - which will be confirmed by a new set of tests in
gcc.target/aarch64/narrow_high_combine.c (patch 5/5 in this series).

This patch also removes the now redundant [R]ADDHN2 and [R]SUBHN2
unspecs and their iterator.

Regression tested and bootstrapped on aarch64-none-linux-gnu - no
issues.

Ok for master?

Thanks,
Jonathan

---

gcc/ChangeLog:

2021-03-03  Jonathan Wright  <jonathan.wri...@arm.com>

        * config/aarch64/aarch64-simd.md (aarch64_<sur><addsub>hn2<mode>):
        Implement as an expand emitting a big/little endian
        instruction pattern.
        (aarch64_<sur><addsub>hn2<mode>_insn_le): Define.
        (aarch64_<sur><addsub>hn2<mode>_insn_be): Define.
        * config/aarch64/iterators.md: Remove UNSPEC_[R]ADDHN2 and
        UNSPEC_[R]SUBHN2 unspecs and ADDSUBHN2 iterator.

Attachment: rb14250.patch
Description: rb14250.patch

Reply via email to