This patch series adds support for the `FEAT_MOP4` extension (see
https://arm-software.github.io/acle/main/acle.html#sme2-mop4-intrinsics).
As part of this series, I also added support for `FEAT_SME_F8F16` and
`FEAT_SME_F8F32`.

This patch series depends on
https://inbox.sourceware.org/gcc-patches/[email protected]/
for the ability to have intrinsics with three type suffixes.

ChangeLog:
  * v1: Initial patch series

Claudio Bantaloukas (1):
  aarch64: Add the ability to have three types in an sve/sme intrinsic
    name

Karl Meakin (5):
  aarch64: Fix comments for register constraints
  aarch64: Add `SME_MOP4` extensions
  aarch64: Add `SME_MOP4` instrinsics
  aarch64: Add new register constraints
  aarch64: Add insns for `SME_MOP4` intrinsics

 gcc/config/aarch64/aarch64-c.cc               |   4 +
 .../aarch64/aarch64-option-extensions.def     |   5 +
 gcc/config/aarch64/aarch64-sme.md             | 173 ++++++++
 .../aarch64/aarch64-sve-builtins-base.cc      |   8 +-
 .../aarch64/aarch64-sve-builtins-functions.h  |  46 ++
 .../aarch64/aarch64-sve-builtins-shapes.cc    |  82 +++-
 .../aarch64/aarch64-sve-builtins-shapes.h     |   1 +
 .../aarch64/aarch64-sve-builtins-sme.cc       |   6 +
 .../aarch64/aarch64-sve-builtins-sme.def      |  79 +++-
 gcc/config/aarch64/aarch64-sve-builtins-sme.h |   2 +
 gcc/config/aarch64/aarch64-sve-builtins.cc    | 398 +++++++++++-------
 gcc/config/aarch64/aarch64-sve-builtins.def   |   4 +
 gcc/config/aarch64/aarch64-sve-builtins.h     |  19 +-
 gcc/config/aarch64/aarch64.cc                 |   2 +
 gcc/config/aarch64/aarch64.h                  |   9 +
 gcc/config/aarch64/constraints.md             |  21 +-
 gcc/config/aarch64/iterators.md               |  50 +++
 gcc/doc/invoke.texi                           |   8 +
 .../gcc.target/aarch64/pragma_cpp_predefs_4.c |   5 +
 .../aarch64/sme/acle-asm/test_sme_acle.h      |   4 +-
 .../sme2/acle-asm/mop4a_za16_bf16_bf16.c      |  85 ++++
 .../sme2/acle-asm/mop4a_za16_f16_f16.c        |  85 ++++
 .../sme2/acle-asm/mop4a_za16_mf8_mf8.c        |  85 ++++
 .../sme2/acle-asm/mop4a_za32_bf16_bf16.c      |  85 ++++
 .../sme2/acle-asm/mop4a_za32_f16_f16.c        |  85 ++++
 .../sme2/acle-asm/mop4a_za32_f32_f32.c        |  85 ++++
 .../sme2/acle-asm/mop4a_za32_mf8_mf8.c        |  86 ++++
 .../sme2/acle-asm/mop4a_za32_s16_s16.c        |  85 ++++
 .../aarch64/sme2/acle-asm/mop4a_za32_s8_s8.c  |  85 ++++
 .../aarch64/sme2/acle-asm/mop4a_za32_s8_u8.c  |  85 ++++
 .../sme2/acle-asm/mop4a_za32_u16_u16.c        |  85 ++++
 .../aarch64/sme2/acle-asm/mop4a_za32_u8_s8.c  |  85 ++++
 .../aarch64/sme2/acle-asm/mop4a_za32_u8_u8.c  |  85 ++++
 .../sme2/acle-asm/mop4a_za64_f64_f64.c        |  85 ++++
 .../sme2/acle-asm/mop4a_za64_s16_s16.c        |  85 ++++
 .../sme2/acle-asm/mop4a_za64_s16_u16.c        |  85 ++++
 .../sme2/acle-asm/mop4a_za64_u16_s16.c        |  85 ++++
 .../sme2/acle-asm/mop4a_za64_u16_u16.c        |  85 ++++
 .../sme2/acle-asm/mop4s_za16_bf16_bf16.c      |  85 ++++
 .../sme2/acle-asm/mop4s_za16_f16_f16.c        |  85 ++++
 .../sme2/acle-asm/mop4s_za32_bf16_bf16.c      |  85 ++++
 .../sme2/acle-asm/mop4s_za32_f16_f16.c        |  85 ++++
 .../sme2/acle-asm/mop4s_za32_s16_s16.c        |  85 ++++
 .../aarch64/sme2/acle-asm/mop4s_za32_s8_s8.c  |  85 ++++
 .../aarch64/sme2/acle-asm/mop4s_za32_s8_u8.c  |  85 ++++
 .../sme2/acle-asm/mop4s_za32_u16_u16.c        |  85 ++++
 .../aarch64/sme2/acle-asm/mop4s_za32_u8_s8.c  |  85 ++++
 .../aarch64/sme2/acle-asm/mop4s_za32_u8_u8.c  |  85 ++++
 .../sme2/acle-asm/mop4s_za64_f64_f64.c        |  85 ++++
 .../sme2/acle-asm/mop4s_za64_s16_s16.c        |  85 ++++
 .../sme2/acle-asm/mop4s_za64_s16_u16.c        |  85 ++++
 .../sme2/acle-asm/mop4s_za64_u16_s16.c        |  85 ++++
 .../sme2/acle-asm/mop4s_za64_u16_u16.c        |  85 ++++
 .../aarch64/sve/acle/general-c/mop4_b16b16.c  |  79 ++++
 .../aarch64/sve/acle/general-c/mop4_base.c    | 106 +++++
 .../aarch64/sve/acle/general-c/mop4_f16f16.c  |  79 ++++
 .../aarch64/sve/acle/general-c/mop4_f64f64.c  |  79 ++++
 .../aarch64/sve/acle/general-c/mop4_f8f16.c   |  84 ++++
 .../aarch64/sve/acle/general-c/mop4_f8f32.c   |  84 ++++
 .../aarch64/sve/acle/general-c/mop4_i16i64.c  |  88 ++++
 60 files changed, 4132 insertions(+), 199 deletions(-)
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za16_bf16_bf16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za16_f16_f16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za16_mf8_mf8.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_bf16_bf16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_f16_f16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_f32_f32.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_mf8_mf8.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_s16_s16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_s8_s8.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_s8_u8.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_u16_u16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_u8_s8.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_u8_u8.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za64_f64_f64.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za64_s16_s16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za64_s16_u16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za64_u16_s16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za64_u16_u16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za16_bf16_bf16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za16_f16_f16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_bf16_bf16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_f16_f16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_s16_s16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_s8_s8.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_s8_u8.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_u16_u16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_u8_s8.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_u8_u8.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za64_f64_f64.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za64_s16_s16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za64_s16_u16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za64_u16_s16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za64_u16_u16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_b16b16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_base.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_f16f16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_f64f64.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_f8f16.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_f8f32.c
 create mode 100644 
gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_i16i64.c

--
2.43.0

Reply via email to