This patch series adds support for the `FEAT_MOP4` extension (see https://arm-software.github.io/acle/main/acle.html#sme2-mop4-intrinsics). As part of this series, I also added support for `FEAT_SME_F8F16` and `FEAT_SME_F8F32`.
This patch series depends on https://inbox.sourceware.org/gcc-patches/[email protected]/ for the ability to have intrinsics with three type suffixes. ChangeLog: * v1: Initial patch series Claudio Bantaloukas (1): aarch64: Add the ability to have three types in an sve/sme intrinsic name Karl Meakin (5): aarch64: Fix comments for register constraints aarch64: Add `SME_MOP4` extensions aarch64: Add `SME_MOP4` instrinsics aarch64: Add new register constraints aarch64: Add insns for `SME_MOP4` intrinsics gcc/config/aarch64/aarch64-c.cc | 4 + .../aarch64/aarch64-option-extensions.def | 5 + gcc/config/aarch64/aarch64-sme.md | 173 ++++++++ .../aarch64/aarch64-sve-builtins-base.cc | 8 +- .../aarch64/aarch64-sve-builtins-functions.h | 46 ++ .../aarch64/aarch64-sve-builtins-shapes.cc | 82 +++- .../aarch64/aarch64-sve-builtins-shapes.h | 1 + .../aarch64/aarch64-sve-builtins-sme.cc | 6 + .../aarch64/aarch64-sve-builtins-sme.def | 79 +++- gcc/config/aarch64/aarch64-sve-builtins-sme.h | 2 + gcc/config/aarch64/aarch64-sve-builtins.cc | 398 +++++++++++------- gcc/config/aarch64/aarch64-sve-builtins.def | 4 + gcc/config/aarch64/aarch64-sve-builtins.h | 19 +- gcc/config/aarch64/aarch64.cc | 2 + gcc/config/aarch64/aarch64.h | 9 + gcc/config/aarch64/constraints.md | 21 +- gcc/config/aarch64/iterators.md | 50 +++ gcc/doc/invoke.texi | 8 + .../gcc.target/aarch64/pragma_cpp_predefs_4.c | 5 + .../aarch64/sme/acle-asm/test_sme_acle.h | 4 +- .../sme2/acle-asm/mop4a_za16_bf16_bf16.c | 85 ++++ .../sme2/acle-asm/mop4a_za16_f16_f16.c | 85 ++++ .../sme2/acle-asm/mop4a_za16_mf8_mf8.c | 85 ++++ .../sme2/acle-asm/mop4a_za32_bf16_bf16.c | 85 ++++ .../sme2/acle-asm/mop4a_za32_f16_f16.c | 85 ++++ .../sme2/acle-asm/mop4a_za32_f32_f32.c | 85 ++++ .../sme2/acle-asm/mop4a_za32_mf8_mf8.c | 86 ++++ .../sme2/acle-asm/mop4a_za32_s16_s16.c | 85 ++++ .../aarch64/sme2/acle-asm/mop4a_za32_s8_s8.c | 85 ++++ .../aarch64/sme2/acle-asm/mop4a_za32_s8_u8.c | 85 ++++ .../sme2/acle-asm/mop4a_za32_u16_u16.c | 85 ++++ .../aarch64/sme2/acle-asm/mop4a_za32_u8_s8.c | 85 ++++ .../aarch64/sme2/acle-asm/mop4a_za32_u8_u8.c | 85 ++++ .../sme2/acle-asm/mop4a_za64_f64_f64.c | 85 ++++ .../sme2/acle-asm/mop4a_za64_s16_s16.c | 85 ++++ .../sme2/acle-asm/mop4a_za64_s16_u16.c | 85 ++++ .../sme2/acle-asm/mop4a_za64_u16_s16.c | 85 ++++ .../sme2/acle-asm/mop4a_za64_u16_u16.c | 85 ++++ .../sme2/acle-asm/mop4s_za16_bf16_bf16.c | 85 ++++ .../sme2/acle-asm/mop4s_za16_f16_f16.c | 85 ++++ .../sme2/acle-asm/mop4s_za32_bf16_bf16.c | 85 ++++ .../sme2/acle-asm/mop4s_za32_f16_f16.c | 85 ++++ .../sme2/acle-asm/mop4s_za32_s16_s16.c | 85 ++++ .../aarch64/sme2/acle-asm/mop4s_za32_s8_s8.c | 85 ++++ .../aarch64/sme2/acle-asm/mop4s_za32_s8_u8.c | 85 ++++ .../sme2/acle-asm/mop4s_za32_u16_u16.c | 85 ++++ .../aarch64/sme2/acle-asm/mop4s_za32_u8_s8.c | 85 ++++ .../aarch64/sme2/acle-asm/mop4s_za32_u8_u8.c | 85 ++++ .../sme2/acle-asm/mop4s_za64_f64_f64.c | 85 ++++ .../sme2/acle-asm/mop4s_za64_s16_s16.c | 85 ++++ .../sme2/acle-asm/mop4s_za64_s16_u16.c | 85 ++++ .../sme2/acle-asm/mop4s_za64_u16_s16.c | 85 ++++ .../sme2/acle-asm/mop4s_za64_u16_u16.c | 85 ++++ .../aarch64/sve/acle/general-c/mop4_b16b16.c | 79 ++++ .../aarch64/sve/acle/general-c/mop4_base.c | 106 +++++ .../aarch64/sve/acle/general-c/mop4_f16f16.c | 79 ++++ .../aarch64/sve/acle/general-c/mop4_f64f64.c | 79 ++++ .../aarch64/sve/acle/general-c/mop4_f8f16.c | 84 ++++ .../aarch64/sve/acle/general-c/mop4_f8f32.c | 84 ++++ .../aarch64/sve/acle/general-c/mop4_i16i64.c | 88 ++++ 60 files changed, 4132 insertions(+), 199 deletions(-) create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za16_bf16_bf16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za16_f16_f16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za16_mf8_mf8.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_bf16_bf16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_f16_f16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_f32_f32.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_mf8_mf8.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_s16_s16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_s8_s8.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_s8_u8.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_u16_u16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_u8_s8.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za32_u8_u8.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za64_f64_f64.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za64_s16_s16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za64_s16_u16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za64_u16_s16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4a_za64_u16_u16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za16_bf16_bf16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za16_f16_f16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_bf16_bf16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_f16_f16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_s16_s16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_s8_s8.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_s8_u8.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_u16_u16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_u8_s8.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za32_u8_u8.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za64_f64_f64.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za64_s16_s16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za64_s16_u16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za64_u16_s16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sme2/acle-asm/mop4s_za64_u16_u16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_b16b16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_base.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_f16f16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_f64f64.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_f8f16.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_f8f32.c create mode 100644 gcc/testsuite/gcc.target/aarch64/sve/acle/general-c/mop4_i16i64.c -- 2.43.0
