Re: [PATCH v2 2/3] aarch64: Add support for fp8dot2 and fp8dot4

2024-12-09 Thread Richard Sandiford
writes:
> The AArch64 FEAT_FP8DOT2 and FEAT_FP8DOT4 extension introduces
> instructions for dot product of vectors.
>
> This patch introduces the following intrinsics:
> 1. vdot{q}_{fp16|fp32}_mf8_fpm.
> 2. vdot{q}_lane{q}_{fp16|fp32}_mf8_fpm.
>
> It introduces two flags: fp8dot2 and fp8dot4.
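
The brace notation in the intrinsic names is shorthand for a family of functions: `{q}` marks an optional suffix and `{fp16|fp32}` a choice between alternatives. As a rough illustration only, the Python sketch below expands the two patterns exactly as they are spelled in the message. The helper name `expand` is hypothetical and not part of the patch, and the output is the literal expansion of the text above, not a confirmed list of the names the patch finally exports.

```python
import itertools
import re


def expand(pattern):
    """Expand {a|b} alternatives and {x} optional parts into all names.

    Hypothetical helper for illustration; it is not part of the patch.
    """
    # re.split with a capturing group alternates literal text (even
    # indexes) with the contents of each brace group (odd indexes).
    parts = re.split(r"\{([^{}]*)\}", pattern)
    choices = []
    for index, part in enumerate(parts):
        if index % 2 == 0:
            choices.append([part])            # literal text
        elif "|" in part:
            choices.append(part.split("|"))   # pick exactly one alternative
        else:
            choices.append(["", part])        # optional: absent or present
    return ["".join(combo) for combo in itertools.product(*choices)]


# The two patterns as written in the patch description.
plain = expand("vdot{q}_{fp16|fp32}_mf8_fpm")
lane = expand("vdot{q}_lane{q}_{fp16|fp32}_mf8_fpm")

for name in plain + lane:
    print(name)
```

Read this way, the first pattern covers four names and the second covers eight.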

[PATCH v2 2/3] aarch64: Add support for fp8dot2 and fp8dot4

2024-11-14 Thread saurabh.jha
The AArch64 FEAT_FP8DOT2 and FEAT_FP8DOT4 extension introduces
instructions for dot product of vectors.

This patch introduces the following intrinsics:
1. vdot{q}_{fp16|fp32}_mf8_fpm.
2. vdot{q}_lane{q}_{fp16|fp32}_mf8_fpm.

It introduces two flags: fp8dot2 and fp8dot4.

We had to add space for a