Re: [ARM] Enable FP16 vector arithmetic operations.

Tamar Christina Mon, 15 May 2017 01:33:08 -0700

Ping
________________________________________
From: gcc-patches-ow...@gcc.gnu.org <gcc-patches-ow...@gcc.gnu.org> on behalf 
of Tamar Christina <tamar.christ...@arm.com>
Sent: Tuesday, May 2, 2017 3:46:49 PM
To: Matthew Wahab; gcc-patches
Cc: nd; ni...@redhat.com; Richard Earnshaw; Ramana Radhakrishnan; Kyrylo Tkachov
Subject: Re: [ARM] Enable FP16 vector arithmetic operations.


Hi All,

I'm taking this one over from Matthew, I think it slipped through the cracks 
before.

Since it still applies cleanly on trunk I'm just pinging it.

Ok for trunk?

Tamar
________________________________________
From: gcc-patches-ow...@gcc.gnu.org <gcc-patches-ow...@gcc.gnu.org> on behalf 
of Matthew Wahab <matthew.wa...@foss.arm.com>
Sent: Friday, September 23, 2016 4:02 PM
To: gcc-patches
Subject: [ARM] Enable FP16 vector arithmetic operations.

Hello,

Support for the ARMv8.2-A FP16 NEON arithmetic instructions was added
using non-standard names for the instruction patterns. This was needed
because the NEON floating point semantics meant that their use by the
compiler for HFmode arithmetic operations needed to be restricted. This
follows the implementation for 32-bit NEON intructions.

As with the 32-bit instructions, the restriction on the HFmode
operation can be lifted when -funsafe-math-optimizations is
enabled. This patch does that, defining the standard pattern names
addhf3, subhf3, mulhf3 and fmahf3.

This patch also updates the NEON intrinsics to use the arithmetic
operations when -ffast-math is enabled. This is to make keep the 16-bit
support consistent with the 32-bit supportd. It is needed so that code
using the f16 intrinsics are subject to the same optimizations as code
using the f32 intrinsics would be.

Tested for arm-none-linux-gnueabihf with native bootstrap and make check
on ARMv8-A and for arm-none-eabi and armeb-none-eabi with cross-compiled
make check on an ARMv8.2-A emulator.

Ok for trunk?
Matthew

gcc/
2016-09-23  Matthew Wahab  <matthew.wa...@arm.com>

        * config/arm/arm_neon.h (vadd_f16): Use standard arithmetic
        operations in fast-math mode.
        (vaddq_f16): Likewise.
        (vmul_f16): Likewise.
        (vmulq_f16): Likewise.
        (vsub_f16): Likewise.
        (vsubq_f16): Likewise.
        * config/arm/neon.md (add<mode>3): New.
        (sub<mode>3): New.
        (fma:<VH:mode>3): New.  Also remove outdated comment.
        (mul<mode>3): New.

testsuite/
2016-09-23  Matthew Wahab  <matthew.wa...@arm.com>

        * gcc.target/arm/armv8_2-fp16-arith-1.c: Expand comment.  Update
        expected output of vadd, vsub and vmul instructions.
        * gcc.target/arm/armv8_2-fp16-arith-2.c: New.
        * gcc.target/arm/armv8_2-fp16-neon-2.c: New.
        * gcc.target/arm/armv8_2-fp16-neon-3.c: New.

Re: [ARM] Enable FP16 vector arithmetic operations.

Reply via email to