Hi Kyrill,
> 
> Sorry for missing this.
> For the record you are referring to the patch at:
> https://gcc.gnu.org/ml/gcc-patches/2016-09/msg01700.html
> 
> This is ok and in line with what we do for the f32 intrinsics.
> My only concern was that we can do this only if
> __ARM_FEATURE_FP16_VECTOR_ARITHMETIC
> is defined from the architecture/fpu level, but these intrinsics are already
> gated on that in arm_neon.h.
> 
> This is ok for trunk if a bootstrap and test run on arm-none-linux-gnueabihf
> with current trunk shows no issues.

Thanks, bootstrapped and regtested now on arm-none-linux-gnueabihf and no 
issues.
I'll go ahead and commit then.

Regards,
Tamar

> 
> Thanks,
> Kyrill
> 
> > Tamar
> > ________________________________________
> > From: gcc-patches-ow...@gcc.gnu.org <gcc-patches-ow...@gcc.gnu.org>
> on
> > behalf of Matthew Wahab <matthew.wa...@foss.arm.com>
> > Sent: Friday, September 23, 2016 4:02 PM
> > To: gcc-patches
> > Subject: [ARM] Enable FP16 vector arithmetic operations.
> >
> > Hello,
> >
> > Support for the ARMv8.2-A FP16 NEON arithmetic instructions was added
> > using non-standard names for the instruction patterns. This was needed
> > because the NEON floating point semantics meant that their use by the
> > compiler for HFmode arithmetic operations needed to be restricted.
> > This follows the implementation for 32-bit NEON intructions.
> >
> > As with the 32-bit instructions, the restriction on the HFmode
> > operation can be lifted when -funsafe-math-optimizations is enabled.
> > This patch does that, defining the standard pattern names addhf3,
> > subhf3, mulhf3 and fmahf3.
> >
> > This patch also updates the NEON intrinsics to use the arithmetic
> > operations when -ffast-math is enabled. This is to make keep the
> > 16-bit support consistent with the 32-bit supportd. It is needed so
> > that code using the f16 intrinsics are subject to the same
> > optimizations as code using the f32 intrinsics would be.
> >
> > Tested for arm-none-linux-gnueabihf with native bootstrap and make
> > check on ARMv8-A and for arm-none-eabi and armeb-none-eabi with
> > cross-compiled make check on an ARMv8.2-A emulator.
> >
> > Ok for trunk?
> > Matthew
> >
> > gcc/
> > 2016-09-23  Matthew Wahab  <matthew.wa...@arm.com>
> >
> >         * config/arm/arm_neon.h (vadd_f16): Use standard arithmetic
> >         operations in fast-math mode.
> >         (vaddq_f16): Likewise.
> >         (vmul_f16): Likewise.
> >         (vmulq_f16): Likewise.
> >         (vsub_f16): Likewise.
> >         (vsubq_f16): Likewise.
> >         * config/arm/neon.md (add<mode>3): New.
> >         (sub<mode>3): New.
> >         (fma:<VH:mode>3): New.  Also remove outdated comment.
> >         (mul<mode>3): New.
> >
> > testsuite/
> > 2016-09-23  Matthew Wahab  <matthew.wa...@arm.com>
> >
> >         * gcc.target/arm/armv8_2-fp16-arith-1.c: Expand comment.  Update
> >         expected output of vadd, vsub and vmul instructions.
> >         * gcc.target/arm/armv8_2-fp16-arith-2.c: New.
> >         * gcc.target/arm/armv8_2-fp16-neon-2.c: New.
> >         * gcc.target/arm/armv8_2-fp16-neon-3.c: New.

Reply via email to