Hello HJ,

On 28 июн 07:19, H.J. Lu via Gcc-patches wrote:
> Enable FMA in rsqrt<mode>2 expander and fold rsqrtv16sf2 expander into
> rsqrt<mode>2 expander which expands to UNSPEC_RSQRT28 for TARGET_AVX512ER.
> Although it doesn't show performance change in our workloads, FMA can
> improve other workloads.
> 
> gcc/
> 
>       PR target/88713
>       * config/i386/i386-expand.c (ix86_emit_swsqrtsf): Enable FMA.
>       * config/i386/sse.md (VF_AVX512VL_VF1_128_256): New.
>       (rsqrt<mode>2): Replace VF1_128_256 with VF_AVX512VL_VF1_128_256.
>       (rsqrtv16sf2): Removed.
> 
> gcc/testsuite/
> 
>       PR target/88713
>       * gcc.target/i386/pr88713-1.c: New test.
>       * gcc.target/i386/pr88713-2.c: Likewise.

So, you've introduced new rsqrt expanders for DF vectors and relaxed
condition for V16SF. What I didn't get is why did you change unspec
type from RSQRT to RSQRT28 for V16SF expander?

--
K

Reply via email to