Re: [2/3 PATCH]AArch64 use canonical ordering for complex mul, fma and fms

Richard Sandiford via Gcc-patches Fri, 17 Dec 2021 08:24:48 -0800

Tamar Christina <tamar.christ...@arm.com> writes:
> Hi All,
>
> After the first patch in the series this updates the optabs to expect the
> canonical sequence.
>
> Bootstrapped Regtested on aarch64-none-linux-gnu and no issues.
>
> Ok for master? and backport along with the first patch?
>
> Thanks,
> Tamar
>
> gcc/ChangeLog:
>
>       PR tree-optimization/102819
>       PR tree-optimization/103169
>       * config/aarch64/aarch64-simd.md (cml<fcmac1><conj_op><mode>4,
>       cmul<conj_op><mode>3): Use canonical order.
>       * config/aarch64/aarch64-sve.md (cml<fcmac1><conj_op><mode>4,
>       cmul<conj_op><mode>3): Likewise.
>
> --- inline copy of patch -- 
> diff --git a/gcc/config/aarch64/aarch64-simd.md 
> b/gcc/config/aarch64/aarch64-simd.md
> index 
> f95a7e1d91c97c9e981d75e71f0b49c02ef748ba..875896ee71324712c8034eeff9cfb5649f9b0e73
>  100644
> --- a/gcc/config/aarch64/aarch64-simd.md
> +++ b/gcc/config/aarch64/aarch64-simd.md
> @@ -556,17 +556,17 @@ (define_insn "aarch64_fcmlaq_lane<rot><mode>"
>  ;; remainder.  Because of this, expand early.
>  (define_expand "cml<fcmac1><conj_op><mode>4"
>    [(set (match_operand:VHSDF 0 "register_operand")
> -     (plus:VHSDF (match_operand:VHSDF 1 "register_operand")
> -                 (unspec:VHSDF [(match_operand:VHSDF 2 "register_operand")
> -                                (match_operand:VHSDF 3 "register_operand")]
> -                                FCMLA_OP)))]
> +     (plus:VHSDF (unspec:VHSDF [(match_operand:VHSDF 1 "register_operand")
> +                                (match_operand:VHSDF 2 "register_operand")]
> +                                FCMLA_OP)
> +                 (match_operand:VHSDF 3 "register_operand")))]
>    "TARGET_COMPLEX && !BYTES_BIG_ENDIAN"
>  {
>    rtx tmp = gen_reg_rtx (<MODE>mode);
> -  emit_insn (gen_aarch64_fcmla<rotsplit1><mode> (tmp, operands[1],
> -                                              operands[3], operands[2]));
> +  emit_insn (gen_aarch64_fcmla<rotsplit1><mode> (tmp, operands[3],
> +                                              operands[1], operands[2]));
>    emit_insn (gen_aarch64_fcmla<rotsplit2><mode> (operands[0], tmp,
> -                                              operands[3], operands[2]));
> +                                              operands[1], operands[2]));
>    DONE;
>  })
>  
> @@ -583,9 +583,9 @@ (define_expand "cmul<conj_op><mode>3"
>    rtx tmp = force_reg (<MODE>mode, CONST0_RTX (<MODE>mode));
>    rtx res1 = gen_reg_rtx (<MODE>mode);
>    emit_insn (gen_aarch64_fcmla<rotsplit1><mode> (res1, tmp,
> -                                              operands[2], operands[1]));
> +                                              operands[1], operands[2]));
>    emit_insn (gen_aarch64_fcmla<rotsplit2><mode> (operands[0], res1,
> -                                              operands[2], operands[1]));
> +                                              operands[1], operands[2]));


This doesn't look right.  Going from the documentation, patch 1 isn't
changing the operand order for CMUL: the conjugated operand (if there
is one) is still operand 2.  The FCMLA sequences use the opposite order,
where the conjugated operand (if there is one) is operand 1.  So I think
the reversal here is still needed.

Same for the multiplication operands in CML* above.

Thanks,
Richard

Re: [2/3 PATCH]AArch64 use canonical ordering for complex mul, fma and fms

Reply via email to