On Sat, Mar 5, 2022 at 4:05 PM Jakub Jelinek <ja...@redhat.com> wrote:
>
> Hi!
>
> The following testcase ICEs, because the cond_andv* expander
> has vector_operand predicates in both of the commutative inputs
> and calls gen_andv*_mask which calls ix86_binary_operator_ok
> in its condition, but nothing calls ix86_fixup_binary_operands_no_copy
> during the expansion, which means cond_* accepts even operands
> like 2 MEMs which then can't be matched.
>
> The following patch handles it like most other insns that the other
> cond_* patterns use - by having a separate define_expand that calls
> ix86_fixup_binary_operands_no_copy and define_ins with
> ix86_binary_operator_ok.
>
> Bootstrapped/regtested on x86_64-linux and i686-linux, ok for trunk?
Ok.
>
> Note, the predicates on cond_fma* and other FMA variants look all wrong to
> me, usually the fma instructions require nonimmediate_operand operands,
> but the cond_* patterns use vector_operand.  Besides what this patch
--------cut from predicate.md---------
1142; Return true when OP is operand acceptable for vector memory operand.
1143; Only AVX can have misaligned memory operand.
1144(define_predicate "vector_memory_operand"
1145  (and (match_operand 0 "memory_operand")
1146       (ior (match_test "TARGET_AVX")
1147            (match_test "MEM_ALIGN (op) >= GET_MODE_ALIGNMENT (mode)"))))
1148
1149; Return true when OP is register_operand or vector_memory_operand.
1150(define_predicate "vector_operand"
1151  (ior (match_operand 0 "register_operand")
1152       (match_operand 0 "vector_memory_operand")))
--------cut end------------------------

vector_operand is a subset of nonimmediate_operands, so it's more like
a potential optimization issue rather than a correctness one?
> fixes and the unfixed fma which I don't have spare cycles for right
> now I went through all the other cond_* patterns checking for predicate
> mismatches or similar missing ix86_fixup_binary_operands* issues and
> didn't find other problems.
>
> 2022-03-05  Jakub Jelinek  <ja...@redhat.com>
>
>         PR target/104779
>         * config/i386/sse.md (avx512dq_mul<mode>3<mask_name>): New
>         define_expand pattern.  Rename define_insn to ...
>         (*avx512dq_mul<mode>3<mask_name>): ... this.
>         (<code><mode>3_mask): New any_logic define_expand pattern.
>         (<mask_codefor><code><mode>3<mask_name>): Rename to ...
>         (*<code><mode>3<mask_name>): ... this.
>
>         * gcc.target/i386/pr104779.c: New test.
>
> --- gcc/config/i386/sse.md.jj   2022-02-24 15:27:14.722743984 +0100
> +++ gcc/config/i386/sse.md      2022-03-04 13:56:34.863572916 +0100
> @@ -15210,7 +15210,15 @@ (define_expand "cond_mul<mode>"
>    DONE;
>  })
>
> -(define_insn "avx512dq_mul<mode>3<mask_name>"
> +(define_expand "avx512dq_mul<mode>3<mask_name>"
> +  [(set (match_operand:VI8_AVX512VL 0 "register_operand")
> +       (mult:VI8_AVX512VL
> +         (match_operand:VI8_AVX512VL 1 "bcst_vector_operand")
> +         (match_operand:VI8_AVX512VL 2 "bcst_vector_operand")))]
> +  "TARGET_AVX512DQ && <mask_mode512bit_condition>"
> +  "ix86_fixup_binary_operands_no_copy (MULT, <MODE>mode, operands);")
> +
> +(define_insn "*avx512dq_mul<mode>3<mask_name>"
>    [(set (match_operand:VI8_AVX512VL 0 "register_operand" "=v")
>         (mult:VI8_AVX512VL
>           (match_operand:VI8_AVX512VL 1 "bcst_vector_operand" "%v")
> @@ -16824,7 +16832,18 @@ (define_expand "cond_<code><mode>"
>    DONE;
>  })
>
> -(define_insn "<mask_codefor><code><mode>3<mask_name>"
> +(define_expand "<code><mode>3_mask"
> +  [(set (match_operand:VI48_AVX512VL 0 "register_operand")
> +       (vec_merge:VI48_AVX512VL
> +         (any_logic:VI48_AVX512VL
> +           (match_operand:VI48_AVX512VL 1 "bcst_vector_operand")
> +           (match_operand:VI48_AVX512VL 2 "bcst_vector_operand"))
> +         (match_operand:VI48_AVX512VL 3 "nonimm_or_0_operand")
> +         (match_operand:<avx512fmaskmode> 4 "register_operand")))]
> +  "TARGET_AVX512F"
> +  "ix86_fixup_binary_operands_no_copy (<CODE>, <MODE>mode, operands);")
> +
> +(define_insn "*<code><mode>3<mask_name>"
>    [(set (match_operand:VI48_AVX_AVX512F 0 "register_operand" "=x,x,v")
>         (any_logic:VI48_AVX_AVX512F
>           (match_operand:VI48_AVX_AVX512F 1 "bcst_vector_operand" "%0,x,v")
> --- gcc/testsuite/gcc.target/i386/pr104779.c.jj 2022-03-04 14:09:03.278961269 
> +0100
> +++ gcc/testsuite/gcc.target/i386/pr104779.c    2022-03-04 14:08:38.063318794 
> +0100
> @@ -0,0 +1,27 @@
> +/* PR target/104779 */
> +/* { dg-do compile } */
> +/* { dg-options "-O1 --param sccvn-max-alias-queries-per-access=0" } */
> +
> +__attribute__ ((simd)) int
> +foo (int x, int y, int z)
> +{
> +  return (x & y) * !!z;
> +}
> +
> +__attribute__ ((simd)) int
> +bar (int x, int y, int z)
> +{
> +  return (x | y) * !!z;
> +}
> +
> +__attribute__ ((simd)) int
> +baz (int x, int y, int z)
> +{
> +  return (x ^ y) * !!z;
> +}
> +
> +__attribute__ ((simd, target ("avx512dq"))) long
> +qux (long x, long y, long z)
> +{
> +  return (x * y) * !!z;
> +}
>
>         Jakub
>


-- 
BR,
Hongtao

Reply via email to