on 2024/7/1 22:28, Richard Biener wrote:
> On Mon, Jul 1, 2024 at 8:16 AM Kewen.Lin <li...@linux.ibm.com> wrote:
>>
>> Hi,
>>
>> As PR115659 shows, assuming c = x CMP y, there are some
>> folding chances for patterns r = c ? -1/z : z/0.
>>
>> For r = c ? -1 : z, it can be folded into:
>>   - r = c | z (with ior_optab supported)
>>   - or r = c ? c : z
>>
>> while for r = c ?  z : 0, it can be foled into:
>>   - r = c & z (with and_optab supported)
>>   - or r = c ? z : c
>>
>> This patch is to teach ISEL to take care of them and also
>> remove the redundant gsi_replace as the caller of function
>> gimple_expand_vec_cond_expr will handle it.
> 
> Yeah, not the nicest API ...
> 
>> Bootstrapped and regtested on x86_64-redhat-linux and
>> powerpc64{,le}-linux-gnu.
>>
>> Is it ok for trunk?
> 
> Minor nit below
> 
>> BR,
>> Kewen
>> -----
>>         PR tree-optimization/115659
>>
>> gcc/ChangeLog:
>>
>>         * gimple-isel.cc (gimple_expand_vec_cond_expr): Add more foldings for
>>         patterns x CMP y ? -1 : z and x CMP y ? z : 0.
>> ---
>>  gcc/gimple-isel.cc | 48 +++++++++++++++++++++++++++++++++++++++-------
>>  1 file changed, 41 insertions(+), 7 deletions(-)
>>
>> diff --git a/gcc/gimple-isel.cc b/gcc/gimple-isel.cc
>> index 54c1801038b..71af1a8cd97 100644
>> --- a/gcc/gimple-isel.cc
>> +++ b/gcc/gimple-isel.cc
>> @@ -240,16 +240,50 @@ gimple_expand_vec_cond_expr (struct function *fun, 
>> gimple_stmt_iterator *gsi,
>>             can_compute_op0 = expand_vec_cmp_expr_p (op0a_type, op0_type,
>>                                                      tcode);
>>
>> -         /* Try to fold x CMP y ? -1 : 0 to x CMP y.  */
>>           if (can_compute_op0
>> -             && integer_minus_onep (op1)
>> -             && integer_zerop (op2)
>>               && TYPE_MODE (TREE_TYPE (lhs)) == TYPE_MODE (TREE_TYPE (op0)))
>>             {
>> -             tree conv_op = build1 (VIEW_CONVERT_EXPR, TREE_TYPE (lhs), 
>> op0);
>> -             gassign *new_stmt = gimple_build_assign (lhs, conv_op);
>> -             gsi_replace (gsi, new_stmt, true);
>> -             return new_stmt;
>> +             /* Assuming c = x CMP y.  */
>> +             bool op1_minus_onep = integer_minus_onep (op1);
>> +             bool op2_zerop = integer_zerop (op2);
>> +             tree vtype = TREE_TYPE (lhs);
>> +             machine_mode vmode = TYPE_MODE (vtype);
>> +             /* Try to fold r = c ? -1 : 0 to r = c.  */
>> +             if (op1_minus_onep && op2_zerop)
>> +               {
>> +                 tree conv_op = build1 (VIEW_CONVERT_EXPR, vtype, op0);
>> +                 return gimple_build_assign (lhs, conv_op);
>> +               }
>> +             /* Try to fold r = c ? -1 : z to r = c | z, or
>> +                r = c ? c : z.  */
>> +             if (op1_minus_onep)
>> +               {
>> +                 tree conv_op = build1 (VIEW_CONVERT_EXPR, vtype, op0);
>> +                 tree new_op0 = make_ssa_name (vtype);
>> +                 gassign *new_stmt = gimple_build_assign (new_op0, conv_op);
>> +                 gsi_insert_seq_before (gsi, new_stmt, GSI_SAME_STMT);
>> +                 if (optab_handler (ior_optab, vmode) != CODE_FOR_nothing)
>> +                   /* r = c | z */
>> +                   return gimple_build_assign (lhs, BIT_IOR_EXPR, new_op0,
>> +                                               op2);
>> +                 /* r = c ? c : z */
>> +                 op1 = new_op0;
> 
> maybe better call it new_op1 then?  Or new_op.
> 
>> +               }
>> +             /* Try to fold r = c ? z : 0 to r = c & z, or
>> +                r = c ? z : c.  */
>> +             else if (op2_zerop)
>> +               {
>> +                 tree conv_op = build1 (VIEW_CONVERT_EXPR, vtype, op0);
>> +                 tree new_op0 = make_ssa_name (vtype);
>> +                 gassign *new_stmt = gimple_build_assign (new_op0, conv_op);
>> +                 gsi_insert_seq_before (gsi, new_stmt, GSI_SAME_STMT);
>> +                 if (optab_handler (and_optab, vmode) != CODE_FOR_nothing)
>> +                   /* r = c | z */
>> +                   return gimple_build_assign (lhs, BIT_AND_EXPR, new_op0,
>> +                                               op1);
>> +                 /* r = c ? z : c */
>> +                 op2 = new_op0;
> 
> Likewise (new_op2 or also new_op).
> 
> OK with that nit fixed.

Thanks Richi, refined with new_op1/new_op2, re-tested well and pushed as 
r15-1763.

BR,
Kewen

> 
> Thanks,
> Richard.
> 
>> +               }
>>             }
>>
>>           /* When the compare has EH we do not want to forward it when
>> --
>> 2.43.0

Reply via email to