On Mon, 26 Jun 2023, juzhe.zh...@rivai.ai wrote:

> From: Ju-Zhe Zhong <juzhe.zh...@rivai.ai>
> 
> Hi, this patch is to add LEN_MASK_STORE into SCCVN.
> 
> LEN_MASK_STORE is predicated by both len and mask together.
> My understanding is that LEN_MASK_STORE has same rhs_off and offset as 
> MASK_STORE.
> The size = MIN (length (deduced from mask), (len + bias)).
> 
> Not sure my understanding it correct or no. 
> Hope experts (both Richard && Richi) can correct me if I am wrong.
> 
> Thanks.
> 
> gcc/ChangeLog:
> 
>         * tree-ssa-sccvn.cc (vn_reference_lookup_3): Add LEN_MASK_STORE.
> 
> ---
>  gcc/tree-ssa-sccvn.cc | 17 +++++++++++++++++
>  1 file changed, 17 insertions(+)
> 
> diff --git a/gcc/tree-ssa-sccvn.cc b/gcc/tree-ssa-sccvn.cc
> index 11061a374a2..d1629800705 100644
> --- a/gcc/tree-ssa-sccvn.cc
> +++ b/gcc/tree-ssa-sccvn.cc
> @@ -3304,6 +3304,16 @@ vn_reference_lookup_3 (ao_ref *ref, tree vuse, void 
> *data_,
>         if (!tree_fits_uhwi_p (len) || !tree_fits_shwi_p (bias))
>           return (void *)-1;
>         break;
> +     case IFN_LEN_MASK_STORE:
> +       len = gimple_call_arg (call, 2);
> +       bias = gimple_call_arg (call, 5);
> +       if (!tree_fits_uhwi_p (len) || !tree_fits_shwi_p (bias))
> +         return (void *)-1;
> +       mask = gimple_call_arg (call, internal_fn_mask_index (fn));
> +       mask = vn_valueize (mask);
> +       if (TREE_CODE (mask) != VECTOR_CST)
> +         return (void *)-1;
> +       break;
>       default:
>         return (void *)-1;
>       }
> @@ -3379,6 +3389,13 @@ vn_reference_lookup_3 (ao_ref *ref, tree vuse, void 
> *data_,
>                     pd.rhs_off = start;
>                     pd.offset = offset2i + start;
>                     pd.size = length;
> +                   if (fn == IFN_LEN_MASK_STORE)

can you use if (len != 0) here?

> +                     {
> +                       HOST_WIDE_INT len_bitsize
> +                         = (tree_to_uhwi (len) + tree_to_shwi (bias))
> +                           * BITS_PER_UNIT;
> +                       pd.size = length > len_bitsize ? length : len_bitsize;
> +                     }
>                     if (ranges_known_overlap_p (offset, maxsize,
>                                                 pd.offset, pd.size))
>                       return data->push_partial_def (pd, set, set,

the code continues with

                      mask_idx++;
                    }
                  while (known_lt (mask_idx, TYPE_VECTOR_SUBPARTS 
(vectype)));

it might be nicer to store TYPE_VECTOR_SUBPARTS in a variable and
trim that on 'len' instead?

Thanks,
Richard.

Reply via email to