Tamar Christina <tamar.christ...@arm.com> writes:
> Hi All,
>
> While writing a patch series I started getting incorrect codegen out from
> VEC_PERM on partial struct types.
>
> It turns out that this was happening because the TARGET_CAN_CHANGE_MODE_CLASS
> implementation has a slight bug in it.  The hook only checked for SIMD to
> Partial but never Partial to SIMD.   This resulted in incorrect subregs to be
> generated from the fallback code in VEC_PERM_EXPR expansions.
>
> I have unfortunately not been able to trigger it using a standalone testcase 
> as
> the mid-end optimizes away the permute every time I try to describe a permute
> that would result in the bug.
>
> The patch now rejects any conversion of partial SIMD struct types, unless they
> are both partial structures of the same number of registers or one is a SIMD
> type who's size is less than 8 bytes.
>
> Bootstrapped Regtested on aarch64-none-linux-gnu and no issues.
>
> Ok for master? And backport to GCC 12?
>
> Thanks,
> Tamar
>
> gcc/ChangeLog:
>
>       * config/aarch64/aarch64.cc (aarch64_can_change_mode_class): Restrict
>       conversions between partial struct types properly.
>
> --- inline copy of patch -- 
> diff --git a/gcc/config/aarch64/aarch64.cc b/gcc/config/aarch64/aarch64.cc
> index 
> d3c3650d7d728f56adb65154127dc7b72386c5a7..84dbe2f4ea7d03b424602ed98a34e7824217dc91
>  100644
> --- a/gcc/config/aarch64/aarch64.cc
> +++ b/gcc/config/aarch64/aarch64.cc
> @@ -26471,9 +26471,10 @@ aarch64_can_change_mode_class (machine_mode from,
>    bool from_pred_p = (from_flags & VEC_SVE_PRED);
>    bool to_pred_p = (to_flags & VEC_SVE_PRED);
>  
> -  bool from_full_advsimd_struct_p = (from_flags == (VEC_ADVSIMD | 
> VEC_STRUCT));
>    bool to_partial_advsimd_struct_p = (to_flags == (VEC_ADVSIMD | VEC_STRUCT
>                                                  | VEC_PARTIAL));
> +  bool from_partial_advsimd_struct_p = (from_flags == (VEC_ADVSIMD | 
> VEC_STRUCT
> +                                                | VEC_PARTIAL));
>  
>    /* Don't allow changes between predicate modes and other modes.
>       Only predicate registers can hold predicate modes and only
> @@ -26496,9 +26497,23 @@ aarch64_can_change_mode_class (machine_mode from,
>      return false;
>  
>    /* Don't allow changes between partial and full Advanced SIMD structure
> -     modes.  */
> -  if (from_full_advsimd_struct_p && to_partial_advsimd_struct_p)
> -    return false;
> +     modes unless both are a partial struct with the same number of registers
> +     or the vector bitsizes must be the same.  */
> +  if (to_partial_advsimd_struct_p ^ from_partial_advsimd_struct_p)
> +    {
> +      /* If they're both partial structures, allow if they have the same 
> number
> +      or registers.  */
> +      if (to_partial_advsimd_struct_p == from_partial_advsimd_struct_p)
> +     return known_eq (GET_MODE_SIZE (from), GET_MODE_SIZE (to));

It looks like the ^ makes this line unreachable.  I guess it should
be a separate top-level condition.

> +      /* If one is a normal SIMD register, allow only if no larger than 
> 64-bit.  */
> +      if ((to_flags & VEC_ADVSIMD) == to_flags)
> +     return known_le (GET_MODE_SIZE (to), 8);
> +      else if ((from_flags & VEC_ADVSIMD) == from_flags)
> +     return known_le (GET_MODE_SIZE (from), 8);
> +
> +      return false;
> +    }

I don't think we need to restrict this to SIMD modes.  A plain DI would
be OK too.  So I think it should just be:

    return (known_le (GET_MODE_SIZE (to), 8)
            || known_le (GET_MODE_SIZE (from, 8));

Thanks,
Richard

>  
>    if (maybe_ne (BITS_PER_SVE_VECTOR, 128u))
>      {

Reply via email to