Tamar Christina <tamar.christ...@arm.com> writes:
> @@ -11279,7 +11279,22 @@ aarch64_rtx_mult_cost (rtx x, enum rtx_code code, 
> int outer, bool speed)
>    op1 = XEXP (x, 1);
 
>    if (VECTOR_MODE_P (mode))
> -    mode = GET_MODE_INNER (mode);
> +    {
> +      unsigned int vec_flags = aarch64_classify_vector_mode (mode);
> +      mode = GET_MODE_INNER (mode);
> +      if (vec_flags & VEC_ADVSIMD)
> +     {
> +       /* The by element versions of the instruction has the same costs as 
> the

“by-element”, s/has/have/

> +          normal 3 vector version.  So don't add the costs of the duplicate 
> into

“3-vector”

> +          the costs of the multiply.  We make an assumption that the value in

Maybe s/value in/input to/?

> +          the VEC_DUPLICATE is already the FP&SIMD side.  This means costing 
> of

“already on the”

Some lines are over the 80-character limit.

OK with those changes, thanks.

Richard

> +          a MUL by element pre RA is a bit optimistic.  */
> +       if (GET_CODE (op0) == VEC_DUPLICATE)
> +         op0 = XEXP (op0, 0);
> +       else if (GET_CODE (op1) == VEC_DUPLICATE)
> +         op1 = XEXP (op1, 0);
> +     }
> +    }
>  
>    /* Integer multiply/fma.  */
>    if (GET_MODE_CLASS (mode) == MODE_INT)


Reply via email to