On Tue, Oct 25, 2016 at 1:21 PM, Bin Cheng <bin.ch...@arm.com> wrote:
> Hi,
> This is a patch set adding various match.pd patterns in order to generate 
> simplified MIN/MAX_EXPR mostly from COND_EXPR.  This is the first one 
> optimizing (convert1 (minmax ((convert2 (x) c)))) to minmax (x c), if 
> convert2 promotes x and convert1 demotes back to x's type.  With this patch, 
> generated assembly for test:
> .L4:
>         ldr     q0, [x3, x1]
>         add     w2, w2, 1
>         cmp     w0, w2
>         ushll   v2.4s, v0.4h, 0
>         ushll2  v1.4s, v0.8h, 0
>         umin    v2.4s, v2.4s, v3.4s
>         umin    v1.4s, v1.4s, v3.4s
>         xtn     v4.4h, v2.4s
>         xtn2    v4.8h, v1.4s
>         str     q4, [x3, x1]
>         add     x1, x1, 16
>         bhi     .L4
>
> can be improved to:
> .L4:
>         ldr     q0, [x3, x1]
>         add     w2, w2, 1
>         cmp     w0, w2
>         umin    v1.8h, v0.8h, v2.8h
>         str     q1, [x3, x1]
>         add     x1, x1, 16
>         bhi     .L4
>
> Bootstrap and test on x86_64 and AArch64 for whole patch set.  Is it OK?

Why restrict to GIMPLE?

+/* (convert1 (minmax ((convert2 (x) c)))) -> minmax (x c) if convert2
+   promotes x and convert1 demotes back to x's type.  */
+
+(for minmax (min max)
+ (simplify
+  (convert (minmax@0 (convert @1) INTEGER_CST@2))
+  (if (types_compatible_p (TREE_TYPE (@1), type))

comment mentions convert1 and convert2, just convert is correct I think.

Please use types_match instead of types_compatible_p, this is a
wrapper that does the correct thing for both GENERIC and GIMPLE.

+   (with
+    {
+      tree minmax_type = TREE_TYPE (@0);
+      signop sgn = TYPE_SIGN (type);
+      widest_int type_min = widest_int::from (wi::min_value (type), sgn);
+      widest_int type_max = widest_int::from (wi::max_value (type), sgn);
+    }
+    (if (sgn == TYPE_SIGN (minmax_type)
+        && TYPE_PRECISION (minmax_type) >= TYPE_PRECISION (type)
+        && wi::to_widest (@2) >= type_min && wi::to_widest (@2) <= type_max)

instead of this you can use int_fits_type_p (type, @2)

+     (minmax @1 { fold_convert (type, @2); }))))))

use

     (minmax @1 (convert @2))

I believe the transform is also a win if @2 is not a constant but similarly
promoted as @1.  This slightly complicates the patter and thus can
be done as followup (if we think it's useful at this point).

With the simplification you should get rid of the with{}

Thanks,
Richard.

> Thanks,
> bin
>
> 2016-10-21  Bin Cheng  <bin.ch...@arm.com>
>
>         * match.pd ((convert1 (minmax ((convert2 (x) c)))) -> minmax (x c)):
>         New pattern.
>
> gcc/testsuite/ChangeLog
> 2016-10-21  Bin Cheng  <bin.ch...@arm.com>
>
>         * gcc.dg/fold-convmaxconv-1.c: New test.
>         * gcc.dg/fold-convminconv-1.c: New test.

Reply via email to