[i386] recognize haddpd

Marc Glisse Sun, 02 Sep 2012 09:20:47 -0700

Hello,

this patch passes bootstrap+testsuite. It is probably wrong in many ways,but I don't know enough to do more without some advice.

The goal is to recognize that v[0]+v[1] can be computed with haddpd. Withthe patch, v[0]-v[1] becomes hsubpd and v[1]+v[0] becomes haddpd. Also,thanks to it, {v[0]-v[1], w[0]-w[1]} is now recognized as a single hsubpd.


1) Is a define_insn the right tool?

2) {v[0]-v[1], v[0]-v[1]} is not recognized as a hsubpd becausevec_duplicate doesn't match vec_concat. Do we really need to duplicate(no pun intended) the pattern?3) v[0]+v[1] is not recognized. Some pass changed their order, and nothingtries the reverse order. I can see 3 ways: canonicalize the orderat some point, let combine try both orders for commutative operators ormake the patterns more flexible (I don't know how many would need changing).4) I don't understand the set_attr part. I copied it from the haddpddefine_insn, and removed (set_attr "type" "sseadd") because it crashed thecompiler. isa and prefix make sense and they match the alternatives, but Iam not sure about "mode" (removing it still works IIRC).



2012-09-02  Marc Glisse  <[email protected]>

gcc/
        * config/i386/sse.md (*sse3_h<plusminus_insn>v2df3_low): New.

gcc/testsuite/
        * gcc.target/i386/pr54400.c: New testcase.

--
Marc Glisse

Index: testsuite/gcc.target/i386/pr54400.c
===================================================================
--- testsuite/gcc.target/i386/pr54400.c (revision 0)
+++ testsuite/gcc.target/i386/pr54400.c (revision 0)
@@ -0,0 +1,11 @@
+/* { dg-do compile } */
+/* { dg-options "-O2 -msse3 -mfpmath=sse" } */
+
+#include <x86intrin.h>
+
+double f (__m128d p)
+{
+  return p[0] - p[1];
+}
+
+/* { dg-final { scan-assembler "hsubpd" } } */

Property changes on: testsuite/gcc.target/i386/pr54400.c
___________________________________________________________________
Added: svn:keywords
   + Author Date Id Revision URL
Added: svn:eol-style
   + native

Index: config/i386/sse.md
===================================================================
--- config/i386/sse.md  (revision 190861)
+++ config/i386/sse.md  (working copy)
@@ -1231,20 +1231,37 @@
            (vec_select:DF (match_dup 2) (parallel [(const_int 1)])))))]
   "TARGET_SSE3"
   "@
    h<plusminus_mnemonic>pd\t{%2, %0|%0, %2}
    vh<plusminus_mnemonic>pd\t{%2, %1, %0|%0, %1, %2}"
   [(set_attr "isa" "noavx,avx")
    (set_attr "type" "sseadd")
    (set_attr "prefix" "orig,vex")
    (set_attr "mode" "V2DF")])
 
+(define_insn "*sse3_h<plusminus_insn>v2df3_low"
+  [(set (match_operand:DF 0 "register_operand" "=x,x")
+       (plusminus:DF
+         (vec_select:DF
+           (match_operand:V2DF 1 "register_operand" "0,x")
+           (parallel [(const_int 0)]))
+         (vec_select:DF
+           (match_dup 1)
+           (parallel [(const_int 1)]))))]
+  "TARGET_SSE3"
+  "@
+   h<plusminus_mnemonic>pd\t{%0, %0|%0, %0}
+   vh<plusminus_mnemonic>pd\t{%1, %1, %0|%0, %1, %1}"
+  [(set_attr "isa" "noavx,avx")
+   (set_attr "prefix" "orig,vex")
+   (set_attr "mode" "V2DF")])
+
 (define_insn "avx_h<plusminus_insn>v8sf3"
   [(set (match_operand:V8SF 0 "register_operand" "=x")
        (vec_concat:V8SF
          (vec_concat:V4SF
            (vec_concat:V2SF
              (plusminus:SF
                (vec_select:SF
                  (match_operand:V8SF 1 "register_operand" "x")
                  (parallel [(const_int 0)]))
                (vec_select:SF (match_dup 1) (parallel [(const_int 1)])))

[i386] recognize haddpd

Reply via email to