Attached patch corrects RMW operation with LEA peephole pattern. The
mode of the LEA is either SImode (for QImode, HImode or SImode
operation) or DImode.

2019-03-28  Uroš Bizjak  <ubiz...@gmail.com>

    PR target/89865
    * config/i386/i386.md (RMW operation with LEA peephole):
    Use LEAMODE mode attribute instead of SWI mode iterator for
    LEA pattern.

The patch triggers

FAIL: gcc.target/i386/pr49095.c scan-assembler-times \\\\), % 45

testsuite failure also for x86_64-linux-gnu. The adjusted number of
found patterns was wrong from the beginning and hid the uncovered
problem with LEA operation.

Bootstrapped and regression tested on x86_64-linux-gnu {,-m32}.

Committed to mainline SVN.

Uros.
Index: config/i386/i386.md
===================================================================
--- config/i386/i386.md (revision 269995)
+++ config/i386/i386.md (working copy)
@@ -18684,14 +18684,16 @@
 (define_peephole2
   [(set (match_operand:SWI 0 "register_operand")
        (match_operand:SWI 1 "memory_operand"))
-   (set (match_operand:SWI 3 "register_operand")
-       (plus:SWI (match_dup 0)
-                 (match_operand:SWI 2 "<nonmemory_operand>")))
-   (set (match_dup 1) (match_dup 3))
-   (set (reg FLAGS_REG) (compare (match_dup 3) (const_int 0)))]
+   (set (match_operand:<LEAMODE> 3 "register_operand")
+       (plus:<LEAMODE> (match_operand:<LEAMODE> 4 "register_operand")
+                       (match_operand:<LEAMODE> 2 "<nonmemory_operand>")))
+   (set (match_dup 1) (match_operand:SWI 5 "register_operand"))
+   (set (reg FLAGS_REG) (compare (match_dup 5) (const_int 0)))]
   "(TARGET_READ_MODIFY_WRITE || optimize_insn_for_size_p ())
+   && REGNO (operands[4]) == REGNO (operands[0])
+   && REGNO (operands[5]) == REGNO (operands[3])
    && peep2_reg_dead_p (4, operands[3])
-   && (rtx_equal_p (operands[0], operands[3])
+   && ((REGNO (operands[0]) == REGNO (operands[3]))
        || peep2_reg_dead_p (2, operands[0]))
    && !reg_overlap_mentioned_p (operands[0], operands[1])
    && !reg_overlap_mentioned_p (operands[3], operands[1])
@@ -18700,17 +18702,17 @@
        || immediate_operand (operands[2], QImode)
        || any_QIreg_operand (operands[2], QImode))
    && ix86_match_ccmode (peep2_next_insn (3), CCGOCmode)"
-  [(parallel [(set (match_dup 4) (match_dup 6))
-             (set (match_dup 1) (match_dup 5))])]
+  [(parallel [(set (match_dup 6) (match_dup 8))
+             (set (match_dup 1) (match_dup 7))])]
 {
-  operands[4] = SET_DEST (PATTERN (peep2_next_insn (3)));
-  operands[5]
+  operands[6] = SET_DEST (PATTERN (peep2_next_insn (3)));
+  operands[7]
     = gen_rtx_PLUS (<MODE>mode,
                    copy_rtx (operands[1]),
-                   operands[2]);
-  operands[6]
-    = gen_rtx_COMPARE (GET_MODE (operands[4]),
-                      copy_rtx (operands[5]),
+                   gen_lowpart (<MODE>mode, operands[2]));
+  operands[8]
+    = gen_rtx_COMPARE (GET_MODE (operands[6]),
+                      copy_rtx (operands[7]),
                       const0_rtx);
 })
 

Reply via email to