[to-be-committed][V2][RISC-V] Avoid unnecessary extensions after sCC insns

Jeff Law Thu, 05 Sep 2024 11:04:09 -0700

So the first patch failed the pre-commit CI; it didn't fail in mytesting because I'm using --with-arch to set a default configurationthat includes things like zicond to ensure that's always tested. Andthe failing test is skipped when zicond is enabled by default.

The failing test is designed to ensure that we don't miss anif-conversion due to costing issues around the extension that wastypically done in an sCC sequence (which is why it's only run whenzicond is off).

The test failed because we have a little routine that is highlydependent on the code generated by the sCC expander and will adjust thecosting to account for expansion quirks that usually go away in registerallocation.

That code needs to be enhanced to work after the sCC expansion change.Essentially it needs to account for the subreg extraction that shows upin the sequence as well as being a bit looser on mode checking.

I kept the code working for the old sequences -- in theory a user couldconjure up the old sequence so handling them seems useful.

This also drops the testsuite changes. Palmer's change makes themunnecessary.


Waiting on pre-commit CI before taking any further action...

Jeff

diff --git a/gcc/config/riscv/riscv.cc b/gcc/config/riscv/riscv.cc
index a38cb72f09f..39489c4377e 100644
--- a/gcc/config/riscv/riscv.cc
+++ b/gcc/config/riscv/riscv.cc
@@ -4218,11 +4218,29 @@ riscv_noce_conversion_profitable_p (rtx_insn *seq,
              riscv_if_info.original_cost += COSTS_N_INSNS (1);
              riscv_if_info.max_seq_cost += COSTS_N_INSNS (1);
            }
-         last_dest = NULL_RTX;
+
          rtx dest = SET_DEST (x);
-         if (COMPARISON_P (src)
+
+         /* Do something similar for the  moves that are likely to
+            turn into NOP moves by the time the register allocator is
+            done.  These are also side effects of how our sCC expanders
+            work.  We'll want to check and update LAST_DEST here too.  */
+         if (last_dest
              && REG_P (dest)
-             && GET_MODE (dest) == SImode)
+             && GET_MODE (dest) == SImode
+             && SUBREG_P (src)
+             && SUBREG_PROMOTED_VAR_P (src)
+             && REGNO (SUBREG_REG (src)) == REGNO (last_dest))
+           {
+             riscv_if_info.original_cost += COSTS_N_INSNS (1);
+             riscv_if_info.max_seq_cost += COSTS_N_INSNS (1);
+             if (last_dest)
+               last_dest = dest;
+           }
+         else
+           last_dest = NULL_RTX;
+
+         if (COMPARISON_P (src) && REG_P (dest))
            last_dest = dest;
        }
       else
@@ -4904,13 +4922,31 @@ riscv_expand_int_scc (rtx target, enum rtx_code code, 
rtx op0, rtx op1, bool *in
   riscv_extend_comparands (code, &op0, &op1);
   op0 = force_reg (word_mode, op0);
 
+  /* For sub-word targets on rv64, do the computation in DImode
+     then extract the lowpart for the final target, marking it
+     as sign extended.  Note that it's also properly zero extended,
+     but it's probably more profitable to expose it as sign extended.  */
+  rtx t;
+  if (TARGET_64BIT && GET_MODE (target) == SImode)
+    t = gen_reg_rtx (DImode);
+  else
+    t = target;
+
   if (code == EQ || code == NE)
     {
       rtx zie = riscv_zero_if_equal (op0, op1);
-      riscv_emit_binary (code, target, zie, const0_rtx);
+      riscv_emit_binary (code, t, zie, const0_rtx);
     }
   else
-    riscv_emit_int_order_test (code, invert_ptr, target, op0, op1);
+    riscv_emit_int_order_test (code, invert_ptr, t, op0, op1);
+
+  if (t != target)
+    {
+      t = gen_lowpart (SImode, t);
+      SUBREG_PROMOTED_VAR_P (t) = 1;
+      SUBREG_PROMOTED_SET (t, SRP_SIGNED);
+      emit_move_insn (target, t);
+    }
 }
 
 /* Like riscv_expand_int_scc, but for floating-point comparisons.  */

[to-be-committed][V2][RISC-V] Avoid unnecessary extensions after sCC insns

Reply via email to