[PATCH] Fix SFmode subreg of DImode and TImode

This patch fixes the breakage in the PowerPC due to a recent change in SUBREG
behavior.  While it is arguable that the patch that caused the breakage should
be reverted, this patch should be a bandage to prevent these changes from
happening again.

I first noticed it in building the Spec 2017 wrf_r and blender_r
benchmarks.  Once I applied this patch, I also noticed several of the
tests now pass.

The core of the problem is we need to treat SUBREG's of SFmode and SImode
specially on the PowerPC.  This is due to the fact that SFmode values that are
in the vector and floating point registers are represented as DFmode.  When we
want to do a direct move between the GPR registers and the vector registers, we
have to convert the value from the DFmode representation to/from the SFmode
representation.

By doing this special processing instead of doing the transfer via store and
load, we were able to speed up the math library which at times want to use the
SFmode values in a union, and do logical operations on it (to test exponent
ranges, etc.) and then move it over to use as a floating point value.

I did a bootstrap build on a little endian power9 system with and without the
patch applied.  There was no regression in the tests.  I'm doing a build on a
big endian power8 system, but it hasn't finished yet as I sent this email.  I
will check on the big endian progress tomorrow morning.

The following tests now pass once again with the test.

        C tests:
        ========
        gcc.c-torture/compile/20071102-1.c
        gcc.c-torture/compile/pr55921.c
        gcc.c-torture/compile/pr85945.c
        gcc.c-torture/execute/complex-3.c
        gcc.dg/atomic/c11-atomic-exec-1.c
        gcc.dg/atomic/c11-atomic-exec-2.c
        gcc.dg/atomic/c11-atomic-exec-4.c
        gcc.dg/atomic/c11-atomic-exec-5.c
        gcc.dg/c11-atomic-2.c
        gcc.dg/pr42475.c
        gcc.dg/pr47201.c
        gcc.dg/pr48335-1.c
        gcc.dg/torture/pr67741.c
        gcc.dg/tree-ssa/ssa-dom-thread-10.c
        gcc.dg/tsan/pr88030.c
        gcc.dg/ubsan/float-cast-overflow-atomic.c
        gcc.dg/vect/no-tree-sra-bb-slp-pr50730.c

        C++ tests:
        ==========
        g++.dg/opt/alias1.C
        g++.dg/template/koenig6.C
        g++.dg/torture/pr40924.C
        tmpdir-g++.dg-struct-layout-1/t001

        Fortran tests:
        ==============
        gfortran.dg/array_constructor_type_22.f03
        gfortran.dg/array_function_6.f90
        gfortran.dg/derived_comp_array_ref_7.f90
        gfortran.dg/elemental_scalar_args_1.f90
        gfortran.dg/elemental_subroutine_1.f90
        gfortran.dg/inline_matmul_5.f90
        gfortran.dg/inline_matmul_8.f90
        gfortran.dg/inline_matmul_9.f90
        gfortran.dg/matmul_bounds_6.f90
        gfortran.dg/operator_1.f90
        gfortran.dg/past_eor.f90
        gfortran.dg/pr101121.f
        gfortran.dg/pr91552.f90
        gfortran.dg/spread_shape_1.f90
        gfortran.dg/typebound_operator_3.f03
        gfortran.dg/value_1.f90
        gfortran.fortran-torture/execute/entry_4.f90
        gfortran.fortran-torture/execute/intrinsic_dotprod.f90
        gfortran.fortran-torture/execute/intrinsic_matmul.f90

Can I check this fix into the master branch?

2021-09-06  Michael Meissner  <meiss...@linux.ibm.com>

gcc/

        * config/rs6000/rs6000.c (rs6000_emit_move_si_sf_subreg): Deal
        with SUBREGs of TImode and DImode.
---
 gcc/config/rs6000/rs6000.c | 10 ++++++++++
 1 file changed, 10 insertions(+)

diff --git a/gcc/config/rs6000/rs6000.c b/gcc/config/rs6000/rs6000.c
index b9ebd56c993..7bbf29a3e1c 100644
--- a/gcc/config/rs6000/rs6000.c
+++ b/gcc/config/rs6000/rs6000.c
@@ -10942,6 +10942,16 @@ rs6000_emit_move_si_sf_subreg (rtx dest, rtx source, 
machine_mode mode)
          return true;
        }
 
+      /* In case we are given a SUBREG for a larger type, reduce it to
+        SImode.  */
+      if (mode == SFmode && GET_MODE_SIZE (inner_mode) > 4)
+       {
+         rtx tmp = gen_reg_rtx (SImode);
+         emit_move_insn (tmp, gen_lowpart (SImode, source));
+         emit_insn (gen_movsf_from_si (dest, tmp));
+         return true;
+       }
+
       if (mode == SFmode && inner_mode == SImode)
        {
          emit_insn (gen_movsf_from_si (dest, inner_source));
-- 
2.31.1


-- 
Michael Meissner, IBM
IBM, M/S 2506R, 550 King Street, Littleton, MA 01460-6245, USA
email: meiss...@linux.ibm.com, phone: +1 (978) 899-4797

Reply via email to