[Bug target/117718] Inefficient address computation for d-form vector loads

2024-11-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117718 --- Comment #3 from Michael Meissner --- No, the issue is with DQ addressing (i.e. vector load/store with offset): we can't guarantee that the external address will be properly aligned, with the bottom 4 bits set to 0. In theory, we have

[Bug target/117729] New: On power10 consider using vector pair load/store in prologue/epilog in saving vector registers

2024-11-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
Severity: enhancement Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- GCC should consider using the load vector pair and store vector pair instructions in the prologue

[Bug target/117721] New: Big endian test suite failures comparing default cpu and --with-cpu=power7

2024-11-20 Thread meissner at gcc dot gnu.org via Gcc-bugs
: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- I build a GCC trunk on the gcc110 cfarm system. I got the following failures when I built GCC without using --with-cpu

[Bug target/79137] Improve powerpc vector permutes

2024-11-17 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=79137 Michael Meissner changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/117487] New: Power8 optimizations for math library aren't done in power9 or power10 (PR target/71977)

2024-11-07 Thread meissner at gcc dot gnu.org via Gcc-bugs
IRMED Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- I was answering an email about something else, and I wanted to look up code that I added in Januar

[Bug target/117251] SHA3 code for PowerPC has a major slow down

2024-10-22 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117251 --- Comment #11 from Michael Meissner --- For singlebuff.c, there is a clear improvement when using the XXEVAL instruction: XXEVAL TRUNK GCC14 GCC13 GCC12 GCC11 -- - - - - - -O3: 4.46 5.40

[Bug target/117251] SHA3 code for PowerPC has a major slow down

2024-10-22 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117251 --- Comment #10 from Michael Meissner --- There is an instruction that was added in power10 (XXEVAL) that does provide fusion between VSX vectors that includes ANDC->XOR and XOR->XOR fusion. I have coded up patches to support this and I will be

[Bug target/117251] SHA3 code for PowerPC has a major slow down

2024-10-22 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117251 --- Comment #9 from Michael Meissner --- I tried several of the options to change the code generation: -mno-power10-fusion which disables doing the fusion pairing. Combinations of -fno-schedule-insns and -fno-schedule-insns2. -fno-sched-press

[Bug target/117251] SHA3 code for PowerPC has a major slow down

2024-10-22 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117251 --- Comment #8 from Michael Meissner --- I added an option to not do the combiner patterns until after reload, and it does not seem to fire at all.

[Bug target/117251] SHA3 code for PowerPC has a major slow down

2024-10-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117251 --- Comment #5 from Michael Meissner --- For the singlebuff.c benchmark, the numbers are: Trunk (sources checked out October 5th):5.40 seconds GCC 14 (sources checked out October 21st): 5.40 seconds GCC 13 (sources checked out October 21

[Bug target/117251] SHA3 code for PowerPC has a major slow down

2024-10-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117251 --- Comment #6 from Michael Meissner --- Note, in the first comment, I mis-read the instruction, and the instruction being used is vector unsigned long long rotate left, and not vector unsigned long long shift left. I.e.:

[Bug target/117251] SHA3 code for PowerPC has a major slow down

2024-10-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117251 --- Comment #4 from Michael Meissner --- I tracked down the commit that first made the slowdown visible: commit 3a61ca1b9256535e1bfb19b2d46cde21f3908a5d (HEAD) Author: Jan Hubicka Date: Thu Jul 6 18:56:22 2023 +0200 Improve profile upda

[Bug target/117251] SHA3 code for PowerPC has a major slow down

2024-10-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117251 Michael Meissner changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |meissner at gcc dot gnu.org

[Bug target/117251] SHA3 code for PowerPC has a major slow down

2024-10-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117251 --- Comment #2 from Michael Meissner --- Created attachment 59406 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=59406&action=edit Singlebuff.c test The singlebuff.c is a simpler test case than multibuff.c. However, the numbers quoted an

[Bug target/117251] SHA3 code for PowerPC has a major slow down

2024-10-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=117251 Michael Meissner changed: What|Removed |Added Priority|P3 |P2 Version|15.0

[Bug target/117251] New: SHA3 code for PowerPC has a major slow down

2024-10-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- Created attachment 59405 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=59405&action=edit Multibuff.c test The sha3 functions compiled for the powerpc has a s

[Bug target/89213] Optimize V2DI shifts by a constant on power8 & above systems.

2024-10-07 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89213 Michael Meissner changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug libstdc++/114742] invalid use of '__ieee128' in and

2024-09-19 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114742 --- Comment #4 from Michael Meissner --- The minimum architecture for IEEE 128-bit support is power7, because it needs the VSX registers to pass and return IEEE 128-bit values. Now, in theory, IEEE 128-bit support could have required only Altiv

[Bug target/107757] PPCLE: Inefficient vector constant creation

2024-08-16 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107757 --- Comment #4 from Michael Meissner --- Note, this code only shows up when the target CPU is power8. For the following code: vector long long lsb64() { return vec_splats(1LL); } Both power9 and power10 generate: xxspltib 34,1

[Bug target/89213] Optimize V2DI shifts by a constant on power8 & above systems.

2024-08-12 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89213 Michael Meissner changed: What|Removed |Added Attachment #58918|0 |1 is obsolete|

[Bug target/89213] Optimize V2DI shifts by a constant on power8 & above systems.

2024-08-12 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=89213 Michael Meissner changed: What|Removed |Added Attachment #45612|0 |1 is obsolete|

[Bug target/107757] PPCLE: Inefficient vector constant creation

2024-08-12 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107757 --- Comment #3 from Michael Meissner --- As Segher says, the test is not quite correct. I would write it as: vector long long lsb64_opt() { vector long long a = vec_splats(~0LL); __asm__("vsrd %0,%1,%2":"=v"(a):"v"(a),"v"(a)); return

[Bug target/115800] PowerPC GCC cannot build a little endian compile if --with-cpu=power5 is used

2024-07-05 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115800 --- Comment #6 from Michael Meissner --- Of course it would also apply if you are building a BE compiler that has little endian multilibs, you would run into the same situation.

[Bug target/115800] PowerPC GCC cannot build a little endian compile if --with-cpu=power5 is used

2024-07-05 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115800 --- Comment #5 from Michael Meissner --- And libstdc++-v3 errors are similar: mkdir -p ./powerpc64le-unknown-linux-gnu/bits/stdc++.h.gch /home/meissner/fsf-build-ppc64le/work171-p5/./gcc/xgcc -shared-libgcc -B/home/meissner/fsf-build-ppc64le/wo

[Bug target/115800] PowerPC GCC cannot build a little endian compile if --with-cpu=power5 is used

2024-07-05 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=115800 --- Comment #4 from Michael Meissner --- Libgfortran gives various errors that _Float128 is not supported on this target. libtool: compile: /home/meissner/fsf-build-ppc64le/work171-p5/./gcc/xgcc -B/home/meissner/fsf-build-ppc64le/work171-p5/./

[Bug target/115800] New: PowerPC GCC cannot build a little endian compile if --with-cpu=power5 is used

2024-07-05 Thread meissner at gcc dot gnu.org via Gcc-bugs
Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- The libgfortran and libstdc++-v3 libraries cannot be built if you build a little endian compiler and set the default

[Bug target/113652] [14 regression] Failed bootstrap on ppc unrecognized opcode: `lfiwzx' with -mcpu=7450

2024-04-12 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113652 --- Comment #23 from Michael Meissner --- This is one of those things where there is no right answer in part because we need other things to flesh out the support. The reason -mvsx was used is we need the VSX registers to build the IEEE 128-bit

[Bug target/94630] General bug for changes needed to switch the powerpc64le-linux long double default

2024-04-11 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=94630 Michael Meissner changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/101019] GCC should consider using PLI/SLDI/PADDI to load up 64-bit constants on power10

2024-04-11 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101019 Michael Meissner changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/99708] __SIZEOF_FLOAT128__ not defined on powerpc64le-linux

2024-04-10 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99708 Michael Meissner changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug libstdc++/104772] std::numeric_limits<__float128> should be specialized

2024-04-10 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104772 Bug 104772 depends on bug 99708, which changed state. Bug 99708 Summary: __SIZEOF_FLOAT128__ not defined on powerpc64le-linux https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99708 What|Removed |Added -

[Bug target/110960] TestSatWidenMulPairwiseAdd in the Google Highway test suite fails when compiled with GCC 12 or later with the -mcpu=power9 option

2024-03-29 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=110960 Michael Meissner changed: What|Removed |Added CC||meissner at gcc dot gnu.org

[Bug target/113652] [14 regression] Failed bootstrap on ppc unrecognized opcode: `lfiwzx' with -mcpu=7450

2024-03-28 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113652 --- Comment #19 from Michael Meissner --- When I wrote the VSX support many years ago, I intended that -mvsx enable all of ISA 2.06, which includes ISA 2.05, etc. My intention was that there be 2 options for power7, one is the base ISA 2.07 support f

[Bug target/70928] Load simple float constants via VSX operations on PowerPC

2024-03-27 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70928 Michael Meissner changed: What|Removed |Added Ever confirmed|0 |1 Last reconfirmed|

[Bug bootstrap/31418] Bootstrap failure with -O2 -funroll-loops -funsafe-math-optimizations options on PPC

2024-03-27 Thread meissner at gcc dot gnu.org via Gcc-bugs
|RESOLVED CC||meissner at gcc dot gnu.org --- Comment #2 from Michael Meissner --- I built the current GCC 14 development compiler using -O2 -funroll-loops -funsafe-math-optimizations, and it built fine. I suspect it had been fixed

[Bug target/112886] New: We need a new print_operand output modifier for vector double

2023-12-06 Thread meissner at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- I've been working with vector double support to provide faster memory latency for specialized applications. While the work

[Bug target/104698] Inefficient code for DI to TI sign extend on power10

2023-10-13 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104698 Michael Meissner changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug target/111778] PowerPC constant code change uses an undefined shift

2023-10-11 Thread meissner at gcc dot gnu.org via Gcc-bugs
, ||meissner at gcc dot gnu.org, ||segher at gcc dot gnu.org Build||powerpc64le-unknown-linux-g ||nu Target||powerpc64le-unknown

[Bug target/111778] New: PowerPC constant code change uses an undefined shift

2023-10-11 Thread meissner at gcc dot gnu.org via Gcc-bugs
Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- I was building a cross compiler to PowerPC on my x86_64 workstation with the latest version of GCC on October 11th. I could not build the compiler on the x86_64

[Bug target/105325] power10: Error: operand out of range

2023-07-05 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105325 Michael Meissner changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/103498] Spec 2017 imagick_r is 2.62% slower on Power10 with pc-relative addressing compared to not using pc-relative addressing

2023-06-01 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103498 Michael Meissner changed: What|Removed |Added Resolution|--- |FIXED Status|UNCONFIRMED

[Bug target/109067] Powerpc GCC does not support __ibm128 complex multiply/divide if long double is IEEE 128-bit.

2023-04-11 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=109067 Michael Meissner changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/70243] PowerPC V4SFmode should not use Altivec instructions on VSX systems

2023-04-05 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70243 --- Comment #5 from Michael Meissner --- Created attachment 54814 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=54814&action=edit Test case This is a test case that shows the generation of fmaddfp and fnmsubfp.

[Bug target/105325] power10: Error: operand out of range

2023-03-20 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=105325 Michael Meissner changed: What|Removed |Added Assignee|acsawdey at gcc dot gnu.org|meissner at gcc dot gnu.org

[Bug target/109067] New: Powerpc GCC does not support __ibm128 complex multiply/divide if long double is IEEE 128-bit.

2023-03-08 Thread meissner at gcc dot gnu.org via Gcc-bugs
Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: ---

[Bug target/108958] New: Powerpcle could generate mtvsrdd for zero extend DI to TI mode, when the TImode is in a vector register

2023-02-27 Thread meissner at gcc dot gnu.org via Gcc-bugs
: UNCONFIRMED Severity: enhancement Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- If you have a DImode variable (i.e. long) in a GPR, and you want to zero extend it to

[Bug middle-end/108623] We need to grow the precision field in tree_type_common for PowerPC

2023-02-01 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108623 --- Comment #7 from Michael Meissner --- Created attachment 54387 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=54387&action=edit Proposed patch combining Richard's patch and an assertion.

[Bug middle-end/108623] We need to grow the precision field in tree_type_common for PowerPC

2023-02-01 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108623 Michael Meissner changed: What|Removed |Added Last reconfirmed||2023-02-01 Ever confirmed|0

[Bug middle-end/108623] We need to grow the precision field in tree_type_common for PowerPC

2023-02-01 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108623 --- Comment #4 from Michael Meissner --- I must have missed the spare bits. I think it is better to use the full 16 bits for precision. I also think your other changes to realign bit fields greater than 1 bit.

[Bug other/108623] New: We need to grow the precision field in tree_type_common for PowerPC

2023-02-01 Thread meissner at gcc dot gnu.org via Gcc-bugs
Priority: P3 Component: other Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- The current patches that have been submitted to the PowerPC back end need to grow the precision field in the tree_type_common

[Bug target/93738] [10/11/12/13 regression] test case gcc.target/powerpc/20050603-3.c fails

2022-11-30 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=93738 Michael Meissner changed: What|Removed |Added CC||meissner at gcc dot gnu.org

[Bug testsuite/106345] Some ppc64le tests fail with -mcpu=power9 -mtune=power9

2022-08-18 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=106345 Michael Meissner changed: What|Removed |Added CC||meissner at gcc dot gnu.org

[Bug target/106682] New: Powerpc test gcc.target/powerpc/pr86731-fwrapv-longlong.c fails on power8, passes on power9/power10

2022-08-18 Thread meissner at gcc dot gnu.org via Gcc-bugs
Status: UNCONFIRMED Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- I was doing builds on a power10 for patch submission, and I noticed the

[Bug testsuite/106681] New: Powerpc test gcc.dg/pr104992.c fails on power10

2022-08-18 Thread meissner at gcc dot gnu.org via Gcc-bugs
Component: testsuite Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- I was doing builds on a power10 system for patch submission, and I noticed the following test fails when the test is compiled for power10, but it does not fail

[Bug testsuite/106680] New: Test gcc.target/powerpc/bswap64-4.c fails on 32-bit BE

2022-08-18 Thread meissner at gcc dot gnu.org via Gcc-bugs
Component: testsuite Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- I was doing some builds for submitting patches, and I did runs on BE systems as well as LE systems. I noticed the test gcc.target/powerpc/bswap64-4.c fails

[Bug testsuite/101169] [10 regression] test case gcc.target/powerpc/fold-vec-extract-char.p7.c fails after r10-9880

2022-08-18 Thread meissner at gcc dot gnu.org via Gcc-bugs
||meissner at gcc dot gnu.org Status|UNCONFIRMED |NEW Last reconfirmed||2022-08-18 --- Comment #3 from Michael Meissner --- The fold-vec-extract tests work fine on the development version of GCC 13 for 64-bit, but they are

[Bug fortran/96983] [11/12 regression] ICE compiling gfortran.dg/pr96711.f90 starting with r11-3042

2022-03-17 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=96983 Michael Meissner changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug target/104868] [12 Regression] powerpc: Compiling libgfortran with -flto failing with GCC 12

2022-03-11 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104868 --- Comment #8 from Michael Meissner --- Matheus, try the patch I just attached to the PR that I posted to the gcc-patches mailing list.

[Bug target/104868] [12 Regression] powerpc: Compiling libgfortran with -flto failing with GCC 12

2022-03-11 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104868 Michael Meissner changed: What|Removed |Added CC||meissner at gcc dot gnu.org

[Bug lto/104868] powerpc: Compiling libgfortran with -flto failing with GCC 12

2022-03-10 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104868 --- Comment #4 from Michael Meissner --- In looking at it, the reason is the convert from DImode to TImode has several constraints. The constraint that matters in this case has the output being an Altivec register, while the input is a GPR regi

[Bug target/104253] libgcc missing __floatdiif

2022-03-05 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104253 Michael Meissner changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/104698] Inefficient code for DI to TI sign extend on power10

2022-02-28 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104698 --- Comment #3 from Michael Meissner --- It goes beyond 'just use RTL'. The problem is the code only generates an altivec instruction. So if the __int128_t value is in a GPR, the compiler will need to do a move to the vector registers (1 insn)

[Bug target/104698] New: Inefficient code for DI to TI sign extend on power10

2022-02-25 Thread meissner at gcc dot gnu.org via Gcc-bugs
Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- On power10, signed conversion from DImode to TImode is inefficient for GCC 11 and the current GCC 12. GCC 10 does not do this optimization. On power10, GCC tries

[Bug target/104335] [12 regression] build failure if go is included in languages after r12-6747

2022-02-23 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104335 Michael Meissner changed: What|Removed |Added CC||asolokha at gmx dot com --- Comment

[Bug target/104256] ICE in validate_condition_mode, at config/rs6000/rs6000.cc:11354

2022-02-23 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104256 Michael Meissner changed: What|Removed |Added Resolution|--- |DUPLICATE Status|ASSIGNE

[Bug target/104256] ICE in validate_condition_mode, at config/rs6000/rs6000.cc:11354

2022-02-17 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104256 --- Comment #1 from Michael Meissner --- Created attachment 52463 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52463&action=edit Proposed patch

[Bug target/104256] ICE in validate_condition_mode, at config/rs6000/rs6000.cc:11354

2022-02-17 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104256 Michael Meissner changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |meissner at gcc dot gnu.org

[Bug target/99197] Built-ins for packing/unpacking __ibm128 not documented

2022-02-14 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99197 Michael Meissner changed: What|Removed |Added CC||meissner at gcc dot gnu.org

[Bug ipa/102059] Incorrect always_inline diagnostic in LTO mode with #pragma GCC target("cpu=power10")

2022-02-08 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102059 --- Comment #31 from Michael Meissner --- Created attachment 52383 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52383&action=edit Simpler patch to fix the problem with power8-fusion. This patch just ignores the -mpower8-fusion option in

[Bug target/104253] libgcc missing __floatdiif

2022-01-31 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104253 --- Comment #11 from Michael Meissner --- The patch has been posted, I'm awaiting approval. https://gcc.gnu.org/pipermail/gcc-patches/2022-January/589469.html BTW, the copy_to_mode_reg bug I mentioned earlier goes away with the patch.

[Bug target/104253] libgcc missing __floatdiif

2022-01-28 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104253 Michael Meissner changed: What|Removed |Added Attachment #52306|0 |1 is obsolete|

[Bug target/104253] libgcc missing __floatdiif

2022-01-28 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104253 --- Comment #8 from Michael Meissner --- Yes, you are right. I didn't remember which functions were generated by the compiler, but I just did all of the conversion functions.

[Bug target/104124] Poor optimization for vector splat DW with small consts

2022-01-27 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104124 --- Comment #3 from Michael Meissner --- There are two things going on. 1) There is no vspltisd instruction, so we can't generate a single instruction to load constants other than 0 or -1. Unfortunately, this was not added in either power9 or

[Bug libgcc/104253] libgcc missing __floatdiif

2022-01-27 Thread meissner at gcc dot gnu.org via Gcc-bugs
|unassigned at gcc dot gnu.org |meissner at gcc dot gnu.org --- Comment #5 from Michael Meissner --- The other issue that I mentioned in note #2 is likely a different issue when -mabi=ibmlongdouble is used. I didn't have the patch to automatically use IEEE 128-bit if the compiler used to

[Bug libgcc/104253] libgcc missing __floatdiif

2022-01-27 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104253 --- Comment #4 from Michael Meissner --- Created attachment 52306 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52306&action=edit Patch to use the correct names for __ibm128 converts if long double is IEEE 128-bit The problem was interna

[Bug libgcc/104253] libgcc missing __floatdiif

2022-01-26 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104253 Michael Meissner changed: What|Removed |Added Last reconfirmed||2022-01-26 Status|UNCONF

[Bug testsuite/103763] [12 regression] gcc.target/powerpc/fold-vec-splat-floatdouble.c fails after r12-5988

2022-01-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103763 Michael Meissner changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/104136] Gcc cannot compile wrf_r for power10 using -Ofast

2022-01-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104136 Michael Meissner changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/104136] Gcc cannot compile wrf_r for power10 using -Ofast

2022-01-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104136 --- Comment #5 from Michael Meissner --- Fixed in commit f9063d12633c62a089115df032a19295854d8b06 on January 21, 2022.

[Bug target/104136] Gcc cannot compile wrf_r for power10 using -Ofast

2022-01-21 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104136 Michael Meissner changed: What|Removed |Added Attachment #52246|0 |1 is obsolete|

[Bug target/104136] Gcc cannot compile wrf_r for power10 using -Ofast

2022-01-20 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104136 Michael Meissner changed: What|Removed |Added Attachment #52244|0 |1 is obsolete|

[Bug target/104136] Gcc cannot compile wrf_r for power10 using -Ofast

2022-01-20 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104136 --- Comment #1 from Michael Meissner --- Created attachment 52244 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52244&action=edit Patch to mark XXSPLTIW and XXSPLTIDP as possibly being prefixed If you compile module_advect_em.F90 with -O

[Bug target/104136] Gcc cannot compile wrf_r for power10 using -Ofast

2022-01-19 Thread meissner at gcc dot gnu.org via Gcc-bugs
|critical Host||powerpc64le-unknown-linux-g ||nu Assignee|unassigned at gcc dot gnu.org |meissner at gcc dot gnu.org Target||powerpc64le

[Bug target/104136] New: Gcc cannot compile wrf_r for power10 using -Ofast

2022-01-19 Thread meissner at gcc dot gnu.org via Gcc-bugs
Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- Using the current trunk compiler (from January 18th, 2022), I cannot compile the module_advect_em fortran module with either -Ofast or -O3 using my normal spec build

[Bug testsuite/102935] [12 regression] new test case gcc.target/powerpc/pr101384-1.c fails

2022-01-12 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102935 Michael Meissner changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug testsuite/102935] [12 regression] new test case gcc.target/powerpc/pr101384-1.c fails

2022-01-07 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102935 Michael Meissner changed: What|Removed |Added Attachment #52143|0 |1 is obsolete|

[Bug testsuite/102935] [12 regression] new test case gcc.target/powerpc/pr101384-1.c fails

2022-01-07 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102935 --- Comment #2 from Michael Meissner --- Created attachment 52143 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52143&action=edit Patch to update code generation test The test wants to load all 1's into a vector register. On power8 it u

[Bug testsuite/102935] [12 regression] new test case gcc.target/powerpc/pr101384-1.c fails

2022-01-07 Thread meissner at gcc dot gnu.org via Gcc-bugs
||2022-01-07 CC||dje at gcc dot gnu.org, ||meissner at gcc dot gnu.org, ||segher at gcc dot gnu.org Assignee|unassigned at gcc dot

[Bug testsuite/103763] [12 regression] gcc.target/powerpc/fold-vec-splat-floatdouble.c fails after r12-5988

2022-01-07 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103763 --- Comment #1 from Michael Meissner --- Created attachment 52141 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=52141&action=edit Patch to fix the insn count Update the insn regex for power10.

[Bug testsuite/103763] [12 regression] gcc.target/powerpc/fold-vec-splat-floatdouble.c fails after r12-5988

2022-01-07 Thread meissner at gcc dot gnu.org via Gcc-bugs
||2022-01-07 Status|UNCONFIRMED |ASSIGNED Assignee|unassigned at gcc dot gnu.org |meissner at gcc dot gnu.org

[Bug target/103498] New: Spec 2017 imagick_r is 2.62% slower on Power10 with pc-relative addressing compared to not using pc-relative addressing

2021-11-30 Thread meissner at gcc dot gnu.org via Gcc-bugs
Version: 12.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: target Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- I was doing some Spec 2017 rate runs on a single power10

[Bug target/99921] PowerPC xxeval has the wrong predicates

2021-11-30 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=99921 Michael Meissner changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2021-11-30 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 103320, which changed state. Bug 103320 Summary: [12 Regression] Spec 2017 benchmark roms_r fails on PowerPC for -Ofast https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103320 What|Removed |

[Bug target/103320] [12 Regression] Spec 2017 benchmark roms_r fails on PowerPC for -Ofast

2021-11-30 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103320 Michael Meissner changed: What|Removed |Added Resolution|--- |WONTFIX Status|UNCONFIRM

[Bug libstdc++/103387] powerpc64le: segmentation fault on std::cout with ieee128 long double variable

2021-11-23 Thread meissner at gcc dot gnu.org via Gcc-bugs
||2021-11-23 Priority|P3 |P1 CC||meissner at gcc dot gnu.org Ever confirmed|0 |1 Status|UNCONFIRMED |NEW --- Comment #1 from Michael Meissner

[Bug tree-optimization/103317] Spec 2017 benchmark blender_r fails with -Ofast on PowerPC (power9, power10)

2021-11-18 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103317 Michael Meissner changed: What|Removed |Added Priority|P2 |P1

[Bug regression/103318] Spec 2017 benchmark perlbench_r fails on PowerPC for -Ofast and -O3, passes with -O2

2021-11-18 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103318 Michael Meissner changed: What|Removed |Added Priority|P2 |P1

[Bug regression/103320] Spec 2017 benchmark roms_r fails on PowerPC for -Ofast

2021-11-18 Thread meissner at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103320 Michael Meissner changed: What|Removed |Added Priority|P2 |P1

[Bug regression/103320] Spec 2017 benchmark roms_r fails on PowerPC for -Ofast

2021-11-18 Thread meissner at gcc dot gnu.org via Gcc-bugs
, ||dje at gcc dot gnu.org, ||meissner at gcc dot gnu.org, ||segher at gcc dot gnu.org, ||wschmidt at gcc dot gnu.org

[Bug regression/103320] New: Spec 2017 benchmark roms_r fails on PowerPC for -Ofast

2021-11-18 Thread meissner at gcc dot gnu.org via Gcc-bugs
Component: regression Assignee: unassigned at gcc dot gnu.org Reporter: meissner at gcc dot gnu.org Target Milestone: --- The Spec 2017 benchmark roms_r compiles fine but produces the wrong output when compiled with -Ofast options on both power9 and power10. In going back with
