[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-08-01 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #43 from Filip Kastl --- (In reply to H.J. Lu from comment #42) > Created attachment 62020 [details] > A new patch > > Here is a patch not to limit non all 0s/1s vector loads in the same loop. > Please try it. This patch also helps

[Bug tree-optimization/121332] New: [16 Regression] 8-16% slowdown of 519.lbm_r on AMD Zen 2 since r16-2601-ge8a51144c02e1c

2025-07-31 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121332 Bug ID: 121332 Summary: [16 Regression] 8-16% slowdown of 519.lbm_r on AMD Zen 2 since r16-2601-ge8a51144c02e1c Product: gcc Version: 16.0 Status: UNCONFIRMED

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-29 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #34 from Filip Kastl --- (In reply to H.J. Lu from comment #33) > Created attachment 61995 [details] > An updated patch > > Please try this. The updated patch helps! We go from 233s to 163s. So the patch reverts the slowdown. I'

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-29 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #30 from Filip Kastl --- (In reply to H.J. Lu from comment #29) > Created attachment 61973 [details] > A new patch > > Please try this. Sadly, this patch doesn't help. Actually, lbm gets compiled into the same binary with and with

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-25 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #28 from Filip Kastl --- Created attachment 61965 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=61965&action=edit testcase 2 (reduced lbm, where the spill can be seen) Ok, I think I have confirmed that there is a spill going

[Bug tree-optimization/121155] [16 Regression] 4-6% slowdown of 444.namd since r16-2193-g363b29a9cfbb47

2025-07-18 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121155 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug tree-optimization/121155] New: [16 Regression] 4-6% slowdown of 444.namd since r16-2193-g363b29a9cfbb47

2025-07-18 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121155 Bug ID: 121155 Summary: [16 Regression] 4-6% slowdown of 444.namd since r16-2193-g363b29a9cfbb47 Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: misse

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-15 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #27 from Filip Kastl --- If I find the spilling, I'll try to produce a testcase where it can be seen.

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-15 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #25 from Filip Kastl --- (In reply to H.J. Lu from comment #24) > Why is it bad for znver2? Oh, I thought we are trying to figure that out. Spilling because of register pressure, as richi suggested in comment 3, is the best guess w

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-15 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #23 from Filip Kastl --- testcase.c enum { ST, SB, ET, EB, WT, WB } LBM_initializeGrid() { double *grid; grid[ST] = grid[SB] = grid[ET] = grid[EB] = grid[WT] = grid[WB] = 1.0 / 36.0; } Compile with -Ofa

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-15 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #21 from Filip Kastl --- Oh, ok. I misunderstood. Well, you have SPEC CPU 2017, right? Then setting OPTIMIZE= -Ofast -march=znver2 -mtune=znver2 -g -flto -fdump-rtl-all should work. Perhaps you'll also need COPTIMIZE = -

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-15 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #19 from Filip Kastl --- Well, if you want to reproduce the lbm slowdown, you need a Zen2 or Zen5 machine. I'm not sure how I would produce a testcase that would also uncover the slowdown on other microarchitectures, sorry. If I un

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-14 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 Filip Kastl changed: What|Removed |Added Status|WAITING |NEW --- Comment #17 from Filip Kastl ---

[Bug tree-optimization/121037] [16 Regression] 4-6% slowdown of 482.sphinx3 since r16-2088-ge9079e4f43d135

2025-07-11 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121037 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug tree-optimization/121037] New: [16 Regression] 4-6% slowdown of 482.sphinx3 since r16-2088-ge9079e4f43d135

2025-07-11 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121037 Bug ID: 121037 Summary: [16 Regression] 4-6% slowdown of 482.sphinx3 since r16-2088-ge9079e4f43d135 Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: mi

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-08 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #16 from Filip Kastl --- Ok, I'll try to extract a smaller testcase.

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-08 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #13 from Filip Kastl --- (In reply to Filip Kastl from comment #12) > As I've commented in pr120957, I've also bisected 9% Zen3 -Ofast > -march=native slowdown to this commit. That slowdown can also be solved by > applying the patch

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-08 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #12 from Filip Kastl --- As I've commented in pr120957, I've also bisected 9% Zen3 -Ofast -march=native slowdown to this commit. That slowdown can also be solved by applying the patch hjl has provided.

[Bug target/120957] [16 Regression] 6% slowdown of 503.bwaves_r on Zen2 since r16-1647-gc06979ff957485

2025-07-08 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120957 Filip Kastl changed: What|Removed |Added Summary|[16 Regression] 6-9%|[16 Regression] 6% slowdown

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-08 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #11 from Filip Kastl --- (In reply to H.J. Lu from comment #9) > Created attachment 61803 [details] > A patch > > Please try this. Tried applying this on top of r16-1644-gaba3b9d3a48a07. With r16-1644-gaba3b9d3a48a07 ... 224s With

[Bug sanitizer/120984] New: [16 Regression] Bunch of 'insufficient space for an object of type...' errors during ubsan bootstrap

2025-07-07 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120984 Bug ID: 120984 Summary: [16 Regression] Bunch of 'insufficient space for an object of type...' errors during ubsan bootstrap Product: gcc Version: 16.0 Status: UNCONFIRM

[Bug target/120957] [16 Regression] 6-9% slowdown of 503.bwaves_r on Zen{2,3} since r16-1647-gc06979ff957485

2025-07-07 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120957 --- Comment #3 from Filip Kastl --- I've bisected this on Zen2. It is possible that this is actually two different slowdowns and only the Zen2 slowdown is caused by r16-1647. I'll bisect on Zen3.

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-04 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 Filip Kastl changed: What|Removed |Added Summary|[16 Regression] 24-40% |[16 Regression] 24-40%

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 since r16-1644-gaba3b9d3a48a07

2025-07-04 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #7 from Filip Kastl --- >(In reply to Filip Kastl from comment #0) > there was a 40% exec time slowdown (on another machine I measured only 24%) > of 527.cam4_r SPEC 2017 benchmark when run with -Ofast -march=native -flto and this s

[Bug target/120943] [16 Regression] 5% slowdown of 527.cam4_r on Zen{4,5} since r16-1643-gd073bb6cfc219d

2025-07-04 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120943 --- Comment #3 from Filip Kastl --- (In reply to H.J. Lu from comment #1) > Please try: > > https://patchwork.sourceware.org/project/gcc/list/?series=48886 Yes, if I apply this patch, the slowdown goes away

[Bug target/120957] [16 Regression] 6-9% slowdown of 503.bwaves_r on Zen{2,3} since r16-1647-gc06979ff957485

2025-07-04 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120957 --- Comment #1 from Filip Kastl --- The slowdown is also present on 410.bwaves from 2006 SPEC https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=467.40.0 https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=301.40.0 again, both on Zen2

[Bug target/120959] New: [16 Regression] 9% slowdown of 549.fotonik3d_r on Zen5 since r16-1645-g309dbcea2cabb3

2025-07-04 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120959 Bug ID: 120959 Summary: [16 Regression] 9% slowdown of 549.fotonik3d_r on Zen5 since r16-1645-g309dbcea2cabb3 Product: gcc Version: 16.0 Status: UNCONFIRMED Ke

[Bug target/120957] [16 Regression] 6-9% slowdown of 503.bwaves_r on Zen{2,3} since r16-1647-gc06979ff957485

2025-07-04 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120957 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug target/120957] New: [16 Regression] 6-9% slowdown of 503.bwaves_r on Zen{2,3} since r16-1647-gc06979ff957485

2025-07-04 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120957 Bug ID: 120957 Summary: [16 Regression] 6-9% slowdown of 503.bwaves_r on Zen{2,3} since r16-1647-gc06979ff957485 Product: gcc Version: 16.0 Status: UNCONFIRMED

[Bug target/120956] [16 Regression] 6% slowdown of 503.bwaves_r since

2025-07-04 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120956 Filip Kastl changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/120956] New: [16 Regression] 6% slowdown of 503.bwaves_r since

2025-07-04 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120956 Bug ID: 120956 Summary: [16 Regression] 6% slowdown of 503.bwaves_r since Product: gcc Version: 16.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component

[Bug target/120941] [16 Regression] 10-40% slowdown of 519.lbm_r on Zen2 since r16-1644-gaba3b9d3a48a07

2025-07-04 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 Filip Kastl changed: What|Removed |Added Summary|[16 Regression] 10-40% |[16 Regression] 10-40%

[Bug target/120943] [16 Regression] 5% slowdown of 527.cam4_r on Zen{4,5} since r16-1643-gd073bb6cfc219d

2025-07-03 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120943 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug target/120943] New: [16 Regression] 5% slowdown of 527.cam4_r on Zen{4,5} since r16-1643-gd073bb6cfc219d

2025-07-03 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120943 Bug ID: 120943 Summary: [16 Regression] 5% slowdown of 527.cam4_r on Zen{4,5} since r16-1643-gd073bb6cfc219d Product: gcc Version: 16.0 Status: UNCONFIRMED Key

[Bug target/120941] [16 Regression] 10-40% slowdown of 519.lbm_r on Zen{2,3} since r16-1644-gaba3b9d3a48a07

2025-07-03 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 Filip Kastl changed: What|Removed |Added Summary|[16 Regression] 20-40% |[16 Regression] 10-40%

[Bug target/120941] [16 Regression] 20-40% slowdown of 519.lbm_r on Zen2 since r16-1644-gaba3b9d3a48a07

2025-07-03 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug target/120941] New: [16 Regression] 20-40% slowdown of 519.lbm_r on Zen2 since r16-1644-gaba3b9d3a48a07

2025-07-03 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 Bug ID: 120941 Summary: [16 Regression] 20-40% slowdown of 519.lbm_r on Zen2 since r16-1644-gaba3b9d3a48a07 Product: gcc Version: 16.0 Status: UNCONFIRMED Keyw

[Bug tree-optimization/120747] [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-07-03 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 --- Comment #17 from Filip Kastl --- (In reply to Andrew Pinski from comment #15) > So it looks like (a * b) are closer in value to (vnb12 * 1.2e+1 - c) than > (vnb12 * 1.2e+1) is to (a * b - c) . Btw, for the purpose of me trying to get better

[Bug tree-optimization/120747] [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-07-02 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 --- Comment #14 from Filip Kastl --- If I do -fdump-tree-optimized, I see these two differences in function inl1100: A has higher numerical error (3.09998e+02)| B has ok numerical error (3.12012e+02) -

[Bug target/120866] [16 Regression] pdp11-aout, powerpc-ibm-aix7.1 and powerpc-ibm-aix7.2 crosscompilers fail to build

2025-06-29 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120866 --- Comment #3 from Filip Kastl --- (In reply to Sam James from comment #1) > Huh, it's really a trunk regression? I can't yet think of which change > would've done this. It seems to be. I've just tested this with trunk.

[Bug target/120866] [16 Regression] pdp11-aout, powerpc-ibm-aix7.1 and powerpc-ibm-aix7.2 crosscompilers fail to build

2025-06-29 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120866 Filip Kastl changed: What|Removed |Added Summary|[16 Regression] pdp11-aout |[16 Regression] pdp11-aout,

[Bug target/120866] [16 Regression] pdp11-aout crosscompiler fails to build

2025-06-29 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120866 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug target/120866] New: [16 Regression] pdp11-aout crosscompiler fails to build

2025-06-29 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120866 Bug ID: 120866 Summary: [16 Regression] pdp11-aout crosscompiler fails to build Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: build Severi

[Bug tree-optimization/120747] [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-06-28 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 --- Comment #13 from Filip Kastl --- My theory is that the "miscompiled" functions are actually two: inl1100 and inl1120. If I compile these two functions with r16-1549 and the rest of innerf.f with r16-1550, I get the same gromacs output as wh

[Bug tree-optimization/120747] [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-06-26 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 --- Comment #12 from Filip Kastl --- gfortran -std=legacy -c -o innerf.o -Ofast -g -march=native -mtune=native innerf.f these are the compile options, btw

[Bug tree-optimization/120747] [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-06-26 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 --- Comment #11 from Filip Kastl --- So the file that is getting "miscompiled" is innerf.f. I found out by compiling this gromacs source file with r16-1550 GCC and all the other source files with r16-1549 GCC and then linking that together. I'

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2025-06-24 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 113833, which changed state. Bug 113833 Summary: 435.gromacs fails verification on with -Ofast -march={cascadelake,icelake-server} and PGO after r14-7272-g57f611604e8bab https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113833

[Bug tree-optimization/113833] 435.gromacs fails verification on with -Ofast -march={cascadelake,icelake-server} and PGO after r14-7272-g57f611604e8bab

2025-06-24 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=113833 Filip Kastl changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug tree-optimization/120747] [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-06-24 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 --- Comment #10 from Filip Kastl --- > Given the nature of the change that caused this (trimming integral ranges > bounds to match the bitmasks) its probable that a smaller range had some > other pass make a different decision. Yeah, I also t

[Bug tree-optimization/120747] [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-06-24 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 --- Comment #9 from Filip Kastl --- Ok, I'll try to find out from which file (maybe even from which function) the numerical error originates (thanks for the tips, Sam). It will take some time though since all of the Zen4/5 machines I have avail

[Bug tree-optimization/120747] [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-06-24 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 Filip Kastl changed: What|Removed |Added Keywords|wrong-code | --- Comment #6 from Filip Kastl --- Rem

[Bug tree-optimization/120752] 5% slowdown of 525.x264_r since r16-1346-gb0d50cbb42ab2c

2025-06-22 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120752 --- Comment #2 from Filip Kastl --- Created attachment 61680 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=61680&action=edit perf report -n output before Honza's commit (In reply to Jan Hubicka from comment #1) > if you happen to have bi

[Bug tree-optimization/120752] 5% slowdown of 525.x264_r since r16-1346-gb0d50cbb42ab2c

2025-06-22 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120752 --- Comment #3 from Filip Kastl --- Created attachment 61681 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=61681&action=edit perf report -n output after Honza's commit

[Bug tree-optimization/120747] [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-06-22 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 --- Comment #3 from Filip Kastl --- (In reply to Andrew Macleod from comment #2) > Does it still fail with the fix for PR 120701? Sadly, the fix for pr120701 doesn't help. I can still replicate this on r16-1594-gb03e0d69b37f6e and on current t

[Bug tree-optimization/120752] New: 5% slowdown of 525.x264_r since r16-1346-gb0d50cbb42ab2c

2025-06-21 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120752 Bug ID: 120752 Summary: 5% slowdown of 525.x264_r since r16-1346-gb0d50cbb42ab2c Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: missed-optimization

[Bug tree-optimization/120752] 5% slowdown of 525.x264_r since r16-1346-gb0d50cbb42ab2c

2025-06-21 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120752 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug tree-optimization/120751] New: [16 Regression] 10-15% slowdown of 454.calculix on Zen4 and Zen5 since r16-1001-g0291f53f8d2343

2025-06-21 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120751 Bug ID: 120751 Summary: [16 Regression] 10-15% slowdown of 454.calculix on Zen4 and Zen5 since r16-1001-g0291f53f8d2343 Product: gcc Version: 16.0 Status: UNCONFIRMED

[Bug tree-optimization/120751] [16 Regression] 10-15% slowdown of 454.calculix on Zen4 and Zen5 since r16-1001-g0291f53f8d2343

2025-06-21 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120751 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug tree-optimization/120747] New: [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-06-21 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 Bug ID: 120747 Summary: [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638 Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: wrong-c

[Bug tree-optimization/120749] New: [16 Regression] 5% slowdown of 548.exchange2_r on Aarch64

2025-06-21 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120749 Bug ID: 120749 Summary: [16 Regression] 5% slowdown of 548.exchange2_r on Aarch64 Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: missed-optimization,

[Bug tree-optimization/120749] [16 Regression] 5% slowdown of 548.exchange2_r on Aarch64

2025-06-21 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120749 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug tree-optimization/120747] [16 Regression] 435.gromacs miscompares since r16-1550-g9244ea4bf55638

2025-06-21 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120747 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug target/120733] [16 Regression][aarch64] ICE in gen_highpart, at lra.cc:1484 since r16-1565-g2dcc6dbd8a00ca

2025-06-21 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120733 --- Comment #2 from Filip Kastl --- Btw, 500.perlbench and 435.gromacs SPEC CPU benchmarks currently cannot be built because of this (at least for some combinations of compiler flags).

[Bug tree-optimization/120221] Missed optimization related to switch handling

2025-05-13 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120221 --- Comment #7 from Filip Kastl --- So this isn't specific for switches. Rather, this is some kind of forward propagation of a shift that we don't currently do, right?

[Bug target/120256] New: [16 Regression] 4% slowdown of 454.calculix on aarch64

2025-05-13 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120256 Bug ID: 120256 Summary: [16 Regression] 4% slowdown of 454.calculix on aarch64 Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: missed-optimization, needs-bisection

[Bug tree-optimization/120069] [16 Regression] Yes another imagick -march=native -flto -Ofast + PGO regression (+ wrf and lbm) since r16-270-ga0a64aa5da0af5

2025-05-13 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120069 Filip Kastl changed: What|Removed |Added Summary|[16 Regression] Yes another |[16 Regression] Yes another

[Bug tree-optimization/120069] [16 Regression] Yes another imagick -march=native -flto -Ofast + PGO regression since r16-270-ga0a64aa5da0af5

2025-05-13 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120069 --- Comment #11 from Filip Kastl --- I've also just bisected this 2006 lbm -Ofast -march=native -flto PGO AMD Zen3 16% slowdown to r16-270-ga0a64aa5da0af5: https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=474.240.0

[Bug tree-optimization/120069] [16 Regression] Yes another imagick -march=native -flto -Ofast + PGO regression since r16-270-ga0a64aa5da0af5

2025-05-13 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120069 --- Comment #10 from Filip Kastl --- I would make a pr for the arm slowdowns but it looks like they disappeared. But do tell me if you think I should report those or feel free to report them yourself.

[Bug tree-optimization/120069] [16 Regression] Yes another imagick -march=native -flto -Ofast + PGO regression since r16-270-ga0a64aa5da0af5

2025-05-13 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120069 Filip Kastl changed: What|Removed |Added Summary|[16 Regression] Yes another |[16 Regression] Yes another

[Bug tree-optimization/120069] [16 Regression] Yes another imagick -march=native -flto -Ofast + PGO regression between g:1c0cbc1b300e08df5ebfce00a7195890d78f2064 and g:55b01e17c793688a2878fa43a76df126

2025-05-12 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120069 Filip Kastl changed: What|Removed |Added CC||pheeck at gcc dot gnu.org --- Comment #7

[Bug target/119900] [16 regression] imagick slowdown with -Ofast -march=native -fprofile-use since r16-39-gf6859fb621179e (interaction of rpad and late-combine)

2025-05-12 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119900 --- Comment #8 from Filip Kastl --- Looks fixed. Should we close this?

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2025-05-12 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 120226, which changed state. Bug 120226 Summary: 8% regression of exchange2 with -O2 between g:d0571638a6bad932 and g:9b13bea07706a7ca https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120226 What|Removed

[Bug tree-optimization/120219] [16 Regression] ~11% slowdown of 548.exchange2_r on x86_64 (maybe also on aarch64?) since r16-448-g8335fd561fa823

2025-05-12 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120219 --- Comment #4 from Filip Kastl --- *** Bug 120226 has been marked as a duplicate of this bug. ***

[Bug target/120226] 8% regression of exchange2 with -O2 between g:d0571638a6bad932 and g:9b13bea07706a7ca

2025-05-12 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120226 Filip Kastl changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED CC|

[Bug tree-optimization/120219] [16 Regression] ~11% slowdown of 548.exchange2_r on x86_64 (maybe also on aarch64?) since r16-448-g8335fd561fa823

2025-05-12 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120219 Filip Kastl changed: What|Removed |Added See Also||https://gcc.gnu.org/bugzill

[Bug tree-optimization/120219] [16 Regression] ~11% slowdown of 548.exchange2_r on x86_64

2025-05-12 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120219 Filip Kastl changed: What|Removed |Added Summary|[16 Regression] ~11%|[16 Regression] ~11% |s

[Bug target/119919] 7% exchange2 regression between g:6390fc86995fbd5239497cb9e1797a3af51d3936 and g:f72a2d221539cede358f2487b94bc370c6fc44b5

2025-05-11 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119919 Filip Kastl changed: What|Removed |Added See Also||https://gcc.gnu.org/bugzill

[Bug tree-optimization/120219] New: [16 Regression] ~11% slowdown of 548.exchange2_r on AMD Zen

2025-05-11 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120219 Bug ID: 120219 Summary: [16 Regression] ~11% slowdown of 548.exchange2_r on AMD Zen Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: missed-optimizatio

[Bug target/120218] [16 Regression] 8% slowdown of 507.cactuBSSN_r on Intel

2025-05-11 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120218 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug target/120218] New: [16 Regression] 8% slowdown of 507.cactuBSSN_r on Intel

2025-05-11 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120218 Bug ID: 120218 Summary: [16 Regression] 8% slowdown of 507.cactuBSSN_r on Intel Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: missed-optimization, n

[Bug target/119919] 7% exchange2 regression between g:6390fc86995fbd5239497cb9e1797a3af51d3936 and g:f72a2d221539cede358f2487b94bc370c6fc44b5

2025-05-11 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119919 Filip Kastl changed: What|Removed |Added CC||pheeck at gcc dot gnu.org --- Comment #8

[Bug tree-optimization/120080] [16 regression] ICE when building llvm-20.1.3 (find_bit_tests, at tree-switch-conversion.cc:1799) since r16-347-g1381a5114788a2

2025-05-10 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120080 Filip Kastl changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug tree-optimization/120080] [16 regression] ICE when building llvm-20.1.3 (find_bit_tests, at tree-switch-conversion.cc:1799) since r16-347-g1381a5114788a2

2025-05-07 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120080 --- Comment #10 from Filip Kastl --- The line that crashes GCC seems to be this one gcc_checking_assert (l > 0); where l is the number of clusters of a switch. At this point in compilation clusters = cases so this means that bit-test switch

[Bug tree-optimization/119965] [16 Regression] 531.deepsjeng_r binary is 50% bigger since r16-116-gcfb04e0de6aa43

2025-05-01 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119965 --- Comment #1 from Filip Kastl --- Here is a graph showing this code size increase. https://lnt.opensuse.org/db_default/v4/SPEC/graph?plot.0=293.378.4

[Bug tree-optimization/119044] [15 regression] 5-16% slowdown of 436.cactusADM since r15-7665-gee30e2586a3142

2025-05-01 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119044 --- Comment #5 from Filip Kastl --- Nice. From the graph it looks like this helped a lot.

[Bug target/114411] [14/15/16 Regression] 12% exec time slowdown of 433.milc on aarch64

2025-05-01 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114411 Filip Kastl changed: What|Removed |Added Resolution|--- |FIXED Status|UNCONFIRMED

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2025-05-01 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 114411, which changed state. Bug 114411 Summary: [14/15/16 Regression] 12% exec time slowdown of 433.milc on aarch64 https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114411 What|Removed |Ad

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2025-04-27 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 119924, which changed state. Bug 119924 Summary: [16 Regression] ICE when building 531.deepsjeng_r during ipa-cp since r16-101-g132d01d96ea9d6 https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119924 What|Remov

[Bug tree-optimization/119924] [16 Regression] ICE when building 531.deepsjeng_r during ipa-cp since r16-101-g132d01d96ea9d6

2025-04-27 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119924 Filip Kastl changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug tree-optimization/119965] New: [16 Regression] 531.deepsjeng_r binary is 50% bigger since r16-116-gcfb04e0de6aa43

2025-04-27 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119965 Bug ID: 119965 Summary: [16 Regression] 531.deepsjeng_r binary is 50% bigger since r16-116-gcfb04e0de6aa43 Product: gcc Version: 16.0 Status: UNCONFIRMED Keywo

[Bug target/119929] [16 Regression] build fails on mips64el-linux-gnu (and other mips targets) since r16-51-g727a43e0a66052

2025-04-27 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119929 Filip Kastl changed: What|Removed |Added CC||pheeck at gcc dot gnu.org Summ

[Bug tree-optimization/119963] [16 Regression

2025-04-27 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119963 Filip Kastl changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug tree-optimization/119963] New: [16 Regression

2025-04-27 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119963 Bug ID: 119963 Summary: [16 Regression Product: gcc Version: 16.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: tree-optimization Assig

[Bug target/114411] [14/15/16 Regression] 12% exec time slowdown of 433.milc on aarch64

2025-04-27 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=114411 --- Comment #5 from Filip Kastl --- Soon after I reported this, the graph went to roughly the original values. So if no one minds, I'll close this.

[Bug target/119925] [15/16 Regression] 4% slowdown of 538.imagick_r and 10% of 433.milc on Aarch64

2025-04-27 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119925 --- Comment #3 from Filip Kastl --- (In reply to Filip Kastl from comment #1) > I've just noticed that there is also a 10% slowdown of 433.milc SPEC 2006 > benchmark also with -Ofast -march=native PGO. Looking at the graphs, I'd > say these two

[Bug target/119925] [15/16 Regression] 4% slowdown of 538.imagick_r and 10% of 433.milc on Aarch64

2025-04-24 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119925 Filip Kastl changed: What|Removed |Added Summary|[15/16 Regression] 4% |[15/16 Regression] 4% |

[Bug target/119927] New: 5% slowdown of 415.gamess on Intel Ice Lake

2025-04-24 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119927 Bug ID: 119927 Summary: 5% slowdown of 415.gamess on Intel Ice Lake Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: missed-optimization, needs-bisection Severit

[Bug target/119925] New: [15/16 Regression] 4% slowdown of 538.imagick_r on Aarch64

2025-04-24 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119925 Bug ID: 119925 Summary: [15/16 Regression] 4% slowdown of 538.imagick_r on Aarch64 Product: gcc Version: 16.0 Status: UNCONFIRMED Keywords: missed-optimization

[Bug target/119925] [15/16 Regression] 4% slowdown of 538.imagick_r on Aarch64

2025-04-24 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119925 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

[Bug tree-optimization/119924] [16 Regression] ICE when building 531.deepsjeng_r during ipa-cp since r16-101-g132d01d96ea9d6

2025-04-24 Thread pheeck at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119924 Filip Kastl changed: What|Removed |Added Target Milestone|--- |16.0

  1   2   3   4   >