[Bug target/121306] [16 Regression] testcase failures after r16-2614-g965564eafb721f on x86_64

2025-08-05 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121306 --- Comment #13 from H.J. Lu --- (In reply to Sam James from comment #12) > Is this one fixed now, or does it still need Richard S's simplify-rtx patch > (https://inbox.sourceware.org/gcc-patches/mpt34a7f5mk@arm.com/)? Need this and https:

[Bug target/121410] [16 Regression] ICE: in expand_set_or_cpymem_prologue_epilogue_by_misaligned_moves, at config/i386/i386-expand.cc:8872 with -O2 -mavx512f -mstore-max=128

2025-08-05 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121410 H.J. Lu changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/121410] [16 Regression] ICE: in expand_set_or_cpymem_prologue_epilogue_by_misaligned_moves, at config/i386/i386-expand.cc:8872 with -O2 -mavx512f -mstore-max=128

2025-08-05 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121410 --- Comment #1 from H.J. Lu --- A patch is posted at https://gcc.gnu.org/pipermail/gcc-patches/2025-August/691707.html

[Bug target/121410] [16 Regression] ICE: in expand_set_or_cpymem_prologue_epilogue_by_misaligned_moves, at config/i386/i386-expand.cc:8872 with -O2 -mavx512f -mstore-max=128

2025-08-05 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121410 H.J. Lu changed: What|Removed |Added Status|UNCONFIRMED |ASSIGNED Assignee|unassigned at gcc

[Bug middle-end/26163] [meta-bug] missed optimization in SPEC (2k17, 2k and 2k6 and 95)

2025-08-03 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=26163 Bug 26163 depends on bug 120941, which changed state. Bug 120941 Summary: [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07 https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 What

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-08-03 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 H.J. Lu changed: What|Removed |Added Resolution|--- |FIXED Status|WAITING

[Bug middle-end/121352] tls_model attribute documentation needs update

2025-08-01 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121352 --- Comment #2 from H.J. Lu --- (In reply to Alexander Monakov from comment #1) > That patch deliberately takes care not to optimize in presence of the > attribute: > > /* Upgrade TLS access model based on optimized visibility status,

[Bug middle-end/121352] New: tls_model attribute documentation needs update

2025-08-01 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121352 Bug ID: 121352 Summary: tls_model attribute documentation needs update Product: gcc Version: 13.4.1 Status: UNCONFIRMED Severity: normal Priority: P3 Component:

[Bug middle-end/121318] handle_tls_model_attribute failed to consider compiler options

2025-08-01 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121318 H.J. Lu changed: What|Removed |Added Resolution|--- |DUPLICATE Status|UNCONFIRMED

[Bug c/107419] attributes are ignored when selecting TLS model

2025-08-01 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107419 H.J. Lu changed: What|Removed |Added CC||hjl.tools at gmail dot com --- Comment #2 fro

[Bug c++/107393] Wrong TLS model for specialized template

2025-07-31 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107393 --- Comment #1 from H.J. Lu --- Created attachment 62024 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=62024&action=edit A patch

[Bug c/107419] attributes are ignored when selecting TLS model

2025-07-31 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107419 --- Comment #1 from H.J. Lu --- Created attachment 62025 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=62025&action=edit A patch

[Bug fortran/107421] problematic interaction of 'common' and 'threadprivate'

2025-07-31 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=107421 --- Comment #1 from H.J. Lu --- Created attachment 62023 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=62023&action=edit A patch

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-31 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #42 from H.J. Lu --- Created attachment 62020 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=62020&action=edit A new patch Here is a patch not to limit non all 0s/1s vector loads in the same loop. Please try it.

[Bug target/121324] New: [16 Regression] FAIL: gcc.target/i386/pr82524.c caused by r16-2649-g0340177d54d08b

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121324 Bug ID: 121324 Summary: [16 Regression] FAIL: gcc.target/i386/pr82524.c caused by r16-2649-g0340177d54d08b Product: gcc Version: 16.0 Status: UNCONFIRMED Sever

[Bug c++/120409] FAIL: g++.dg/coroutines/torture/pr119916.C

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120409 --- Comment #14 from H.J. Lu --- (In reply to Iain Sandoe from comment #13) > (In reply to H.J. Lu from comment #12) > > (In reply to Iain Sandoe from comment #5) > > > well .. the visible difference in gimple, but this is code outside of the >

[Bug c++/120409] FAIL: g++.dg/coroutines/torture/pr119916.C

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120409 H.J. Lu changed: What|Removed |Added Target Milestone|--- |16.0 --- Comment #12 from H.J. Lu --- (In re

[Bug target/121319] New: Why is global_options.x_flag_pcc_struct_return cleared?

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121319 Bug ID: 121319 Summary: Why is global_options.x_flag_pcc_struct_return cleared? Product: gcc Version: 16.0 Status: UNCONFIRMED Severity: normal Prior

[Bug c++/120409] FAIL: g++.dg/coroutines/torture/pr119916.C

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120409 --- Comment #11 from H.J. Lu --- typeck.cc has /* We can't initialize a register from a AGGR_INIT_EXPR. */ else if (! cfun->returns_struct && TREE_CODE (retval) == TARGET_EXPR && TREE_CODE (TARGET_EXPR_

[Bug middle-end/121318] handle_tls_model_attribute failed to consider compiler options

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121318 --- Comment #5 from H.J. Lu --- (In reply to Andrew Pinski from comment #2) > I am not so sure, the documentation of attribute say it should override the > default. Linker can override the TLS mode attribute. Why can't compiler perform TLS opt

[Bug middle-end/121318] handle_tls_model_attribute failed to consider compiler options

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121318 --- Comment #4 from H.J. Lu --- Created attachment 62007 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=62007&action=edit A patch

[Bug c/121318] New: handle_tls_model_attribute failed to consider compiler options

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121318 Bug ID: 121318 Summary: handle_tls_model_attribute failed to consider compiler options Product: gcc Version: 16.0 Status: UNCONFIRMED Severity: normal

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #41 from H.J. Lu --- (In reply to rguent...@suse.de from comment #40) > On Wed, 30 Jul 2025, hjl.tools at gmail dot com wrote: > > With my patch, we got > > > > basic_block bb = nearest_common_dominator_for_set (CDI_DOMINATORS, bb

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #39 from H.J. Lu --- (In reply to rguent...@suse.de from comment #38) > On Wed, 30 Jul 2025, hjl.tools at gmail dot com wrote: > > > https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 > > > > --- Comment #37 from H.J. Lu --- > >

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #37 from H.J. Lu --- (In reply to Richard Biener from comment #35) > (In reply to H.J. Lu from comment #33) > > Created attachment 61995 [details] > > An updated patch > > > > Please try this. > > Looking at the patch I do wonder a

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-30 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #36 from H.J. Lu --- (In reply to Filip Kastl from comment #34) > (In reply to H.J. Lu from comment #33) > > Created attachment 61995 [details] > > An updated patch > > > > Please try this. > > The updated patch helps! We go from

[Bug target/121306] [16 Regression] testcase failures after r16-2614-g965564eafb721f on x86_64

2025-07-29 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121306 H.J. Lu changed: What|Removed |Added Ever confirmed|0 |1 Status|UNCONFIRMED

[Bug c++/120409] FAIL: g++.dg/coroutines/torture/pr119916.C

2025-07-29 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120409 --- Comment #9 from H.J. Lu --- The .006t.original dump differences between -m32 and -mx32 ;; Function folly::OptionalPromiseReturn folly::OptionalPromise::get_return_object() (null) ;; enabled by -tree-original -< = TARGET_EXPR = TARGET_E

[Bug target/120427] [13/14/15 Regression] "and $0,mem" is generated without -Oz since r12-6106-gef26c151c14a87

2025-07-29 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120427 H.J. Lu changed: What|Removed |Added Attachment #61996|0 |1 is obsolete|

[Bug target/120427] [13/14/15 Regression] "and $0,mem" is generated without -Oz since r12-6106-gef26c151c14a87

2025-07-29 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120427 H.J. Lu changed: What|Removed |Added Resolution|FIXED |--- Status|RESOLVED

[Bug target/120427] [13/14/15 Regression] "and $0,mem" is generated without -Oz since r12-6106-gef26c151c14a87

2025-07-29 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120427 --- Comment #12 from H.J. Lu --- Created attachment 61996 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=61996&action=edit A patch

[Bug bootstrap/121302] New: [16 Regression] Bootstrap failed with ada

2025-07-29 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121302 Bug ID: 121302 Summary: [16 Regression] Bootstrap failed with ada Product: gcc Version: 16.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: bootst

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-29 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 H.J. Lu changed: What|Removed |Added Attachment #61973|0 |1 is obsolete|

[Bug target/120427] [13/14/15 Regression] "and $0,mem" is generated without -Oz since r12-6106-gef26c151c14a87

2025-07-29 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120427 --- Comment #10 from H.J. Lu --- (In reply to Harald van Dijk from comment #9) > (In reply to H.J. Lu from comment #2) > > We shouldn't transform "mov $-1,reg" to "push $-1; pop reg". > > We should transform "mov $-1,reg" to "or $-1,reg" instead

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-29 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #32 from H.J. Lu --- (In reply to Filip Kastl from comment #30) > (In reply to H.J. Lu from comment #29) > > Created attachment 61973 [details] > > A new patch > > > > Please try this. > > Sadly, this patch doesn't help. Actually,

[Bug target/120427] [13/14/15 Regression] "and $0,mem" is generated without -Oz since r12-6106-gef26c151c14a87

2025-07-28 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120427 H.J. Lu changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug testsuite/121285] New: FAIL: g++.dg/modules/class-11_a.H

2025-07-28 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121285 Bug ID: 121285 Summary: FAIL: g++.dg/modules/class-11_a.H Product: gcc Version: 16.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: testsuite

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-25 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 H.J. Lu changed: What|Removed |Added Attachment #61803|0 |1 is obsolete|

[Bug target/121208] Wrong user-level interrupt vector value with TLS variable when build with optimisation

2025-07-24 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121208 H.J. Lu changed: What|Removed |Added Attachment #61953|0 |1 is obsolete|

[Bug testsuite/120923] check-function-bodies doesn't support "1: call mcount"

2025-07-24 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120923 H.J. Lu changed: What|Removed |Added Resolution|--- |DUPLICATE Status|NEW

[Bug gcov-profile/120881] [16 Regression] -pg may not call mcount at function entry by r16-1550-g9244ea4bf55638

2025-07-24 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120881 --- Comment #32 from H.J. Lu --- *** Bug 120923 has been marked as a duplicate of this bug. ***

[Bug target/120936] [13/14/15/16 Regression] x86_function_profiler emits an unused label by r5-3767-gecc81e33123d7a

2025-07-24 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120936 Bug 120936 depends on bug 120923, which changed state. Bug 120923 Summary: check-function-bodies doesn't support "1: call mcount" https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120923 What|Removed |Added

[Bug target/121208] Wrong user-level interrupt vector value with TLS variable when build with optimisation

2025-07-23 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121208 H.J. Lu changed: What|Removed |Added Assignee|unassigned at gcc dot gnu.org |hjl.tools at gmail dot com --- Commen

[Bug target/121208] Wrong user-level interrupt vector value with TLS variable when build with optimisation

2025-07-21 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121208 H.J. Lu changed: What|Removed |Added CC||hjl.tools at gmail dot com See Also

[Bug rtl-optimization/121205] New: FAIL: gcc.dg/asm-hard-reg-2.c with -m32

2025-07-21 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121205 Bug ID: 121205 Summary: FAIL: gcc.dg/asm-hard-reg-2.c with -m32 Product: gcc Version: 16.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: rtl-opti

[Bug testsuite/121078] [16 regression] gcc.dg/aru-2.c etc. FAIL

2025-07-18 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121078 H.J. Lu changed: What|Removed |Added Attachment #61909|0 |1 is obsolete|

[Bug testsuite/121078] [16 regression] gcc.dg/aru-2.c etc. FAIL

2025-07-18 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121078 --- Comment #6 from H.J. Lu --- Created attachment 61909 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=61909&action=edit A patch Try this.

[Bug testsuite/121150] FAIL: 20_util/hash/int128.cc -std=c++17 execution test

2025-07-17 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121150 --- Comment #2 from H.J. Lu --- (In reply to Jonathan Wakely from comment #1) > There is no long anywhere here. The problem is that size_t is 32 bits, and > using an INT64 suffix won't change that: the value will still be too large > for size_t.

[Bug testsuite/121150] New: FAIL: 20_util/hash/int128.cc -std=c++17 execution test

2025-07-17 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121150 Bug ID: 121150 Summary: FAIL: 20_util/hash/int128.cc -std=c++17 execution test Product: gcc Version: 16.0 Status: UNCONFIRMED Severity: normal Prior

[Bug target/120908] *tls_(local|global)_dynamic_64_ has an implicit RDI clobber

2025-07-17 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120908 H.J. Lu changed: What|Removed |Added Target Milestone|--- |13.5 Status|NEW

[Bug tree-optimization/121138] New: [16 Regression] missed optimization in the loop caused by r16-303-g1a13684dfc7286

2025-07-16 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121138 Bug ID: 121138 Summary: [16 Regression] missed optimization in the loop caused by r16-303-g1a13684dfc7286 Product: gcc Version: 16.0 Status: UNCONFIRMED Severi

[Bug target/121062] integer registers of vector constant load shouldn't go through memory

2025-07-16 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121062 H.J. Lu changed: What|Removed |Added Target Milestone|--- |16.0 Status|NEW

[Bug testsuite/121078] [16 regression] gcc.dg/aru-2.c etc. FAIL

2025-07-16 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121078 --- Comment #3 from H.J. Lu --- (In reply to r...@cebitec.uni-bielefeld.de from comment #2) > > --- Comment #1 from H.J. Lu --- > > Please try > > > > https://patchwork.sourceware.org/project/gcc/list/?series=49715 > > Unfortunately, the tests

[Bug testsuite/121078] [16 regression] gcc.dg/aru-2.c etc. FAIL

2025-07-15 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121078 H.J. Lu changed: What|Removed |Added Status|NEW |WAITING

[Bug testsuite/121078] [16 regression] gcc.dg/aru-2.c etc. FAIL

2025-07-15 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121078 H.J. Lu changed: What|Removed |Added Ever confirmed|0 |1 Status|UNCONFIRMED

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-15 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #24 from H.J. Lu --- (In reply to Filip Kastl from comment #23) > testcase.c > enum { ST, SB, ET, EB, WT, WB } > LBM_initializeGrid() { > double *grid; > grid[ST] = grid[SB] = grid[ET] = grid[EB] = > grid[WT] = gr

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-15 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #22 from H.J. Lu --- (In reply to Filip Kastl from comment #21) > Oh, ok. I misunderstood. > > Well, you have SPEC CPU 2017, right? Then setting > No, I don't. Please extract a small testcase. > OPTIMIZE= -Ofast -march=znve

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-15 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #20 from H.J. Lu --- (In reply to Filip Kastl from comment #19) > Well, if you want to reproduce the lbm slowdown, you need a Zen2 or Zen5 > machine. I'm not sure how I would produce a testcase that would also > uncover the slowdown

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-14 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #18 from H.J. Lu --- (In reply to Filip Kastl from comment #17) > This is the replacement that causes the slowdown (well, two replacements): > > -- > Replace: > > (insn 2224 2228 20 (set (reg:V4DF 1604) >

[Bug target/121062] integer registers of vector constant load shouldn't go through memory

2025-07-14 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121062 --- Comment #5 from H.J. Lu --- Created attachment 61867 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=61867&action=edit A patch I am testing this combined patch.

[Bug target/121015] [16 regression] ICE when building blender for x86_64 with -O3 -mavx

2025-07-14 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121015 H.J. Lu changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug gcov-profile/120881] [16 Regression] -pg may not call mcount at function entry by r16-1550-g9244ea4bf55638

2025-07-14 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120881 H.J. Lu changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug target/121062] integer registers of vector constant load shouldn't go through memory

2025-07-14 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121062 H.J. Lu changed: What|Removed |Added Ever confirmed|0 |1 Status|UNCONFIRMED

[Bug target/121062] New: integer registers of vector constant load shouldn't go through memory

2025-07-14 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121062 Bug ID: 121062 Summary: integer registers of vector constant load shouldn't go through memory Product: gcc Version: 16.0 Status: UNCONFIRMED Severity: normal

[Bug target/120870] [16 regression] CPython miscompiled with preserve_none and CFLAGS="-O2 -march=znver2 -ggdb3"

2025-07-13 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120870 --- Comment #20 from H.J. Lu --- (In reply to Sam James from comment #19) > (In reply to Sam James from comment #18) > > (In reply to H.J. Lu from comment #17) > > > Created attachment 61837 [details] > > > A patch > > > > > > Please try this.

[Bug gcov-profile/121045] [16 Regression] FAIL: g++.dg/torture/pr58552.C caused by r16-2197-g385d9937f0e23c

2025-07-12 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121045 H.J. Lu changed: What|Removed |Added Ever confirmed|0 |1 Status|UNCONFIRMED

[Bug gcov-profile/121045] New: [16 Regression] FAIL: g++.dg/torture/pr58552.C caused by r16-2196-g52d9c2272f6366

2025-07-11 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121045 Bug ID: 121045 Summary: [16 Regression] FAIL: g++.dg/torture/pr58552.C caused by r16-2196-g52d9c2272f6366 Product: gcc Version: 16.0 Status: UNCONFIRMED Severi

[Bug target/120870] [16 regression] CPython miscompiled with preserve_none and CFLAGS="-O2 -march=znver2 -ggdb3"

2025-07-10 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120870 --- Comment #17 from H.J. Lu --- Created attachment 61837 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=61837&action=edit A patch Please try this. No idea why it works for me.

[Bug target/120870] [16 regression] CPython miscompiled with preserve_none and CFLAGS="-O2 -march=znver2 -ggdb3"

2025-07-10 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120870 --- Comment #15 from H.J. Lu --- (In reply to Sam James from comment #10) > Created attachment 61824 [details] > ceval.i.xz > > ceval.o is broken. > > ``` > $ gcc -c -fno-strict-overflow -O2 -mavx -mtune=znver2 -std=c11 > -fvisibility=hidden -

[Bug target/121015] [16 regression] ICE when building blender for x86_64 with -O3 -mavx

2025-07-10 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121015 H.J. Lu changed: What|Removed |Added Attachment #61829|0 |1 is obsolete|

[Bug target/121015] [16 regression] ICE when building blender for x86_64 with -O3 -mavx

2025-07-09 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121015 H.J. Lu changed: What|Removed |Added Attachment #61827|0 |1 is obsolete|

[Bug target/121015] [16 regression] ICE when building blender for x86_64 with -O3 -mavx

2025-07-09 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=121015 H.J. Lu changed: What|Removed |Added Last reconfirmed||2025-07-09 Assignee|unassigned at g

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-08 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 H.J. Lu changed: What|Removed |Added CC||rguenth at gcc dot gnu.org --- Comment #15 fr

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-08 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #14 from H.J. Lu --- (In reply to Filip Kastl from comment #11) > (In reply to H.J. Lu from comment #9) > > Created attachment 61803 [details] > > A patch > > > > Please try this. > > Tried applying this on top of r16-1644-gaba3b9d

[Bug target/119703] x86: spurious branches for inlined memset in ranges (40; 64) when requesting unrolled loops without simd

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119703 H.J. Lu changed: What|Removed |Added Target Milestone|--- |16.0 Status|UNCONFIRMED

[Bug target/101366] memset codegen for constant sized does not use SSE instructions

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101366 H.J. Lu changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 H.J. Lu changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug target/118276] memset 88 uses rep stosq while 80 uses SSE

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118276 H.J. Lu changed: What|Removed |Added Resolution|--- |FIXED Status|ASSIGNED

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 Bug 120725 depends on bug 118276, which changed state. Bug 118276 Summary: memset 88 uses rep stosq while 80 uses SSE https://gcc.gnu.org/bugzilla/show_bug.cgi?id=118276 What|Removed |Added

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 Bug 120725 depends on bug 102294, which changed state. Bug 102294 Summary: memset expansion is sometimes slow for small sizes https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102294 What|Removed |Added

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 Bug 120725 depends on bug 108585, which changed state. Bug 108585 Summary: memset uses SSE stores but afterwards does not but if used "" will use them https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108585 What|Removed

[Bug target/108585] memset uses SSE stores but afterwards does not but if used "" will use them

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108585 H.J. Lu changed: What|Removed |Added Target Milestone|--- |16.0 Resolution|---

[Bug target/102294] memset expansion is sometimes slow for small sizes

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102294 H.J. Lu changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 Bug 120725 depends on bug 119704, which changed state. Bug 119704 Summary: x86: partially disobeyed strategy rep-based request for inlined memset https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119704 What|Removed

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 Bug 120725 depends on bug 119703, which changed state. Bug 119703 Summary: x86: spurious branches for inlined memset in ranges (40; 64) when requesting unrolled loops without simd https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119703 W

[Bug target/119704] x86: partially disobeyed strategy rep-based request for inlined memset

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119704 H.J. Lu changed: What|Removed |Added Status|UNCONFIRMED |RESOLVED Resolution|---

[Bug target/120708] ix86_expand_set_or_cpymem ignores MOVE_MAX

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120708 H.J. Lu changed: What|Removed |Added Status|NEW |RESOLVED Resolution|---

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 Bug 120725 depends on bug 101366, which changed state. Bug 101366 Summary: memset codegen for constant sized does not use SSE instructions https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101366 What|Removed |Adde

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 Bug 120725 depends on bug 120708, which changed state. Bug 120708 Summary: ix86_expand_set_or_cpymem ignores MOVE_MAX https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120708 What|Removed |Added

[Bug target/84719] gcc's __builtin_memcpy performance with certain number of bytes is terrible compared to clang's

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84719 H.J. Lu changed: What|Removed |Added Target Milestone|--- |16.0 Resolution|---

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 Bug 120725 depends on bug 84719, which changed state. Bug 84719 Summary: gcc's __builtin_memcpy performance with certain number of bytes is terrible compared to clang's https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84719 What|Remo

[Bug target/70308] memset generates rep stosl instead of rep stosq

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70308 H.J. Lu changed: What|Removed |Added Status|ASSIGNED|RESOLVED Resolution|---

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 Bug 120725 depends on bug 70308, which changed state. Bug 70308 Summary: memset generates rep stosl instead of rep stosq https://gcc.gnu.org/bugzilla/show_bug.cgi?id=70308 What|Removed |Added --

[Bug go/120670] vector_loop caused FAIL: go.test/test/fixedbugs/issue13160.go execution, -O2 -g

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120670 H.J. Lu changed: What|Removed |Added Resolution|--- |FIXED Status|NEW

[Bug target/120725] [meta] Poor memcpy/memset inline strategies for -mtune=generic

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120725 Bug 120725 depends on bug 120683, which changed state. Bug 120683 Summary: vector_loop/unrolled_loop generates poor codes on memset/memcpy https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120683 What|Removed |Adde

[Bug target/120683] vector_loop/unrolled_loop generates poor codes on memset/memcpy

2025-07-07 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120683 H.J. Lu changed: What|Removed |Added Resolution|--- |FIXED Target Milestone|---

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-05 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #10 from H.J. Lu --- (In reply to Filip Kastl from comment #8) > The same commit (r16-1644-gaba3b9d3a48a07) causes ~20% slowdown of 470lbm > from 2006 SPEC on Zen5 with -Ofast -march=native -flto -fprofile-use. > > https://lnt.opens

[Bug target/120941] [16 Regression] 24-40% slowdown of 519.lbm_r on Zen2 and 470.lbm on Zen5 since r16-1644-gaba3b9d3a48a07

2025-07-04 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120941 --- Comment #9 from H.J. Lu --- Created attachment 61803 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=61803&action=edit A patch Please try this.

[Bug target/120900] C++ passes user aligned struct differently from C

2025-07-04 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120900 --- Comment #11 from H.J. Lu --- (In reply to H.J. Lu from comment #10) > This makes C similar to C++: > > diff --git a/gcc/c/c-decl.cc b/gcc/c/c-decl.cc > index 8bbd6ebc66a..0da6c65fc6a 100644 > --- a/gcc/c/c-decl.cc > +++ b/gcc/c/c-decl.cc >

[Bug target/120683] vector_loop/unrolled_loop generates poor codes on memset/memcpy

2025-07-04 Thread hjl.tools at gmail dot com via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=120683 H.J. Lu changed: What|Removed |Added CC||pheeck at gcc dot gnu.org --- Comment #4 from
