https://gcc.gnu.org/bugzilla/show_bug.cgi?id=103592
Bug ID: 103592
Summary: fatigue2 benchmark on zen runs 43% faster with
-fno-tree-vectorize -fno-tree-slp-vectorize
Product: gcc
Version: 12.0
Status: UNCONFIRMED
Severity: normal
Priority: P3
Component: tree-optimization
Assignee: unassigned at gcc dot gnu.org
Reporter: hubicka at gcc dot gnu.org
Target Milestone: ---
While looking into the -fno-inline-functions-called-once difference, I noticed
that on zen hardware I get:
- 0m33s runtime for the fatigue2 benchmark (from phoronix) when built with -Ofast
-march=native -fno-tree-slp-vectorize -fno-tree-vectorize
- 0m57s runtime when built with -Ofast -march=native alone
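
For reference, the two timings can be turned into percentages as follows. This is only a sketch using the whole-second values quoted above. The helper names are made up for illustration, and the small gap to the 43% in the summary is assumed to come from rounding of the reported times.

```python
# Timings as reported above, rounded to whole seconds.
novec_s = 33.0  # -Ofast -march=native with both vectorizers disabled
vec_s = 57.0    # -Ofast -march=native (vectorization enabled)


def runtime_reduction(slow, fast):
    """Percentage of the slower runtime saved by the faster build."""
    return (slow - fast) / slow * 100.0


def speedup(slow, fast):
    """How many times faster the faster build is."""
    return slow / fast


print(f"runtime reduction: {runtime_reduction(vec_s, novec_s):.1f}%")  # 42.1%
print(f"speedup: {speedup(vec_s, novec_s):.2f}x")                      # 1.73x
```

So the figure in the summary is best read as a reduction in runtime of roughly 42%, which corresponds to the non-vectorized binary running about 1.7 times as fast.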