From: "Cui,Lili" <lili....@intel.com> Hi,
I have four patches for tremont tuning, With all patches applied, performance impacts on SPEC CPU 2017 are: 500.perlbench_r 1.81% 502.gcc_r 0.57% 505.mcf_r 1.16% 520.omnetpp_r 0.00% 523.xalancbmk_r 0.00% 525.x264_r 4.55% 531.deepsjeng_r 0.00% 541.leela_r 0.39% 548.exchange2_r 1.13% 557.xz_r 0.00% geomean for intrate 0.95% 503.bwaves_r 0.00% 507.cactuBSSN_r 6.94% 508.namd_r 12.37% 510.parest_r 1.01% 511.povray_r 3.70% 519.lbm_r 36.61% 521.wrf_r 8.79% 526.blender_r 2.91% 527.cam4_r 6.23% 538.imagick_r 0.28% 544.nab_r 21.99% 549.fotonik3d_r 3.63% 554.roms_r -1.20% geomean for fprate 7.50% Bootstrapped and regtested on x86_64-linux-gnu{-m32,-m64}. Ok for master? x86: Update -mtune=tremont x86: Update memcpy/memset inline strategies for -mtune=tremont x86: Properly handle USE_VECTOR_FP_CONVERTS/USE_VECTOR_CONVERTS x86: Add TARGET_SSE_PARTIAL_REG_[FP_]CONVERTS_DEPENDENCY gcc/common/config/i386/i386-common.c | 2 +- gcc/config/i386/i386-features.c | 23 +++- gcc/config/i386/i386-options.c | 2 +- gcc/config/i386/i386.c | 1 + gcc/config/i386/i386.h | 4 + gcc/config/i386/i386.md | 9 +- gcc/config/i386/x86-tune-costs.h | 124 ++++++++++++++++++ gcc/config/i386/x86-tune-sched.c | 2 + gcc/config/i386/x86-tune.def | 52 +++++--- gcc/testsuite/gcc.target/i386/avx-covert-1.c | 19 +++ .../gcc.target/i386/avx-fp-covert-1.c | 15 +++ .../gcc.target/i386/avx-int-covert-1.c | 14 ++ gcc/testsuite/gcc.target/i386/pr101900-1.c | 18 +++ gcc/testsuite/gcc.target/i386/pr101900-2.c | 18 +++ gcc/testsuite/gcc.target/i386/pr101900-3.c | 19 +++ gcc/testsuite/gcc.target/i386/sse-covert-1.c | 19 +++ .../gcc.target/i386/sse-fp-covert-1.c | 15 +++ .../gcc.target/i386/sse-int-covert-1.c | 14 ++ 18 files changed, 344 insertions(+), 26 deletions(-) create mode 100644 gcc/testsuite/gcc.target/i386/avx-covert-1.c create mode 100644 gcc/testsuite/gcc.target/i386/avx-fp-covert-1.c create mode 100644 gcc/testsuite/gcc.target/i386/avx-int-covert-1.c create mode 100644 gcc/testsuite/gcc.target/i386/pr101900-1.c create mode 100644 gcc/testsuite/gcc.target/i386/pr101900-2.c create mode 100644 gcc/testsuite/gcc.target/i386/pr101900-3.c create mode 100644 gcc/testsuite/gcc.target/i386/sse-covert-1.c create mode 100644 gcc/testsuite/gcc.target/i386/sse-fp-covert-1.c create mode 100644 gcc/testsuite/gcc.target/i386/sse-int-covert-1.c -- 2.17.1 Thanks, Lili.