https://gcc.gnu.org/bugzilla/show_bug.cgi?id=77484

--- Comment #38 from Dominik Vogt <vogt at linux dot vnet.ibm.com> ---
Finally, the cumulative results: the state after the last patch compared with
the state before the first.  Overall,
some tests gain some performance and others lose some.  The total number of
instructions has grown somewhat (especially tonto, calculix, dealII and wrf),
but there's no obvious connection between an increased number of instructions
and loss of performance.

Is this what can be expected of the patches?

All compiled with -O3 -funroll-loops -march=zEC12.

r244260 vs. r243994
-------------------
                               run-old.result                run-new.result
f410.bwaves                             1.28s    1.27s (  -0.78%,   0.79% )
f416.gamess                             7.10s    6.82s (  -3.94%,   4.11% )
f433.milc                               5.53s    5.53s (   0.00%,   0.00% )
f434.zeusmp                             2.19s    2.18s (  -0.46%,   0.46% )
f435.gromacs                            1.34s    1.33s (  -0.75%,   0.75% )
f436.cactusADM                         24.72s   24.80s (   0.32%,  -0.32% )
f437.leslie3d                           2.76s    2.75s (  -0.36%,   0.36% )
f444.namd                              12.13s   12.13s (   0.00%,   0.00% )
f447.dealII                             2.03s    2.02s (  -0.49%,   0.50% )
f450.soplex                             3.90s    3.92s (   0.51%,  -0.51% )
f453.povray                             2.88s    2.86s (  -0.69%,   0.70% )
f454.calculix                          17.32s   17.36s (   0.23%,  -0.23% )
f459.GemsFDTD                           7.22s    7.13s (  -1.25%,   1.26% )
f465.tonto                              0.93s    0.93s (   0.00%,   0.00% )
f470.lbm                                2.65s    2.66s (   0.38%,  -0.38% )
f481.wrf                                3.84s    3.84s (   0.00%,   0.00% )
f482.sphinx3                           10.49s   10.56s (   0.67%,  -0.66% )
i400.perlbench                          7.58s    7.25s (  -4.35%,   4.55% )
i401.bzip2                              3.98s    3.96s (  -0.50%,   0.51% )
i403.gcc                                1.00s    1.01s (   1.00%,  -0.99% )
i429.mcf                                1.49s    1.49s (   0.00%,   0.00% )
i445.gobmk                              3.55s    3.53s (  -0.56%,   0.57% )
i456.hmmer                              1.56s    1.55s (  -0.64%,   0.65% )
i458.sjeng                              3.81s    3.79s (  -0.52%,   0.53% )
i462.libquantum                        17.12s   17.11s (  -0.06%,   0.06% )
i464.h264ref                            3.14s    3.17s (   0.96%,  -0.95% )
i471.omnetpp                           11.39s   11.52s (   1.14%,  -1.13% )
i473.astar                              7.22s    7.26s (   0.55%,  -0.55% )
i483.xalancbmk                          7.62s    7.69s (   0.92%,  -0.91% )
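
The two percentages in each row are not labelled. They appear to be the change in run time relative to the old result and the inverse change relative to the new result. The following is a minimal sketch under that assumption; the function name `runtime_deltas` is made up for illustration and is not part of whatever script produced the table.

```python
def runtime_deltas(old_s, new_s):
    """Return (change relative to old, change relative to new), in percent.

    Assumption: the first column is (new - old) / old and the second is
    (old - new) / new, both rounded to two decimals.
    """
    rel_old = (new_s - old_s) / old_s * 100.0
    rel_new = (old_s - new_s) / new_s * 100.0
    return round(rel_old, 2), round(rel_new, 2)


# Two rows from the table: f416.gamess and i403.gcc.
print(runtime_deltas(7.10, 6.82))
print(runtime_deltas(1.00, 1.01))
```

Under this reading, a negative first value means the new build ran faster.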

--

f470.lbm 2984 insns identical
i429.mcf 4165 insns -4 smaller
i462.libquantum 11735 insns +0 changed
i473.astar 12460 insns +32 BIGGER!, 2 funcs bigger (max +79 insns)
f410.bwaves 9820 insns +7 BIGGER!, 1 funcs bigger (max +7 insns)
i401.bzip2 22439 insns -63 smaller
f437.leslie3d 28725 insns +9 BIGGER!, 5 funcs bigger (max +19 insns)
i458.sjeng 38864 insns -26 smaller, 2 funcs bigger (max +24 insns)
f433.milc 35091 insns -70 smaller, 1 funcs bigger (max +5 insns)
f482.sphinx3 51879 insns +4 BIGGER!, 5 funcs bigger (max +15 insns)
i456.hmmer 85157 insns -33 smaller, 4 funcs bigger (max +91 insns)
f444.namd 76220 insns -3 smaller
f434.zeusmp 73937 insns +43 BIGGER!, 3 funcs bigger (max +27 insns)
f459.GemsFDTD 111465 insns +84 BIGGER!, 5 funcs bigger (max +57 insns)
f436.cactusADM 201648 insns +125 BIGGER!, 37 funcs bigger (max +68 insns)
f435.gromacs 250725 insns -53 smaller, 11 funcs bigger (max +25 insns)
i471.omnetpp 135992 insns -435 smaller, 15 funcs bigger (max +80 insns)
i445.gobmk 249112 insns -1167 smaller, 16 funcs bigger (max +82 insns)
f450.soplex 131531 insns -558 smaller, 22 funcs bigger (max +18 insns)
f453.povray 247399 insns -48 smaller, 3 funcs bigger (max +92 insns)
i400.perlbench 305683 insns -216 smaller, 51 funcs bigger (max +554 insns)
f454.calculix 478026 insns +485 BIGGER!, 22 funcs bigger (max +157 insns)
i464.h264ref 316483 insns -76 smaller, 8 funcs bigger (max +76 insns)
i403.gcc 800574 insns -782 smaller, 100 funcs bigger (max +1674 insns)
f465.tonto 1138432 insns +2511 BIGGER!, 235 funcs bigger (max +455 insns)
f447.dealII 764322 insns +597 BIGGER!, 171 funcs bigger (max +295 insns)
f481.wrf 1081604 insns +2769 BIGGER!, 141 funcs bigger (max +2329 insns)
i483.xalancbmk 919758 insns -483 smaller, 277 funcs bigger (max +1002 insns)
f416.gamess 2553939 insns -1589 smaller, 127 funcs bigger (max +46 insns)

statistics:
-----------
29      tests (total)
11      test executables have grown (more insns)
16      test executables have shrunk (fewer insns)
10140169        insns total (old)
+1060   insns difference
+104    insns per 1,000,000
-360    weighted insns per 1,000,000 *
1264    functions have grown (total) **
+2329   insns in most grown function
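
As a cross-check, the unweighted statistics can be recomputed from the per-benchmark listing. This is only a sketch: the data below is transcribed by hand from the listing, the listed instruction counts are treated as the old totals (their sum matches the "insns total (old)" line), and truncating the per-million figure is an assumption chosen to match the "+104" shown. The weighted figure is not reproduced because its footnote is not part of this message.

```python
# Columns: benchmark, old insn count, insn delta, number of functions that grew.
# Transcribed from the per-benchmark listing in this comment.
ROWS = """
f470.lbm 2984 0 0
i429.mcf 4165 -4 0
i462.libquantum 11735 0 0
i473.astar 12460 32 2
f410.bwaves 9820 7 1
i401.bzip2 22439 -63 0
f437.leslie3d 28725 9 5
i458.sjeng 38864 -26 2
f433.milc 35091 -70 1
f482.sphinx3 51879 4 5
i456.hmmer 85157 -33 4
f444.namd 76220 -3 0
f434.zeusmp 73937 43 3
f459.GemsFDTD 111465 84 5
f436.cactusADM 201648 125 37
f435.gromacs 250725 -53 11
i471.omnetpp 135992 -435 15
i445.gobmk 249112 -1167 16
f450.soplex 131531 -558 22
f453.povray 247399 -48 3
i400.perlbench 305683 -216 51
f454.calculix 478026 485 22
i464.h264ref 316483 -76 8
i403.gcc 800574 -782 100
f465.tonto 1138432 2511 235
f447.dealII 764322 597 171
f481.wrf 1081604 2769 141
i483.xalancbmk 919758 -483 277
f416.gamess 2553939 -1589 127
"""


def summarize(rows_text):
    """Recompute the unweighted summary numbers from the rows above."""
    rows = [line.split() for line in rows_text.split("\n") if line.strip()]
    deltas = [int(r[2]) for r in rows]
    total_old = sum(int(r[1]) for r in rows)
    diff = sum(deltas)
    return {
        "tests": len(rows),
        "grown": sum(d > 0 for d in deltas),
        "shrunk": sum(d < 0 for d in deltas),
        "insns_total_old": total_old,
        "insns_diff": diff,
        # Assumption: the per-million figure is truncated, not rounded.
        "per_million": int(diff * 1_000_000 / total_old),
        "funcs_grown": sum(int(r[3]) for r in rows),
    }


print(summarize(ROWS))
```

The recomputed values agree with the statistics above: 29 tests, 11 grown, 16 shrunk, and a net difference of +1060 instructions.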
