During "bashmark" memory benchmark perfomance analyze, I found ~100x perfomance
regression between gcc 3.4.5 and gcc 4.X.

Compiler options: -march=athlon-xp -O3

test_cmd execution time:
- GCC 3.4.5: 0.43user 0.00system 0:00.44elapsed
- GCC 4.0.2: 34.83user 0.68system 0:36.09elapsed
- GCC 4.1.0: 33.86user 0.58system 0:34.96elapsed

Lurking inside assembler generation showed that GCC4 don't inline memcpy and
memset calls. (I can attach assembler code on request)

So, it looks like GCC4 inliner is broken at some point.

--------- GCC 3.4.5 output------------------------------------------
Reading specs from /usr/lib/gcc/i686-pc-linux-gnu/3.4.5/specs
Configured with: /mnt/oktet/tmp/portage/gcc-3.4.5-r1/work/gcc-3.4.5/configure
--prefix=/usr --bindir=/usr/i686-pc-linux-gnu/gcc-bin/3.4.5
--includedir=/usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include
--datadir=/usr/share/gcc-data/i686-pc-linux-gnu/3.4.5
--mandir=/usr/share/gcc-data/i686-pc-linux-gnu/3.4.5/man
--infodir=/usr/share/gcc-data/i686-pc-linux-gnu/3.4.5/info
--with-gxx-include-dir=/usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include/g++-v3
--host=i686-pc-linux-gnu --build=i686-pc-linux-gnu --disable-altivec
--enable-nls --without-included-gettext --with-system-zlib --disable-checking
--disable-werror --disable-libunwind-exceptions --disable-multilib
--disable-libgcj --enable-languages=c,c++,f77 --enable-shared
--enable-threads=posix --enable-__cxa_atexit --enable-clocale=gnu
Thread model: posix
gcc version 3.4.5 (Gentoo 3.4.5-r1, ssp-3.4.5-1.0, pie-8.7.9)
 /usr/libexec/gcc/i686-pc-linux-gnu/3.4.5/cc1plus -E -quiet -v -D_GNU_SOURCE
test_cmd.cpp -march=athlon-xp -O3 -o test_cmd.ii
ignoring nonexistent directory
"/usr/lib/gcc/i686-pc-linux-gnu/3.4.5/../../../../i686-pc-linux-gnu/include"
#include "..." search starts here:
#include <...> search starts here:
 /usr/include/libffi
 /usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include/g++-v3
 /usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include/g++-v3/i686-pc-linux-gnu
 /usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include/g++-v3/backward
 /usr/local/include
 /usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include
 /usr/include
End of search list.
 /usr/libexec/gcc/i686-pc-linux-gnu/3.4.5/cc1plus -fpreprocessed test_cmd.ii
-quiet -dumpbase test_cmd.cpp -march=athlon-xp -auxbase test_cmd -O3 -version
-o test_cmd.s
GNU C++ version 3.4.5 (Gentoo 3.4.5-r1, ssp-3.4.5-1.0, pie-8.7.9)
(i686-pc-linux-gnu)
        compiled by GNU C version 3.4.5 (Gentoo 3.4.5, ssp-3.4.5-1.0,
pie-8.7.9).
GGC heuristics: --param ggc-min-expand=99 --param ggc-min-heapsize=129319
---------------------------------------------------

-----------GCC 4.0.2 output -----------------------
Using built-in specs.
Target: i686-pc-linux-gnu
Configured with: /mnt/oktet/tmp/portage/gcc-4.0.2-r3/work/gcc-4.0.2/configure
--prefix=/usr --bindir=/usr/i686-pc-linux-gnu/gcc-bin/4.0.2
--includedir=/usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include
--datadir=/usr/share/gcc-data/i686-pc-linux-gnu/4.0.2
--mandir=/usr/share/gcc-data/i686-pc-linux-gnu/4.0.2/man
--infodir=/usr/share/gcc-data/i686-pc-linux-gnu/4.0.2/info
--with-gxx-include-dir=/usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include/g++-v4
--host=i686-pc-linux-gnu --build=i686-pc-linux-gnu --disable-altivec
--disable-nls --with-system-zlib --disable-checking --disable-werror
--disable-libunwind-exceptions --disable-multilib --disable-libmudflap
--disable-libssp --disable-libgcj --enable-languages=c,c++ --enable-shared
--enable-threads=posix --enable-__cxa_atexit --enable-clocale=gnu
Thread model: posix
gcc version 4.0.2 (Gentoo 4.0.2-r3, pie-8.7.8)
 /usr/libexec/gcc/i686-pc-linux-gnu/4.0.2/cc1plus -E -quiet -v -D_GNU_SOURCE
test_cmd.cpp -march=athlon-xp -O3 -fpch-preprocess -o test_cmd.ii
ignoring nonexistent directory
"/usr/lib/gcc/i686-pc-linux-gnu/4.0.2/../../../../i686-pc-linux-gnu/include"
#include "..." search starts here:
#include <...> search starts here:
 /usr/include/libffi
 /usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include/g++-v4
 /usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include/g++-v4/i686-pc-linux-gnu
 /usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include/g++-v4/backward
 /usr/local/include
 /usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include
 /usr/include
End of search list.
 /usr/libexec/gcc/i686-pc-linux-gnu/4.0.2/cc1plus -fpreprocessed test_cmd.ii
-quiet -dumpbase test_cmd.cpp -march=athlon-xp -auxbase test_cmd -O3 -version
-o test_cmd.s
GNU C++ version 4.0.2 (Gentoo 4.0.2-r3, pie-8.7.8) (i686-pc-linux-gnu)
        compiled by GNU C version 4.0.2 (Gentoo 4.0.2-r3, pie-8.7.8).
GGC heuristics: --param ggc-min-expand=99 --param ggc-min-heapsize=129319
---------------------------------------------------

----------GCC 4.1.0 output-------------------------
Using built-in specs.
Target: i686-pc-linux-gnu
Configured with: /mnt/oktet/tmp/portage/gcc-4.1.0-r2/work/gcc-4.1.0/configure
--prefix=/usr --bindir=/usr/i686-pc-linux-gnu/gcc-bin/4.1.0
--includedir=/usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include
--datadir=/usr/share/gcc-data/i686-pc-linux-gnu/4.1.0
--mandir=/usr/share/gcc-data/i686-pc-linux-gnu/4.1.0/man
--infodir=/usr/share/gcc-data/i686-pc-linux-gnu/4.1.0/info
--with-gxx-include-dir=/usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include/g++-v4
--host=i686-pc-linux-gnu --build=i686-pc-linux-gnu --disable-altivec
--enable-nls --without-included-gettext --with-system-zlib --disable-checking
--disable-werror --disable-libunwind-exceptions --disable-multilib
--disable-libmudflap --disable-libssp --enable-java-awt=gtk
--enable-languages=c,c++,java,objc,fortran --enable-shared
--enable-threads=posix --enable-__cxa_atexit --enable-clocale=gnu
Thread model: posix
gcc version 4.1.0 (Gentoo 4.1.0-r2, pie-8.7.8)
 /usr/libexec/gcc/i686-pc-linux-gnu/4.1.0/cc1plus -E -quiet -v -D_GNU_SOURCE
test_cmd.cpp -march=athlon-xp -O3 -fpch-preprocess -o test_cmd.ii
ignoring nonexistent directory
"/usr/lib/gcc/i686-pc-linux-gnu/4.1.0/../../../../i686-pc-linux-gnu/include"
#include "..." search starts here:
#include <...> search starts here:
 /usr/include/libffi
 /usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include/g++-v4
 /usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include/g++-v4/i686-pc-linux-gnu
 /usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include/g++-v4/backward
 /usr/local/include
 /usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include
 /usr/include
End of search list.
 /usr/libexec/gcc/i686-pc-linux-gnu/4.1.0/cc1plus -fpreprocessed test_cmd.ii
-quiet -dumpbase test_cmd.cpp -march=athlon-xp -auxbase test_cmd -O3 -version
-o test_cmd.s
GNU C++ version 4.1.0 (Gentoo 4.1.0-r2, pie-8.7.8) (i686-pc-linux-gnu)
        compiled by GNU C version 4.1.0 (Gentoo 4.1.0-r2, pie-8.7.8).
GGC heuristics: --param ggc-min-expand=99 --param ggc-min-heapsize=129319
Compiler executable checksum: d3096f5bd00a04a18edac8d63d29a37f
---------------------------------------------------


-- 
           Summary: perfomance regression between gcc 3.4.5 and 4.*
           Product: gcc
           Version: 4.1.0
            Status: UNCONFIRMED
          Severity: normal
          Priority: P3
         Component: regression
        AssignedTo: unassigned at gcc dot gnu dot org
        ReportedBy: nbkolchin at gmail dot com


http://gcc.gnu.org/bugzilla/show_bug.cgi?id=26658

Reply via email to