During "bashmark" memory benchmark perfomance analyze, I found ~100x perfomance regression between gcc 3.4.5 and gcc 4.X.
Compiler options: -march=athlon-xp -O3 test_cmd execution time: - GCC 3.4.5: 0.43user 0.00system 0:00.44elapsed - GCC 4.0.2: 34.83user 0.68system 0:36.09elapsed - GCC 4.1.0: 33.86user 0.58system 0:34.96elapsed Lurking inside assembler generation showed that GCC4 don't inline memcpy and memset calls. (I can attach assembler code on request) So, it looks like GCC4 inliner is broken at some point. --------- GCC 3.4.5 output------------------------------------------ Reading specs from /usr/lib/gcc/i686-pc-linux-gnu/3.4.5/specs Configured with: /mnt/oktet/tmp/portage/gcc-3.4.5-r1/work/gcc-3.4.5/configure --prefix=/usr --bindir=/usr/i686-pc-linux-gnu/gcc-bin/3.4.5 --includedir=/usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include --datadir=/usr/share/gcc-data/i686-pc-linux-gnu/3.4.5 --mandir=/usr/share/gcc-data/i686-pc-linux-gnu/3.4.5/man --infodir=/usr/share/gcc-data/i686-pc-linux-gnu/3.4.5/info --with-gxx-include-dir=/usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include/g++-v3 --host=i686-pc-linux-gnu --build=i686-pc-linux-gnu --disable-altivec --enable-nls --without-included-gettext --with-system-zlib --disable-checking --disable-werror --disable-libunwind-exceptions --disable-multilib --disable-libgcj --enable-languages=c,c++,f77 --enable-shared --enable-threads=posix --enable-__cxa_atexit --enable-clocale=gnu Thread model: posix gcc version 3.4.5 (Gentoo 3.4.5-r1, ssp-3.4.5-1.0, pie-8.7.9) /usr/libexec/gcc/i686-pc-linux-gnu/3.4.5/cc1plus -E -quiet -v -D_GNU_SOURCE test_cmd.cpp -march=athlon-xp -O3 -o test_cmd.ii ignoring nonexistent directory "/usr/lib/gcc/i686-pc-linux-gnu/3.4.5/../../../../i686-pc-linux-gnu/include" #include "..." search starts here: #include <...> search starts here: /usr/include/libffi /usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include/g++-v3 /usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include/g++-v3/i686-pc-linux-gnu /usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include/g++-v3/backward /usr/local/include /usr/lib/gcc/i686-pc-linux-gnu/3.4.5/include /usr/include End of search list. /usr/libexec/gcc/i686-pc-linux-gnu/3.4.5/cc1plus -fpreprocessed test_cmd.ii -quiet -dumpbase test_cmd.cpp -march=athlon-xp -auxbase test_cmd -O3 -version -o test_cmd.s GNU C++ version 3.4.5 (Gentoo 3.4.5-r1, ssp-3.4.5-1.0, pie-8.7.9) (i686-pc-linux-gnu) compiled by GNU C version 3.4.5 (Gentoo 3.4.5, ssp-3.4.5-1.0, pie-8.7.9). GGC heuristics: --param ggc-min-expand=99 --param ggc-min-heapsize=129319 --------------------------------------------------- -----------GCC 4.0.2 output ----------------------- Using built-in specs. Target: i686-pc-linux-gnu Configured with: /mnt/oktet/tmp/portage/gcc-4.0.2-r3/work/gcc-4.0.2/configure --prefix=/usr --bindir=/usr/i686-pc-linux-gnu/gcc-bin/4.0.2 --includedir=/usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include --datadir=/usr/share/gcc-data/i686-pc-linux-gnu/4.0.2 --mandir=/usr/share/gcc-data/i686-pc-linux-gnu/4.0.2/man --infodir=/usr/share/gcc-data/i686-pc-linux-gnu/4.0.2/info --with-gxx-include-dir=/usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include/g++-v4 --host=i686-pc-linux-gnu --build=i686-pc-linux-gnu --disable-altivec --disable-nls --with-system-zlib --disable-checking --disable-werror --disable-libunwind-exceptions --disable-multilib --disable-libmudflap --disable-libssp --disable-libgcj --enable-languages=c,c++ --enable-shared --enable-threads=posix --enable-__cxa_atexit --enable-clocale=gnu Thread model: posix gcc version 4.0.2 (Gentoo 4.0.2-r3, pie-8.7.8) /usr/libexec/gcc/i686-pc-linux-gnu/4.0.2/cc1plus -E -quiet -v -D_GNU_SOURCE test_cmd.cpp -march=athlon-xp -O3 -fpch-preprocess -o test_cmd.ii ignoring nonexistent directory "/usr/lib/gcc/i686-pc-linux-gnu/4.0.2/../../../../i686-pc-linux-gnu/include" #include "..." search starts here: #include <...> search starts here: /usr/include/libffi /usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include/g++-v4 /usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include/g++-v4/i686-pc-linux-gnu /usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include/g++-v4/backward /usr/local/include /usr/lib/gcc/i686-pc-linux-gnu/4.0.2/include /usr/include End of search list. /usr/libexec/gcc/i686-pc-linux-gnu/4.0.2/cc1plus -fpreprocessed test_cmd.ii -quiet -dumpbase test_cmd.cpp -march=athlon-xp -auxbase test_cmd -O3 -version -o test_cmd.s GNU C++ version 4.0.2 (Gentoo 4.0.2-r3, pie-8.7.8) (i686-pc-linux-gnu) compiled by GNU C version 4.0.2 (Gentoo 4.0.2-r3, pie-8.7.8). GGC heuristics: --param ggc-min-expand=99 --param ggc-min-heapsize=129319 --------------------------------------------------- ----------GCC 4.1.0 output------------------------- Using built-in specs. Target: i686-pc-linux-gnu Configured with: /mnt/oktet/tmp/portage/gcc-4.1.0-r2/work/gcc-4.1.0/configure --prefix=/usr --bindir=/usr/i686-pc-linux-gnu/gcc-bin/4.1.0 --includedir=/usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include --datadir=/usr/share/gcc-data/i686-pc-linux-gnu/4.1.0 --mandir=/usr/share/gcc-data/i686-pc-linux-gnu/4.1.0/man --infodir=/usr/share/gcc-data/i686-pc-linux-gnu/4.1.0/info --with-gxx-include-dir=/usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include/g++-v4 --host=i686-pc-linux-gnu --build=i686-pc-linux-gnu --disable-altivec --enable-nls --without-included-gettext --with-system-zlib --disable-checking --disable-werror --disable-libunwind-exceptions --disable-multilib --disable-libmudflap --disable-libssp --enable-java-awt=gtk --enable-languages=c,c++,java,objc,fortran --enable-shared --enable-threads=posix --enable-__cxa_atexit --enable-clocale=gnu Thread model: posix gcc version 4.1.0 (Gentoo 4.1.0-r2, pie-8.7.8) /usr/libexec/gcc/i686-pc-linux-gnu/4.1.0/cc1plus -E -quiet -v -D_GNU_SOURCE test_cmd.cpp -march=athlon-xp -O3 -fpch-preprocess -o test_cmd.ii ignoring nonexistent directory "/usr/lib/gcc/i686-pc-linux-gnu/4.1.0/../../../../i686-pc-linux-gnu/include" #include "..." search starts here: #include <...> search starts here: /usr/include/libffi /usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include/g++-v4 /usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include/g++-v4/i686-pc-linux-gnu /usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include/g++-v4/backward /usr/local/include /usr/lib/gcc/i686-pc-linux-gnu/4.1.0/include /usr/include End of search list. /usr/libexec/gcc/i686-pc-linux-gnu/4.1.0/cc1plus -fpreprocessed test_cmd.ii -quiet -dumpbase test_cmd.cpp -march=athlon-xp -auxbase test_cmd -O3 -version -o test_cmd.s GNU C++ version 4.1.0 (Gentoo 4.1.0-r2, pie-8.7.8) (i686-pc-linux-gnu) compiled by GNU C version 4.1.0 (Gentoo 4.1.0-r2, pie-8.7.8). GGC heuristics: --param ggc-min-expand=99 --param ggc-min-heapsize=129319 Compiler executable checksum: d3096f5bd00a04a18edac8d63d29a37f --------------------------------------------------- -- Summary: perfomance regression between gcc 3.4.5 and 4.* Product: gcc Version: 4.1.0 Status: UNCONFIRMED Severity: normal Priority: P3 Component: regression AssignedTo: unassigned at gcc dot gnu dot org ReportedBy: nbkolchin at gmail dot com http://gcc.gnu.org/bugzilla/show_bug.cgi?id=26658