https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65478
--- Comment #24 from rguenther at suse dot de <rguenther at suse dot de> ---
On Wed, 1 Apr 2015, hubicka at ucw dot cz wrote:

> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65478
>
> --- Comment #23 from Jan Hubicka <hubicka at ucw dot cz> ---
> > Seems to be a regression with -flto only?  I also see EON regressing
> > without -flto.
> Yes, the inlining is cross file.
> >
> > http://gcc.opensuse.org/SPEC/CINT/sb-megrez-head-64/index.html
>
> Saw that one too. It is in between
> Feb 10, 2015 18:20 UTC
> (Values: Base: 164.gzip: 1558, 175.vpr: 2392, 176.gcc: 2845, 181.mcf: 3766,
> 186.crafty: 2926, 197.parser: 1975, 252.eon: 3726, 255.vortex: 3305,
> 256.bzip2: 2218, 300.twolf: 3257
> Peak: 164.gzip: 1546, 175.vpr: 2397, 176.gcc: 1994, 181.mcf: 3819,
> 186.crafty: 2737, 197.parser: 1911, 252.eon: 4461, 255.vortex: 4364,
> 256.bzip2: 2348, 300.twolf: 3265)
> Feb 10, 2015 09:20 UTC
> (Values: Base: 164.gzip: 1549, 175.vpr: 2452, 176.gcc: 2734, 181.mcf: 3458,
> 186.crafty: 2833, 197.parser: 1962, 252.eon: 4083, 255.vortex: 3378,
> 256.bzip2: 2059, 300.twolf: 3231
> Peak: 164.gzip: 1555, 175.vpr: 2241, 176.gcc: 2800, 181.mcf: 3821,
> 186.crafty: 2681, 197.parser: 1905, 252.eon: 4415, 255.vortex: 4363,
> 256.bzip2: 2379, 300.twolf: 3220)
>
> So it does not seem to point to inliner changes (fortunately).  At Megrez,
> you should be able to access the diff?

Yes, it's -r220566:220590, so it is very likely

+2015-02-10  Richard Biener  <rguent...@suse.de>
+
+       PR tree-optimization/64909
+       * tree-vect-loop.c (vect_estimate_min_profitable_iters): Properly
+       pass a scalar-stmt count estimate to the cost model.
+       * tree-vect-data-refs.c (vect_peeling_hash_get_lowest_cost): Likewise.

which fixed a pretty serious vectorizer cost-model issue on all AMD archs.
Might be worth investigating (not that the cost modeling is very good...).

On megrez, -march=native expands to

  -march=bdver2 -mmmx -mno-3dnow -msse -msse2 -msse3 -mssse3 -msse4a -mcx16
  -msahf -mno-movbe -maes -mno-sha -mpclmul -mpopcnt -mabm -mlwp -mfma
  -mfma4 -mxop -mbmi -mno-bmi2 -mtbm -mavx -mno-avx2 -msse4.2 -msse4.1
  -mlzcnt -mno-rtm -mno-hle -mno-rdrnd -mf16c -mno-fsgsbase -mno-rdseed
  -mprfchw -mno-adx -mfxsr -mxsave -mno-xsaveopt -mno-avx512f -mno-avx512er
  -mno-avx512cd -mno-avx512pf -mno-prefetchwt1 -mno-clflushopt -mno-xsavec
  -mno-xsaves -mno-avx512dq -mno-avx512bw -mno-avx512vl -mno-avx512ifma
  -mno-avx512vbmi -mno-clwb -mno-pcommit --param l1-cache-size=16
  --param l1-cache-line-size=64 --param l2-cache-size=2048 -mtune=bdver2
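
For anyone who wants to see where the two tester runs quoted in comment #23 differ, here is a minimal sketch that computes the per-benchmark change in the Base scores between the 09:20 UTC and 18:20 UTC runs. The numbers are copied from the quoted text. The dictionary and function names are made up for illustration, and the sketch assumes that a higher score is better.

```python
# Base scores copied from the two runs quoted in comment #23.
# Assumption: higher score is better; all names here are illustrative only.
BASE_0920 = {
    "164.gzip": 1549, "175.vpr": 2452, "176.gcc": 2734, "181.mcf": 3458,
    "186.crafty": 2833, "197.parser": 1962, "252.eon": 4083,
    "255.vortex": 3378, "256.bzip2": 2059, "300.twolf": 3231,
}
BASE_1820 = {
    "164.gzip": 1558, "175.vpr": 2392, "176.gcc": 2845, "181.mcf": 3766,
    "186.crafty": 2926, "197.parser": 1975, "252.eon": 3726,
    "255.vortex": 3305, "256.bzip2": 2218, "300.twolf": 3257,
}


def percent_change(before, after):
    """Return {benchmark: percent change going from `before` to `after`}."""
    return {
        name: 100.0 * (after[name] - before[name]) / before[name]
        for name in before
    }


def main():
    deltas = percent_change(BASE_0920, BASE_1820)
    # Print the largest drops first.
    for name, pct in sorted(deltas.items(), key=lambda item: item[1]):
        print(f"{name:12s} {pct:+6.1f}%")


if __name__ == "__main__":
    main()
```

On these numbers 252.eon Base goes from 4083 to 3726, a drop of roughly 8.7%, which is the largest Base drop among the ten benchmarks between the two runs.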