https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65478

--- Comment #24 from rguenther at suse dot de <rguenther at suse dot de> ---
On Wed, 1 Apr 2015, hubicka at ucw dot cz wrote:

> https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65478
> 
> --- Comment #23 from Jan Hubicka <hubicka at ucw dot cz> ---
> > Seems to be a regression with -flto only?  I also see EON regressing without
> > -flto.
> Yes, the inlining is cross file.
> > 
> > http://gcc.opensuse.org/SPEC/CINT/sb-megrez-head-64/index.html
> 
> Saw that one too. It is in between 
> Feb 10, 2015 18:20 UTC
>     Base: 164.gzip: 1558, 175.vpr: 2392, 176.gcc: 2845, 181.mcf: 3766,
>           186.crafty: 2926, 197.parser: 1975, 252.eon: 3726,
>           255.vortex: 3305, 256.bzip2: 2218, 300.twolf: 3257
>     Peak: 164.gzip: 1546, 175.vpr: 2397, 176.gcc: 1994, 181.mcf: 3819,
>           186.crafty: 2737, 197.parser: 1911, 252.eon: 4461,
>           255.vortex: 4364, 256.bzip2: 2348, 300.twolf: 3265
> Feb 10, 2015 09:20 UTC
>     Base: 164.gzip: 1549, 175.vpr: 2452, 176.gcc: 2734, 181.mcf: 3458,
>           186.crafty: 2833, 197.parser: 1962, 252.eon: 4083,
>           255.vortex: 3378, 256.bzip2: 2059, 300.twolf: 3231
>     Peak: 164.gzip: 1555, 175.vpr: 2241, 176.gcc: 2800, 181.mcf: 3821,
>           186.crafty: 2681, 197.parser: 1905, 252.eon: 4415,
>           255.vortex: 4363, 256.bzip2: 2379, 300.twolf: 3220
> 
> So it does not seem to point to inliner changes (fortunately).  At Megrez, you
> should be able to access the diff?

Yes, it's -r220566:220590, so it is very likely

+2015-02-10  Richard Biener  <rguent...@suse.de>
+
+       PR tree-optimization/64909
+       * tree-vect-loop.c (vect_estimate_min_profitable_iters): Properly
+       pass a scalar-stmt count estimate to the cost model.
+       * tree-vect-data-refs.c (vect_peeling_hash_get_lowest_cost): Likewise.

which fixed a pretty serious vectorizer cost-model issue on all AMD archs.

Might be worth investigating (not that the cost modeling is very good...).
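
To put numbers on the two tester runs quoted above, here is a minimal
sketch that computes the per-benchmark change between the 09:20 and
18:20 UTC runs.  The values are copied from the quoted comment; the
names RUNS, percent_change and deltas are made up for illustration, and
the sketch assumes the values are scores where higher is better.

```python
# Values copied from the two quoted tester runs (Feb 10, 2015).
# Assumption: these are scores where a higher number is better.
BENCHMARKS = [
    "164.gzip", "175.vpr", "176.gcc", "181.mcf", "186.crafty",
    "197.parser", "252.eon", "255.vortex", "256.bzip2", "300.twolf",
]

RUNS = {
    "09:20": {
        "base": [1549, 2452, 2734, 3458, 2833, 1962, 4083, 3378, 2059, 3231],
        "peak": [1555, 2241, 2800, 3821, 2681, 1905, 4415, 4363, 2379, 3220],
    },
    "18:20": {
        "base": [1558, 2392, 2845, 3766, 2926, 1975, 3726, 3305, 2218, 3257],
        "peak": [1546, 2397, 1994, 3819, 2737, 1911, 4461, 4364, 2348, 3265],
    },
}


def percent_change(before, after):
    """Relative change from before to after, in percent."""
    return (after - before) / before * 100.0


def deltas(kind):
    """Map benchmark name -> percent change from the 09:20 to the 18:20 run.

    kind is "base" or "peak".
    """
    before = RUNS["09:20"][kind]
    after = RUNS["18:20"][kind]
    return {
        name: percent_change(b, a)
        for name, b, a in zip(BENCHMARKS, before, after)
    }


if __name__ == "__main__":
    for kind in ("base", "peak"):
        # Largest drops first.
        for name, pct in sorted(deltas(kind).items(), key=lambda kv: kv[1]):
            print(f"{kind} {name:12s} {pct:+6.1f}%")
```

Under that assumption the largest base drop is 252.eon (about -8.7%),
and the largest peak drop is 176.gcc (about -28.8%).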

On megrez -march=native expands to

-march=bdver2 -mmmx -mno-3dnow -msse -msse2 -msse3 -mssse3 -msse4a -mcx16 
-msahf -mno-movbe -maes -mno-sha -mpclmul -mpopcnt -mabm -mlwp -mfma 
-mfma4 -mxop -mbmi -mno-bmi2 -mtbm -mavx -mno-avx2 -msse4.2 -msse4.1 
-mlzcnt -mno-rtm -mno-hle -mno-rdrnd -mf16c -mno-fsgsbase -mno-rdseed 
-mprfchw -mno-adx -mfxsr -mxsave -mno-xsaveopt -mno-avx512f -mno-avx512er 
-mno-avx512cd -mno-avx512pf -mno-prefetchwt1 -mno-clflushopt -mno-xsavec 
-mno-xsaves -mno-avx512dq -mno-avx512bw -mno-avx512vl -mno-avx512ifma 
-mno-avx512vbmi -mno-clwb -mno-pcommit --param l1-cache-size=16 --param 
l1-cache-line-size=64 --param l2-cache-size=2048 -mtune=bdver2
