https://gcc.gnu.org/bugzilla/show_bug.cgi?id=119596
--- Comment #1 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
This is 100% a tuning issue. The generic tuning is tuned for a generic target. You could use -mtune= to get better tuning for the processor you are using.