https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87455
--- Comment #2 from vekumar at gcc dot gnu.org ---
This tuning was intended to generate movups instead of movupd, since movups is 1 byte shorter than movupd. Maybe we should remove the xorps generation part.
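
For reference, here is a small sketch of where the 1-byte difference comes from. It assumes the legacy (non-VEX) SSE encodings and the register-to-register form with xmm0 and xmm1, both chosen only for illustration. The array names and the helper function are hypothetical and are not taken from GCC. The packed-double form carries a mandatory 0x66 prefix that the packed-single form does not need.

```c
#include <assert.h>
#include <stddef.h>

/* Legacy SSE encodings, register-to-register form (illustrative operands).
   ModRM 0xC1 = mod 11, reg 000 (xmm0), rm 001 (xmm1). */
static const unsigned char movups_xmm0_xmm1[] = { 0x0F, 0x10, 0xC1 };       /* movups %xmm1, %xmm0 */
static const unsigned char movupd_xmm0_xmm1[] = { 0x66, 0x0F, 0x10, 0xC1 }; /* movupd %xmm1, %xmm0 */

/* The packed-double form differs only by the mandatory 0x66 prefix. */
static_assert(sizeof movupd_xmm0_xmm1 - sizeof movups_xmm0_xmm1 == 1,
              "movupd is one byte longer than movups");

/* Hypothetical helper: bytes saved per instruction by choosing movups. */
static size_t bytes_saved_by_movups(void)
{
    return sizeof movupd_xmm0_xmm1 - sizeof movups_xmm0_xmm1;
}
```

The same prefix difference applies to xorps (0F 57) versus xorpd (66 0F 57), so the size argument on its own does not distinguish the move case from the xor case.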