https://gcc.gnu.org/bugzilla/show_bug.cgi?id=81071
--- Comment #3 from Jakub Jelinek <jakub at gcc dot gnu.org> --- See above, you should never use the builtins directly. You should be using _mm_load_ss or _mm_movedup_pd etc., or alternatively the generic vectors.