https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116425
--- Comment #4 from Jeffrey A. Law <law at gcc dot gnu.org> ---
Wouldn't these cases be better handled by doing the scalar load, splatting it
across the vector, and then doing the stores? No gather needed at all.
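
A rough C sketch of that suggestion, using GCC's generic vector extension. It
assumes the case in question is a gather whose lanes all read the same address
(the test case is not quoted in this comment, so that is a guess), and that
the source and destination do not overlap. The name fill_splat and the 4-lane
int vector are made up for illustration; this is not what the vectorizer
emits, just the shape of the idea:

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical 4 x int vector type (GCC/Clang vector extension). */
typedef int v4si __attribute__((vector_size(16)));

/* Store src[0] into dst[0..n-1] without any gather:
   one scalar load, one splat, then plain vector stores.
   Assumes dst and src do not alias (hence restrict). */
void fill_splat(int *restrict dst, const int *restrict src, size_t n)
{
    int s = src[0];            /* scalar load */
    v4si v = { s, s, s, s };   /* splat across the vector */
    size_t i = 0;

    for (; i + 4 <= n; i += 4)
        memcpy(dst + i, &v, sizeof v);  /* unaligned vector store */

    for (; i < n; i++)         /* scalar tail */
        dst[i] = s;
}
```

The load is hoisted out of the loop entirely, so the loop body is only stores.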