https://gcc.gnu.org/bugzilla/show_bug.cgi?id=65660
--- Comment #5 from Richard Biener <rguenth at gcc dot gnu.org> --- https://gcc.gnu.org/ml/gcc-patches/2015-04/msg00053.html C testcase: void Set (float d, float *data) { for (int i = 0; i < 8; i++) data[i] = d; } note that I didn't really verify it is that specific vectorization causing the slowdown. It just appears a lot in the diff. As said we should recognize that peeling for alignment is stupid here. Will produce a patch for that as well.