https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91069
--- Comment #6 from Richard Biener <rguenth at gcc dot gnu.org> ---

-march=x86-64 -mtune=generic (aka "default") is also broken, while haswell (AVX2) is fine. With the default settings, -O3 is enough to trigger the issue. Disabling either loop or SLP vectorization hides the issue. Inlining into Compute_Rotation_Transform can be reduced by not inlining MTimesA, MTranspose (MATRIX *) and MTimesB. Disabling the BIT_INSERT_EXPR generation in match.pd seems to fix it, though.
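
For readers unfamiliar with BIT_INSERT_EXPR: it is the GIMPLE operation that replaces a range of bits, such as one vector lane, inside a larger value. The sketch below only illustrates that semantics at the C level using GCC's vector extension. It is not taken from povray or from match.pd; the type name `v2df` and the helper `insert_lane1` are hypothetical. The `noinline` attribute is the same kind of per-function control that can be used to keep a function such as MTimesA from being inlined.

```c
/* Illustrative sketch only: not povray code and not the GIMPLE from this bug. */

/* Two-lane vector of doubles, declared with GCC's vector extension. */
typedef double v2df __attribute__((vector_size(16)));

/* Hypothetical helper: return v with lane 1 replaced by x.
   A constant-index lane update like this is the sort of operation
   GIMPLE can express as a BIT_INSERT_EXPR.
   noinline keeps the call out of its caller. */
__attribute__((noinline))
v2df insert_lane1(v2df v, double x)
{
    v[1] = x;   /* only lane 1 changes; lane 0 must be preserved */
    return v;
}
```

A correct compilation must leave lane 0 untouched and store x in lane 1. A miscompile involving such an insert would show up as a wrong value in one of the lanes.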