https://gcc.gnu.org/bugzilla/show_bug.cgi?id=108938
--- Comment #10 from Richard Biener <rguenth at gcc dot gnu.org> --- One original idea was to leverage VEC_PERM as well but then at least on x86 a vec_perm can expand to many instructions so costing will be difficult (and there's obviously cross register file movements)