https://gcc.gnu.org/bugzilla/show_bug.cgi?id=68707
--- Comment #17 from rguenther at suse dot de <rguenther at suse dot de> --- On December 14, 2015 4:44:13 PM GMT+01:00, "alalaw01 at gcc dot gnu.org" <gcc-bugzi...@gcc.gnu.org> wrote: >https://gcc.gnu.org/bugzilla/show_bug.cgi?id=68707 > >--- Comment #13 from alalaw01 at gcc dot gnu.org --- >In the longer term, is the issue here, that we aren't comparing costs >of SLP vs >load-lanes, right? We merely compare the cost of whichever of those >vectorization strategies we favour, permutes et al, vs leaving it in >scalar >code? Correct. And the cost of using SLP is not comparable to the cost of interleaving/load-lanes or rather it would almost always favor the latter. BTW, for the patch can you do a compare with SPEC or another benchmark you are usually testing for vectorizer performance?