O3-pr36098.c vectorized using VEC_PERM_EXPR rather than VEC_LOAD_LANES

rguenther at suse dot de Mon, 14 Dec 2015 12:36:34 -0800

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=68707

--- Comment #17 from rguenther at suse dot de <rguenther at suse dot de> ---
On December 14, 2015 4:44:13 PM GMT+01:00, "alalaw01 at gcc dot gnu.org"
<gcc-bugzi...@gcc.gnu.org> wrote:
>https://gcc.gnu.org/bugzilla/show_bug.cgi?id=68707
>
>--- Comment #13 from alalaw01 at gcc dot gnu.org ---
>In the longer term, is the issue here, that we aren't comparing costs
>of SLP vs load-lanes, right? We merely compare the cost of whichever of
>those vectorization strategies we favour, permutes et al, vs leaving it
>in scalar code?

Correct.  And the cost of using SLP is not comparable to the cost of
interleaving/load-lanes; or rather, such a comparison would almost always
favor the latter.
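
To illustrate the kind of access pattern under discussion, here is a
minimal sketch of a loop with an interleaved (stride-3) load group.  It is
a hypothetical example, not the contents of gcc.dg/vect/O3-pr36098.c, and
the function name split_rgb is made up for illustration.  On a target with
load-lanes support the vectorizer may pick VEC_LOAD_LANES for such a group;
otherwise it would typically fall back to contiguous vector loads followed
by VEC_PERM_EXPR.  Which one is chosen depends on the target and on the
GCC version.

```c
#include <stddef.h>

/* Hypothetical example (not the PR testcase): de-interleave packed
   3-channel data.  The three loads per iteration form one grouped,
   stride-3 access, which is what load-lanes or permutes would cover.  */
void split_rgb(const unsigned char *restrict in,
               unsigned char *restrict r,
               unsigned char *restrict g,
               unsigned char *restrict b,
               size_t n)
{
    for (size_t i = 0; i < n; i++) {
        r[i] = in[3 * i + 0];  /* first element of each group of three */
        g[i] = in[3 * i + 1];  /* second element */
        b[i] = in[3 * i + 2];  /* third element */
    }
}
```

Compiling something like this with -O3 -fdump-tree-vect-details and
searching the dump for LOAD_LANES or VEC_PERM_EXPR should show which
strategy was picked on a given target.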

BTW, for the patch, can you run a comparison with SPEC or another benchmark
you usually use to test vectorizer performance?

[Bug tree-optimization/68707] [6 Regression] testcase gcc.dg/vect/O3-pr36098.c vectorized using VEC_PERM_EXPR rather than VEC_LOAD_LANES
