https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104413

--- Comment #2 from Hongtao.liu <crazylht at gmail dot com> ---
(In reply to Hongtao.liu from comment #1)
> Just note pshufb needs SSE3, and it seems reasonable to define a SSE3
> broadcast pattern which generates pxor + pshufb.

w/ -msse4 gcc generates ideal instructions sequence

       movd    %edi, %xmm0
        pxor    %xmm1, %xmm1
        pshufb  %xmm1, %xmm0

Reply via email to