https://gcc.gnu.org/bugzilla/show_bug.cgi?id=104413
--- Comment #3 from Hongtao.liu <crazylht at gmail dot com> --- (In reply to Hongtao.liu from comment #2) > (In reply to Hongtao.liu from comment #1) > > Just note pshufb needs SSE3, and it seems reasonable to define a SSE3 > > broadcast pattern which generates pxor + pshufb. > It's SSSE3 not SSE3. :( So it looks optimal with -mssse3 movd %edi, %xmm0 pxor %xmm1, %xmm1; pshufb %xmm1, %xmm0