https://gcc.gnu.org/bugzilla/show_bug.cgi?id=43147
--- Comment #20 from Hongtao.liu <crazylht at gmail dot com> --- Fixed in GCC12, now gcc generate optimal codes. main: .LFB532: .cfi_startproc subq $8, %rsp .cfi_def_cfa_offset 16 movaps .LC0(%rip), %xmm0 call printv xorl %eax, %eax addq $8, %rsp .cfi_def_cfa_offset 8 ret .cfi_endproc .LFE532: .size main, .-main .section .rodata.cst16,"aM",@progbits,16 .align 16 .LC0: .long 1073741824 .long 1065353216 .long 1082130432 .long 1077936128 .ident "GCC: (GNU) 12.0.0 20210825 (experimental)" .section .note.GNU-stack,"",@progbits