https://gcc.gnu.org/bugzilla/show_bug.cgi?id=67325
--- Comment #5 from Marc Glisse <glisse at gcc dot gnu.org> --- (In reply to Andrew Pinski from comment #4) > So GCC is able to do it but rejects it because the cost is worse for some > reason. Indeed, and -Os produces the expected movl 4(%rdi), %eax (I did not benchmark)