https://gcc.gnu.org/bugzilla/show_bug.cgi?id=87421
Florian Weimer <fw at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
                 CC|                            |fw at gcc dot gnu.org

--- Comment #2 from Florian Weimer <fw at gcc dot gnu.org> ---
What's your exact target?

On x86-64-pc-linux-gnu, on function entry, %rsp + 8 is a multiple of 16, so
%rbp is a multiple of 16 throughout the function.  This means that %rbp - 48 is
a multiple of 16 as well, and the movdqa load is 16-byte-aligned.
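
The alignment argument above can be checked with plain integer arithmetic. The following is only a sketch: it assumes the function sets up a frame pointer with the usual "push %rbp; mov %rsp, %rbp" prologue (the comment does not show the prologue), and the helper names and the sample stack base address are hypothetical, chosen for illustration.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* True if addr is a multiple of 16, as an aligned movdqa operand must be. */
static bool is_16_aligned(uint64_t addr)
{
    return (addr & 15) == 0;
}

/* ABI precondition from the comment: at function entry, %rsp + 8 is a
   multiple of 16 (the call instruction has just pushed an 8-byte
   return address). */
static bool entry_rsp_is_abi_conformant(uint64_t entry_rsp)
{
    return is_16_aligned(entry_rsp + 8);
}

/* Assumed prologue "push %rbp; mov %rsp, %rbp": the push lowers %rsp
   by 8, and %rbp then receives that value. */
static uint64_t rbp_after_prologue(uint64_t entry_rsp)
{
    return entry_rsp - 8;
}

/* Walk over a range of conformant entry values and confirm that both
   %rbp and the slot at %rbp - 48 are 16-byte aligned. */
static void check_slot_alignment(void)
{
    const uint64_t base = 0x7ffd00000000ULL;   /* hypothetical stack address */

    for (uint64_t k = 1; k <= 1000; k++) {
        uint64_t entry_rsp = base + 16 * k - 8;
        assert(entry_rsp_is_abi_conformant(entry_rsp));

        uint64_t rbp = rbp_after_prologue(entry_rsp);
        assert(is_16_aligned(rbp));
        assert(is_16_aligned(rbp - 48));       /* 48 = 3 * 16 */
    }
}
```

If the caller violates the entry condition (for example, hand-written assembly that calls with a misaligned stack), the same arithmetic shows %rbp - 48 ending up 8 bytes off, which is the case where movdqa would fault.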