https://gcc.gnu.org/bugzilla/show_bug.cgi?id=63281
--- Comment #18 from Segher Boessenkool <segher at gcc dot gnu.org> ---
Yes, it is slow: five sequential, dependent integer instructions instead of a single load instruction. Depending on how you benchmark this, you may not see the slowness. The values are just stored to memory, which can happen many cycles later; that is entirely off the critical path and will not clog up any pipelines.
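
To make the dependency chain concrete, here is a rough C sketch of a five-step constant build, where each step consumes the result of the previous one. The constant value, the function names, and the mnemonics in the comments are assumptions for illustration only (the comment above says just "five sequential dependent integer instructions"), and an optimizing compiler will fold this C code into a single constant anyway.

```c
#include <stddef.h>
#include <stdint.h>

/* Hypothetical 64-bit constant, chosen only for illustration. */
#define K UINT64_C(0x123456789abcdef0)

/* Builds K in five steps; every step needs the previous result, so the
 * steps cannot overlap. The mnemonics are an assumed PowerPC64 mapping. */
uint64_t build_k_stepwise(void)
{
    uint64_t r;
    r = UINT64_C(0x1234) << 16;   /* lis:  set bits 16..31           */
    r |= UINT64_C(0x5678);        /* ori:  set bits 0..15            */
    r <<= 32;                     /* sldi: move into the upper half  */
    r |= UINT64_C(0x9abc) << 16;  /* oris: set bits 16..31           */
    r |= UINT64_C(0xdef0);        /* ori:  set bits 0..15            */
    return r;
}

/* Store-only use: nothing later waits on the stored value, so the
 * latency of producing K is hidden behind the store. */
void store_only(uint64_t *dst, size_t n)
{
    for (size_t i = 0; i < n; i++)
        dst[i] = K;
}
```

A benchmark shaped like store_only() would probably not show the cost of the five-instruction chain, because the only consumer of the value is a store that can complete much later.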