https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80881
--- Comment #56 from Julian Waters <tanksherman27 at gmail dot com> --- Ah, I see. I had been under the impression that gcc would see the parallel and realize that the 2 loads could be done at the same time. Since it can see that without the parallel anyway, and doing so allows gcc to emit more efficient code, I'll remove it (Not like the parallel approach worked anyway, it resulted in an unrecognizable insn...)