https://gcc.gnu.org/bugzilla/show_bug.cgi?id=63277
--- Comment #2 from Janne Grunau <janne-gcc at jannau dot net> --- It is not only the vcombine. The handling of the table vectors is even more dreadful. The loads are combined into properly paired registers. They are then moved, in reverse order, to different registers, only to be reassembled inside the loop into properly paired registers for vtbl2. See the attached arm_neon_excessive_vmov_wo_vcombine.c.
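
For readers without the attachment, here is a minimal sketch of the kind of loop being described. It is not the attached test case: the function name lookup16 and its parameters are hypothetical and chosen only for illustration. On ARMv7, vtbl2 needs its 16-byte table in two consecutive D registers, so a table loaded once before the loop should ideally stay in that pair for every iteration. The scalar branch models the same lookup (indices of 16 or more yield 0) so the sketch also builds on targets without NEON.

```c
#include <stddef.h>
#include <stdint.h>

#if defined(__ARM_NEON) || defined(__ARM_NEON__)
#include <arm_neon.h>
#endif

/* Hypothetical helper, not taken from the attachment: translate n index
 * bytes through a 16-entry table. Indices >= 16 produce 0, as vtbl2 does. */
void lookup16(uint8_t *dst, const uint8_t *idx,
              const uint8_t table[16], size_t n)
{
    size_t i = 0;

#if defined(__ARM_NEON) || defined(__ARM_NEON__)
    /* Load both table halves once, outside the loop. vtbl2 wants them in
     * a consecutive D register pair, which is where the loads put them. */
    uint8x8x2_t tbl;
    tbl.val[0] = vld1_u8(table);
    tbl.val[1] = vld1_u8(table + 8);

    /* Eight lookups per iteration; the table pair is loop invariant. */
    for (; i + 8 <= n; i += 8)
        vst1_u8(dst + i, vtbl2_u8(tbl, vld1_u8(idx + i)));
#endif

    /* Scalar tail, and the whole loop on targets without NEON. */
    for (; i < n; i++)
        dst[i] = idx[i] < 16 ? table[idx[i]] : 0;
}
```

Compiling something of this shape for 32-bit ARM with NEON enabled and inspecting the assembly for vmov instructions between the table loads and the vtbl in the loop body is one way to check whether a given compiler version shows the behaviour reported here.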