https://gcc.gnu.org/bugzilla/show_bug.cgi?id=102838
--- Comment #8 from Jakub Jelinek <jakub at gcc dot gnu.org> --- Ok, so, first question, is GOMP_HAVE_EFFICIENT_ALIGNED_ALLOC defined in your case? Can be seen e.g. from objdump -dr alloc.o and seeing if gomp_aligned_free is just a (tail?) call to free without anything else or if it performs if (ptr) free (((void **) ptr)[-1]); Or perhaps attach preprocessed loop-ull.i and loop-ull.s.