https://gcc.gnu.org/bugzilla/show_bug.cgi?id=101017
--- Comment #8 from Haochen Jiang <haochen.jiang at intel dot com> --- One potential solution is to let the resolver ISA level becomes the highest one in target_clones instead of the default one. Then it will not get the memory/register mismatch when passing/returning arguments.