With -march=cascadelake we use vpermilps instead of shufps. Tested on x86_64-unknown-linux-gnu, pushed.
PR tree-optimization/116258 * gcc.target/i386/pr116258.c: Also allow vpermilps. --- gcc/testsuite/gcc.target/i386/pr116258.c | 2 +- 1 file changed, 1 insertion(+), 1 deletion(-) diff --git a/gcc/testsuite/gcc.target/i386/pr116258.c b/gcc/testsuite/gcc.target/i386/pr116258.c index bd7d3a97b2c..cb67e4085c5 100644 --- a/gcc/testsuite/gcc.target/i386/pr116258.c +++ b/gcc/testsuite/gcc.target/i386/pr116258.c @@ -10,5 +10,5 @@ return (x + h(t)); } -/* { dg-final { scan-assembler-times "shufps" 1 } } */ +/* { dg-final { scan-assembler-times "shufps|permilps" 1 } } */ /* { dg-final { scan-assembler-not "unpck" } } */ -- 2.43.0