[Bug tree-optimization/116075] Inefficient SVE INSR codegen

2024-07-25 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116075 --- Comment #5 from Andrew Pinski --- Note the original testcase was fixed via PR 116081. Patch for the other issue found here: https://gcc.gnu.org/pipermail/gcc-patches/2024-July/658274.html

[Bug tree-optimization/116075] Inefficient SVE INSR codegen

2024-07-24 Thread rguenth at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116075 --- Comment #4 from Richard Biener --- fold_const_call misses VEC_SHL_INSERT folding (and probably many(?) others)

[Bug tree-optimization/116075] Inefficient SVE INSR codegen

2024-07-24 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116075 --- Comment #3 from Andrew Pinski --- Note I filed PR 116081 for maybe the real issue with the original testcase where there is a difference if you use a typedef or not. I am going to keep this for optimizing VEC_SHL_INSERT though because I have

[Bug tree-optimization/116075] Inefficient SVE INSR codegen

2024-07-24 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116075 --- Comment #2 from Andrew Pinski --- Short testcase that does not use the vectorizer:
```
#include <arm_sve.h>
svint8_t f(void)
{
  svint8_t tt;
  tt = svdup_s8 (0);
  tt = svinsr (tt, 0);
  return tt;
}
```
Note LLVM does not optimize the above to just `mov z3
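For readers without SVE hardware, the redundancy in the testcase above can be illustrated with a plain scalar model. This is only a sketch under stated assumptions: `model_insr`, `is_dup_of` and `insr_or_fold` are hypothetical names, not GCC or ACLE functions, and the explicit length `n` stands in for the length-agnostic SVE vector. It models INSR as "shift every lane up by one, put the scalar in lane 0" and shows that inserting `x` into a vector whose lanes all equal `x` changes nothing, which is why the `svinsr` call could in principle be dropped.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Scalar model of INSR / VEC_SHL_INSERT on a fixed-length vector:
   every lane moves up by one, the top lane is dropped, and `x`
   becomes lane 0.  The explicit length `n` is an assumption made
   for illustration; real SVE vectors are length-agnostic.  */
static void model_insr(int8_t *v, size_t n, int8_t x)
{
    if (n == 0)
        return;
    for (size_t i = n - 1; i > 0; i--)
        v[i] = v[i - 1];
    v[0] = x;
}

/* True when every lane of `v` equals `x`, i.e. `v` is a dup of `x`.  */
static bool is_dup_of(const int8_t *v, size_t n, int8_t x)
{
    for (size_t i = 0; i < n; i++)
        if (v[i] != x)
            return false;
    return true;
}

/* Hypothetical fold: inserting `x` into a dup of `x` leaves the
   vector unchanged, so the shift can be skipped.  Returns true when
   the insert was elided, false when it was actually performed.  */
static bool insr_or_fold(int8_t *v, size_t n, int8_t x)
{
    if (is_dup_of(v, n, x))
        return true;
    model_insr(v, n, x);
    return false;
}
```

Calling `insr_or_fold` on an all-zero array with `x = 0` takes the elided path and leaves the array all zero, mirroring `svdup_s8 (0)` followed by `svinsr (tt, 0)`; any non-uniform input falls back to the real shift-and-insert.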

[Bug tree-optimization/116075] Inefficient SVE INSR codegen

2024-07-24 Thread pinskia at gcc dot gnu.org via Gcc-bugs
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=116075

Andrew Pinski changed:

           What    |Removed                     |Added
           Severity|normal                      |enhancement
          Component|target