[Bug tree-optimization/91573] Vectorization failure for a loop to do multiply-add because SLP loads unnecessarily require permutation

2019-12-12 Thread avieira at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91573 avieira at gcc dot gnu.org changed: What|Removed |Added Status|NEW |RESOLVED CC|

[Bug tree-optimization/91573] Vectorization failure for a loop to do multiply-add because SLP loads unnecessarily require permutation

2019-08-28 Thread tnfchris at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91573 Tamar Christina changed: What|Removed |Added CC||tnfchris at gcc dot gnu.org --- Commen

[Bug tree-optimization/91573] Vectorization failure for a loop to do multiply-add because SLP loads unnecessarily require permutation

2019-08-28 Thread hliu at amperecomputing dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91573 --- Comment #5 from Hao Liu --- Great. It seems really a SLP issue. I've learnt a lot about vectorization, dump info and -march. Thanks for your help.

[Bug tree-optimization/91573] Vectorization failure for a loop to do multiply-add because SLP loads unnecessarily require permutation

2019-08-28 Thread rguenth at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91573 Richard Biener changed: What|Removed |Added Keywords||missed-optimization Status|

[Bug tree-optimization/91573] Vectorization failure for a loop to do multiply-add

2019-08-28 Thread pinskia at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91573 --- Comment #3 from Andrew Pinski --- t5.c:10:5: missed: cost model: the vector iteration cost = 12372 divided by the scalar iteration cost = 236 is greater or equal to the vectorization factor = 16. t5.c:10:5: missed: not vectorized: vectoriza

[Bug tree-optimization/91573] Vectorization failure for a loop to do multiply-add

2019-08-28 Thread crazylht at gmail dot com
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91573 Hongtao.liu changed: What|Removed |Added CC||crazylht at gmail dot com --- Comment #2 f

[Bug tree-optimization/91573] Vectorization failure for a loop to do multiply-add

2019-08-28 Thread pinskia at gcc dot gnu.org
https://gcc.gnu.org/bugzilla/show_bug.cgi?id=91573 --- Comment #1 from Andrew Pinski --- >2) If WIDTH is larger. e.g. #define WIDTH 16 Either we are unrolling the loop and not doing SLP on the following loop or it is a cost issue saying it is not profitable for some reason. Use -fno-vect-c