https://gcc.gnu.org/bugzilla/show_bug.cgi?id=80724

Andrew Pinski <pinskia at gcc dot gnu.org> changed:

           What    |Removed                     |Added
----------------------------------------------------------------------------
           Keywords|                            |missed-optimization
             Status|UNCONFIRMED                 |NEW
   Last reconfirmed|                            |2017-07-26
                 CC|                            |pinskia at gcc dot gnu.org
   Target Milestone|---                         |8.0
            Summary|gcc.target/aarch64/pr62178. |[8 Regression]
                   |c failed because of r247885 |gcc.target/aarch64/pr62178.
                   |                            |c failed because of r247885
     Ever confirmed|0                           |1

--- Comment #2 from Andrew Pinski <pinskia at gcc dot gnu.org> ---
The vectorized loop looks like:
.L7:
        ldr     w2, [x1, 4]!
        ldr     q1, [x0], 124
        fmov    s3, w2
        mla     v0.4s, v1.4s, v3.s[0]
        cmp     x0, x3
        bne     .L7


Which is worse than before in GCC 7:
.L7:
        ld1r    {v1.4s}, [x1], 4
        ldr     q2, [x0], 124
        mla     v0.4s, v2.4s, v1.4s
        cmp     x0, x2
        bne     .L7

Reply via email to