https://gcc.gnu.org/bugzilla/show_bug.cgi?id=28366

--- Comment #3 from Martin Sebor <msebor at gcc dot gnu.org> ---
What I meant by suboptimal is the eight vector stores when it seems that just
two instructions are needed to save the two vector registers that hold the
arguments like Clang does:

        addi 3, 1, -48
        addi 4, 1, -64
        stvx 3, 0, 3
        stvx 2, 0, 4
        addi 3, 1, -32
        nop
        lfs 0, -36(1)
        lfs 1, -52(1)
        lfs 2, -40(1)
        lfs 3, -56(1)
        lfs 4, -44(1)
        lfs 5, -60(1)
        lfs 12, -48(1)
        lfs 6, -64(1)
        fdivs 0, 1, 0
        fdivs 2, 3, 2
        fdivs 13, 5, 4
        fdivs 1, 6, 12
        stfs 0, -20(1)
        stfs 2, -24(1)
        stfs 13, -28(1)
        stfs 1, -32(1)
        nop
        lvx 2, 0, 3
        blr
        .long   0
        .quad   0

Reply via email to