https://gcc.gnu.org/bugzilla/show_bug.cgi?id=84844

--- Comment #5 from Jakub Jelinek <jakub at gcc dot gnu.org> ---
(In reply to Uroš Bizjak from comment #4)
> (In reply to Jakub Jelinek from comment #3)
> > Or shall I got for ?*v instead, or ???*v, or !???*v ?
> 
> Or we can revert PR78090 fix and use preferred_for_speed attribute again.
> This will allow the compiler to emit reg->xmm for cold parts.
> 
> BTW: we shouldn't just grep assembly dumps for unoptimal insns, as was done
> and reported in PR78090. There is no harm to emit reg->xmm form of the insn
> in the cold parts.
> 
> Based on this, I prefer the revert of PR78090 patch, perhaps with a run of
> 462.libquantum to assess the performance impact.

I can test the reversion, but don't have any setup for SPEC testing nor AMD HW,
so that would need to be done by somebody else.

Reply via email to