subject:"\[Bug target\/92549\] Use x86 xchg instruction more"

[Bug target/92549] Use x86 xchg instruction more

2021-12-21 Thread pinskia at gcc dot gnu.org via Gcc-bugs

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92549 Andrew Pinski changed: What|Removed |Added Target Milestone|--- |10.0 Status|ASSIGNED

[Bug target/92549] Use x86 xchg instruction more

2019-11-19 Thread jakub at gcc dot gnu.org

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92549 --- Comment #7 from Jakub Jelinek --- Author: jakub Date: Tue Nov 19 09:31:59 2019 New Revision: 278439 URL: https://gcc.gnu.org/viewcvs?rev=278439&root=gcc&view=rev Log: PR target/92549 * config/i386/i386.md (peephole2 for *swap

[Bug target/92549] Use x86 xchg instruction more

2019-11-18 Thread segher at gcc dot gnu.org

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92549 --- Comment #6 from Segher Boessenkool --- (In reply to Richard Biener from comment #2) > But it looks like x86 has exactly patterns like this - but in this case > I guess combine won't ever try this because hardregs are invovled > (not sure if i

[Bug target/92549] Use x86 xchg instruction more

2019-11-18 Thread jakub at gcc dot gnu.org

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92549 --- Comment #5 from Jakub Jelinek --- Further info on the topic: https://stackoverflow.com/questions/45766444/why-is-xchg-reg-reg-a-3-micro-op-instruction-on-modern-intel-architectures

[Bug target/92549] Use x86 xchg instruction more

2019-11-18 Thread jakub at gcc dot gnu.org

https://gcc.gnu.org/bugzilla/show_bug.cgi?id=92549 --- Comment #4 from Jakub Jelinek --- E.g. Agner Fog has in the tables for Atom mov r,r 1uops, latency 1, rec. throughput 1/2, while for xchg r,r 3uops, latency 6, rec. throughput 6. It doesn't look beneficial speed wise then. Though, even say on