On Fri, Feb 03, 2017 at 04:19:58PM +0000, Ramana Radhakrishnan wrote:
> > > Would it be acceptable for those users to have loads that perform like
> > > CAS loops, especially under contention?  Or are these users more
> > > concerned about aarch64 not offering a true atomic 16-byte load?
> >
> > Can the store you need for atomicity be into an automatic var on the stack?
>
> No, it has to be to the same location.

But then it has the same problem as using cmpxchg16b on x86_64: the
location could be read-only, or the load could be too slow for what
users expect of an atomic load.

	Jakub
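
To illustrate the read-only concern, here is a minimal sketch of a load
built only from compare-and-swap.  The helper name load_via_cas is
hypothetical, and uint64_t stands in for the 16-byte type so that the
sketch builds without -mcx16 or libatomic; it is not how GCC or libatomic
actually implement the 16-byte case.

```c
#include <stdatomic.h>
#include <stdint.h>

/* Hypothetical helper: read *loc using only compare-and-swap.
 * Assumption: uint64_t is a stand-in for the 16-byte type discussed
 * in this thread, chosen so the sketch builds on any C11 compiler. */
static uint64_t load_via_cas(_Atomic uint64_t *loc)
{
    uint64_t expected = 0;

    /* If *loc == 0, the CAS succeeds and stores 0 back (value unchanged).
     * Otherwise it fails and copies the current value into 'expected'.
     * Either way 'expected' ends up holding the value seen in *loc. */
    atomic_compare_exchange_strong(loc, &expected, (uint64_t)0);
    return expected;
}
```

The caller only ever wants to read, yet the parameter cannot be
const-qualified: the compare-and-swap needs write access to the location
even when the stored value does not change.  That is why an object placed
in read-only memory cannot be loaded this way, and why such a load costs
as much as a locked read-modify-write.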