On Fri, May 05, 2023 at 12:18:12PM -0700, Palmer Dabbelt wrote:
> On Fri, 05 May 2023 11:55:31 PDT (-0700), Andrea Parri wrote:
> > On Fri, May 05, 2023 at 10:12:56AM -0700, Patrick O'Neill wrote:
> > > The RISC-V Ztso extension currently has no effect on generated code.
> > > With the additional ordering constraints guarenteed by Ztso, we can emit
> > > more optimized atomic mappings than the RVWMO mappings.
> > > 
> > > This patch implements Andrea Parri's proposed Ztso mappings ("Proposed
> > > Mapping").
> > >   
> > > https://github.com/preames/public-notes/blob/master/riscv-tso-mappings.rst
> > > 
> > > LLVM has implemented this same mapping (Ztso is still behind a
> > > experimental flag in LLVM, so there is *not* a defined ABI for this yet).
> > >   https://reviews.llvm.org/D143076
> > 
> > Given the recent patches/discussions, it seems worth pointing out the
> > the Ztso mappings referred to above was designed to be compatible with
> > the mappings in Table A.6 and that they are _not_ compatible with the
> > mappings in Table A.7 or with a "subset" of A.7 (even assuming RVTSO).
> 
> I guess that brings up the question of what we should do about WMO/TSO
> compatibility.  IIUC the general plan has been that WMO binaries would be
> compatible with TSO binaries when run on TSO systems, and that TSO binaries
> would require TSO systems.
> 
> I suppose it would be possible to have TSO produce binaries that would run
> on WMO systems by just emitting a bunch of extra fences, but I don't think
> anyone wants that?
> 
> We've always just assumed that WMO binaries would be compatible with TSO
> binaries, but I don't think it's ever really been concretely discussed.
> Having an ABI break here wouldn't be the craziest idea as it'd let us fix
> some other issues, but that'd certainly need to be pretty widely discussed.
> 
> Do we have an idea of what A.7-compatible TSO mappings would look like?

As in riscv-tso-mappings.rst but with

  atomic_store(memory_order_seq_cst)  |  s{b|h|w|d} ; fence rw,rw

would be A.7-compatible: call the resulting mappings "A.6-tso".

A.6-tso is (also) compatible with the following subset of A.7:

C/C++ Construct                                 | A.7-tso Mapping
------------------------------------------------------------------------------
Non-atomic load                                 | l{b|h|w|d}
atomic_load(memory_order_relaxed                | l{b|h|w|d}
atomic_load(memory_order_acquire)               | l{b|h|w|d}
atomic_load(memory_order_seq_cst)               | l{b|h|w|d}.aq
------------------------------------------------------------------------------
Non-atomic store                                | s{b|h|w|d}
atomic_store(memory_order_relaxed)              | s{b|h|w|d}
atomic_store(memory_order_release)              | s{b|h|w|d}
atomic_store(memory_order_seq_cst)              | s{b|h|w|d}.rl
------------------------------------------------------------------------------
atomic_thread_fence(memory_order_acquire)       | nop
atomic_thread_fence(memory_order_release)       | nop
atomic_thread_fence(memory_order_acq_rel)       | nop
atomic_thread_fence(memory_order_seq_cst)       | fence rw,rw
------------------------------------------------------------------------------
C/C++ Construct                                 | RVTSO AMO Mapping
atomic_<op>(memory_order_relaxed)               | amo<op>.{w|d}
atomic_<op>(memory_order_acquire)               | amo<op>.{w|d}
atomic_<op>(memory_order_release)               | amo<op>.{w|d}
atomic_<op>(memory_order_acq_rel)               | amo<op>.{w|d}
atomic_<op>(memory_order_seq_cst)               | amo<op>.{w|d}
------------------------------------------------------------------------------
C/C++ Construct                                 | RVTSO LR/SC Mapping
atomic_<op>(memory_order_relaxed)               | loop: lr.{w|d} ; <op> ;
                                                |       sc.{w|d} ; bnez loop
atomic_<op>(memory_order_acquire)               | loop: lr.{w|d} ; <op> ;
                                                |       sc.{w|d} ; bnez loop
atomic_<op>(memory_order_release)               | loop: lr.{w|d} ; <op> ;
                                                |       sc.{w|d} ; bnez loop
atomic_<op>(memory_order_acq_rel)               | loop: lr.{w|d} ; <op> ;
                                                |       sc.{w|d} ; bnez loop
atomic_<op>(memory_order_seq_cst)               | loop: lr.{w|d}.aq ; <op> ;
                                                |       sc.{w|d}.rl ; bnez loop

  Andrea

Reply via email to