Re: [cxx-mem-model] compare_exchange implementation

Andrew MacLeod Wed, 19 Oct 2011 08:05:33 -0700

On 10/18/2011 06:25 PM, Andrew MacLeod wrote:

It's impossible to implement a weak compare and swap unless you return both parameters in one operation.
The compare_exchange matches the atomic interface for C++, and if we can't resolve it with a lock-free instruction sequence, we have to leave an external call with this format to a library, so that is why I provide this built-in.
Neither rth nor I like the addressable parameter, so that's why I left the RTL pattern for weak and strong compare and swap without that addressable argument, and let this builtin generate wrapper code around it.
You could provide an __atomic version of the bool and val routines with memory model.... we toyed with making the compare_and_swap return both values so we could implement a weak version, but getting 2 return values is not pretty. It could be done with 2 separate built-ins that relate to each other, but that's not great either.
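
To make the wrapper idea concrete, here is a rough sketch (mine, not the code on the branch) of a value-returning strong CAS with the by-address interface layered on top. The names cas_val and compare_exchange_wrapper are made up for illustration, and std::atomic stands in for the RTL pattern. Note the wrapper is only correct for the strong form: a weak CAS can fail spuriously while memory still equals the expected value, so the returned value alone cannot tell success from failure.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical value-returning strong CAS: returns the value observed in memory.
// (Illustrative stand-in for an RTL pattern with no addressable argument.)
int cas_val(std::atomic<int>& mem, int expected, int desired) {
    // On success `expected` is untouched (it equals the old value);
    // on failure it is overwritten with the current contents of `mem`.
    mem.compare_exchange_strong(expected, desired);
    return expected;
}

// Hypothetical wrapper giving the C++-style interface: bool result,
// with *expected updated to the observed value on failure.
bool compare_exchange_wrapper(std::atomic<int>& mem, int* expected, int desired) {
    assert(expected != nullptr);
    int old = cas_val(mem, *expected, desired);
    if (old == *expected)
        return true;      // strong CAS: a matching value means the store happened
    *expected = old;      // report what was actually in memory
    return false;
}
```

The `int* expected` in the wrapper is exactly the addressable parameter we would like to keep out of the underlying pattern.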

I've thought about various longer term schemes, but haven't been able to settle on one I really like. Ideally, if we know we can generate lock-free instructions, we expose the wrapper code to the tree optimizers.


I've considered:

1) adding tree support for a CAS primitive which has 2 results.. that's pretty invasive but has nice features.

2) a 2 part built-in.. one which returns a value, and a second one which takes that value and then returns the boolean, i.e.

   val = __atomic_compare_and_swap (&mem, expected, desired, model)
   if (__atomic_compare_and_swap_success (val))
     ...

and during expansion from SSA to RTL, you look for uses of the result of __atomic_compare_and_swap in __atomic_compare_and_swap_success, and you can decide what RTL pattern to use and 'merge' the 2 builtins into one pattern. You can optimize the RTL pattern used based on what, if any, other uses there are of the 2 results. I think this should work OK...
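
Purely to illustrate the shape of option 2, here is a library-level mock-up. Everything in it is an assumption: CasResult, atomic_compare_and_swap and atomic_compare_and_swap_success are invented names, and the struct carries the success flag explicitly, whereas in the compiler the pairing would be resolved when the two builtins are merged during expansion. It uses the weak form, which is the case that needs both results.

```cpp
#include <atomic>
#include <cassert>

// Hypothetical result of the first builtin. In this mock-up the success flag
// rides along with the value; a compiler would instead track the pairing.
struct CasResult {
    int value;     // value observed in memory
    bool success;  // whether the store of `desired` happened
};

// Mock-up of the value-returning half (weak CAS, may fail spuriously).
CasResult atomic_compare_and_swap(std::atomic<int>* mem, int expected, int desired,
                                  std::memory_order model) {
    assert(mem != nullptr);
    bool ok = mem->compare_exchange_weak(expected, desired, model);
    // After the call `expected` holds the observed value in both outcomes.
    return CasResult{expected, ok};
}

// Mock-up of the second half: takes the first result and yields the boolean.
bool atomic_compare_and_swap_success(const CasResult& r) {
    return r.success;
}
```

A caller that only tests atomic_compare_and_swap_success never looks at the value, which is the kind of usage information the expander could exploit when picking the pattern.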

3) Waiting for a flash of brilliance (may never come) or <insert your suggestion> :-)

I decided for the moment to punt on exploring those and give us more time to think about the best way to do it. We do need to be able to call this specific interface for external library calls, but we are not locked into this for inline expansion of lock-free instructions. In c-common.c where we turn __atomic_exchange_compare into __atomic_compare_exchange_{1,2,4,8,16}, we can instead turn it into a code sequence using a new __atomic_compare_and_swap builtin or tree code. The wrapper code that is currently emitted as RTL could then be emitted as tree expressions before the SSA optimizers see anything. So no later than the next release of GCC, I would expect to have a fully fleshed out solution that gives us all the nice inlines and removes addr-taken flags and such. I just don't feel like there is time at the moment to make the correct decision while I'm trying to get the library ABI right.
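
For reference, a small sketch of what a caller of the by-address interface looks like. It assumes the __atomic_compare_exchange_n and __atomic_load_n builtins in the form released GCC (and Clang) provide them, which may not match the branch exactly; fetch_add_via_cas is just an illustrative helper name.

```cpp
#include <cassert>

// Atomically add `delta` to *counter with a weak CAS retry loop and
// return the value that was there before the add.
int fetch_add_via_cas(int* counter, int delta) {
    assert(counter != nullptr);
    int expected = __atomic_load_n(counter, __ATOMIC_RELAXED);
    // `expected` is passed by address: on failure (including a spurious one)
    // the builtin refreshes it with the current contents, so the loop simply
    // recomputes expected + delta and retries.
    while (!__atomic_compare_exchange_n(counter, &expected, expected + delta,
                                        /*weak=*/true,
                                        __ATOMIC_SEQ_CST, __ATOMIC_RELAXED)) {
    }
    return expected;  // unchanged on success, i.e. the old value
}
```

Taking &expected here is what forces the local to be addressable unless the wrapper is exposed early enough for the optimizers to undo it.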


If I find some time, I may experiment with #2 next week.

Andrew
