Re: basic asm and memory clobbers

Jeff Law Tue, 17 Nov 2015 13:32:43 -0800

On 11/16/2015 10:55 PM, David Wohlferd wrote:


- There is no standard that says it must do this.

True. But these after all are extensions and extensions have beennotoriously under-documented through the years.

- I'm only aware of 1 person who has ever asked for this change. And the
request has been deemed so unimportant it has languished for a very long
time.

True. But I'd say for this case it means it just hasn't been highenough on anyone's priority list to get resolved. I wouldn't be at allsurprised if Richard filed this bug to ensure that it didn't get lost.That's standard development procedure for items we notice, but aren'tactively working on.

It's unfortunate that Andrew muddied the waters. Andrew's conclusions,particularly in c#6 are simply wrong.

- There is a plausible work-around with extended asm, which (mostly) has
clear semantics regarding clobbers.

Converting an old-style asm to extended asm can be painful. ANd in thecase of legacy code the conversion process itself is a potential sourceof bugs.

- While the change probably won't introduce bad code, if it does it will
be in ways that are going to be difficult to track down, in an area
where few have the expertise to debug.
- Existing code that currently does things 'right' (ie push/pop any
modified registers) will suddenly be doing things 'wrong,' or at least
wastefully.
- Other than top-level asm, it seems like every existing basic asm will
(probably) get a new performance penalty (memory usage + code size +
cycles) to allow for situations they may already be handling correctly
or that don't apply.
True, these aren't particularly compelling reasons to not make the
change.  But I also don't see any compelling benefits to offset them.

The benefit is traditional asms do the expected thing. With no way todescribe dataflow, the only rational behaviour for a traditional asm isthat it has to be considered a use/clobber of memory and hard registers.

The fact that it wasn't documented that way eons ago is simply adocumentation bug -- likely due to the fact that back when thedocumentation for traditional asms was written, there were virtually nooptimizations of memory referencing instructions -- essentially folksdidn't ponder (much less document) how these asms would interact withmemory.

In fact, if you go back to the change I made back in 1999 referenced bythis BZ, you'll find that we had a mis-compilation of code around an ASMby a pass to remove redundant stores that had just been significantlyimproved.


For existing users, presumably they have already found whatever solution
they need and will just be annoyed that they have to revisit their code
to see the impact of this change.  Will they need to #if to ensure
consistent performance/function between gcc versions?  For future users,
they will have the docs telling them the behavior, and pointing them to
the (now well documented) extended asm.  Where's the benefit?

Existing users have to change nothing when we fix 24414. The wholepoint behind 24414 is to point out a case where we are not honoring theuses/clobbers all hard regs and memory semantics of traditional asms.


But changing this so gcc tries (probably futilely) to emulate other
implementations of asm...  That seems like a weak case to support a
change to this long-time behavior.  Unless there are other benefits I'm
just not seeing?

When we fix 24414 by honoring the "uses/clobbers all hard registers andmemory" semantics for old-style asms, those old-style asms will be*less* likely to cause problems in the presence of ever-improvingoptimization techniques.


--------------
Ok, that's my best shot.  You have way more expertise and experience
here than I do, so I expect that after you think it over, you'll make
the right call.  And despite my attempt here to defend the opposite
side, I'm not entirely sure what the right call is.  But these seem like
the right questions.

Either way, let me know if I can help.

About the only immediate task would be to ensure that the documentationfor traditional asms clearly documents the desired semantics and somehownote that there are known bugs in the implementation (ie 24414, handlingof flags registers, and probably other oddities)

Whether that means clobbering memory or not, I don't much care -- with
the status quo, if you want your asm to clobber memory you have to use
extended asm; if basic asm is made to clobber memory, if you want your
asm to *not* clobber memory you have to use extended asm (which you
can with no operands by writing e.g.  asm("bork" : );  ).  So both
behaviours are available whether we make a change or not.

But changing things now will likely break user code.

Having an traditional asm clobber memory should not break user code.
It may pessimize it slightly, but if it does, that code was already
broken.


How much pessimism are we talking here?  Wouldn't clobbering everything
effectively force the reloading of (some? most? all?) registers?  And
more memory will be needed to store things that used to just require
registers?  Along with a few more memory writes?  A single line of basic
asm, even a comment, could have a non-trivial impact on the code that
gets generated.

Essentially it means that old style asms become a point where thecompiler has to assume that memory and hard registers areread/clobbered. Without going into all the details the asm essentiallyinvalidates information the compiler might be tracking about the valuein memory locations or hard registers.

So given two stores to the same memory location on opposite sides of thetraditional asm, the compiler is _not_ allowed to remove the first store(because the traditional asm might read the value). Nor is the compilerallowed to remove the second store (because the asm might have stored avalue into that location).


Similar situations occur when reading memory locations.


One common use I've seen for basic asm is "int $3" on x86 to break into
the debugger (the basic asm docs use this as a sample). Changing this to
a clobber-everything will make what used to be a minimally intrusive way
to debug code into a high impact operation that may obscure the very
thing being debugged.

Actually in that specific case, ensuring everything is consistent isactually a *good* thing. And I suspect it's still a lot less intrusivethan you might think.

+Basic @code{asm} statements are not treated as though they used a

"memory"
+clobber, although they do implicitly perform a clobber of the flags
+(@pxref{Clobbers}).

They do not clobber the flags.  Observe:


Ouch.  i386 shows the same thing for basic asm.

Sadly, I suspect this isn't consistent across targets.


Bigger ouch.  I'll follow up on this after the discussion about changing
basic asm is complete (which may render this moot).

It likely depends on how the target models the flags.

jeff

Re: basic asm and memory clobbers

Reply via email to