https://gcc.gnu.org/bugzilla/show_bug.cgi?id=85381
Tom de Vries <vries at gcc dot gnu.org> changed: What |Removed |Added ---------------------------------------------------------------------------- Attachment #43992|0 |1 is obsolete| | --- Comment #9 from Tom de Vries <vries at gcc dot gnu.org> --- Created attachment 43999 --> https://gcc.gnu.org/bugzilla/attachment.cgi?id=43999&action=edit Tentative patch inserting dummy instructions (In reply to Tom de Vries from comment #7) > We may wanna insert dummy ops inbetween (it would be nice if something less > heavy than a membar.cta will work). This patch implements this approach. I've reported the bug at nvidia and see if they come up with a better or more restricted workaround.