On 12/02/15 09:24, Jakub Jelinek wrote:
> On Wed, Dec 02, 2015 at 09:23:11AM -0500, Nathan Sidwell wrote:
>> On 12/02/15 09:22, Jakub Jelinek wrote:
>>> I believe Alex' testing revealed that if you take address of the same .local
>>> objects in several threads, the addresses are the same, and therefore you
>>> refer to your own .local space rather than the other thread's.
>>
>> Before or after applying cvta?
>
> I'll let Alex answer that.

Never mind, I've run an experiment, and it appears that local addresses
converted to generic do give the same value regardless of the executing thread.
I guess that means the mapping of a local address to physical memory is done
late, at the load/store insn, rather than in the cvta insn.
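
For anyone wanting to repeat that kind of experiment, here is a minimal CUDA
sketch (not the test I actually ran). The kernel name record_local_addr and the
variable names are made up for illustration, and it assumes the compiler really
does place the address-taken volatile variable in .local memory and converts
its address to generic before storing it:

```cuda
#include <cstdio>
#include <cstdint>
#include <cuda_runtime.h>

// Hypothetical kernel: each thread records the generic address of one of
// its own locals.  Assumption: volatile plus address-taken keeps 'slot'
// in .local memory rather than in a register.
__global__ void record_local_addr(unsigned long long *out)
{
    volatile int slot = threadIdx.x;
    out[threadIdx.x] = (unsigned long long)(uintptr_t)&slot;
}

int main()
{
    const int nthreads = 32;
    unsigned long long host[nthreads];
    unsigned long long *dev = nullptr;

    if (cudaMalloc(&dev, sizeof(host)) != cudaSuccess)
        return 1;

    record_local_addr<<<1, nthreads>>>(dev);

    // cudaMemcpy waits for the kernel to finish before copying.
    if (cudaMemcpy(host, dev, sizeof(host), cudaMemcpyDeviceToHost)
        != cudaSuccess)
        return 1;
    cudaFree(dev);

    // Compare every thread's recorded address against thread 0's.
    int same = 1;
    for (int i = 1; i < nthreads; i++)
        if (host[i] != host[0])
            same = 0;

    printf("thread 0 address: %#llx\n", host[0]);
    printf("all threads report the same address: %s\n", same ? "yes" : "no");
    return 0;
}
```

If the addresses compare equal, a pointer to a local object handed to another
thread would resolve to that other thread's own .local space.
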
When I added routine support, I did wonder whether the calling routine would
need to clone its stack frame, but decided against it, based on the reasoning
I gave earlier.
nathan