Re: [PATCH] libgomp: Fix hang when profiling OpenACC programs with CUDA 9.0 nvprof

2020-07-14 Thread Thomas Schwinge
Hi Kwok! On 2020-07-13T16:29:14+0100, Kwok Cheung Yeung wrote: > When the version of nvprof in CUDA 9.0 is run on an OpenACC program, [...] the > program deadlocks. > I have added a testcase that sets up the situation presented by nvprof. Thanks. I have extended this one a little bit, to add s

[PATCH] libgomp: Fix hang when profiling OpenACC programs with CUDA 9.0 nvprof

2020-07-13 Thread Kwok Cheung Yeung
Hello (This patch was previously posted for OG7 at: https://gcc.gnu.org/pipermail/gcc-patches/2018-February/494594.html). When the version of nvprof in CUDA 9.0 is run on an OpenACC program, it sets up a callback that is called on device initialization. Inside the callback, it calls the acc_