On 10/20/2015 08:34 PM, Alexander Monakov wrote:
> This patch series ports enough of libgomp.c to get warp-level parallelism working for OpenMP offloading. The overall approach is as follows.
Could you elaborate a bit on what you mean by this, just so we understand each other in terms of terminology? "Warp-level" sounds to me like you have all threads in a warp executing in lockstep at all times. If individual threads can take different paths, I'd expect it to be called thread-level parallelism or something like that.
What is your end goal in terms of mapping GPU parallelism onto OpenMP?

Bernd