On 06/01/2015 08:58 AM, Bernd Schmidt wrote: > This extends the previous vector-single support to also handle > worker-level predication. We can't use the shfl insn because workers > will live across multiple warps, so we use a location in memory to > broadcast the branch target. > This also fixes the oversight where basic blocks inside a parallel > region but outside all loops weren't being predicated. > > A special case is added for worker-single vector-partitioned; we add a > jump over the entire loop that is taken by the inactive workers and add > no predication inside this loop. > > Committed on gomp-4_0-branch.
Thanks. This fixed the problems that I was seeing with variables outside of acc loops. I see that calls are being predicated at the moment. Those will need special handling once we tackle acc routines. Cesar