On 02/02/2022 15:39, Tobias Burnus wrote:
> On 09.08.21 15:55, Tobias Burnus wrote:
>> Now that the GCN/OpenACC patches for this have been committed today,
>> I think it makes sense to add it to the documentation.
>> (I was told that some follow-up items are still pending, but as
>> the feature does work ...)
> I think the follow-up patches have now been committed.
> How about the attached patch?
We should probably add a qualification "(up to a limit of 40 wavefronts
in total, per CU)"; otherwise we're suggesting that there can be 40
workgroups of 16 wavefronts each, which the hardware will not do.
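
To make the arithmetic concrete, here is a rough sketch. It is not taken
from the patch: the helper name is hypothetical, and it assumes that only
whole workgroups are resident on a CU and that the two limits combine as a
simple minimum.

```python
# Hypothetical helper for illustration only; not part of GCC or libgomp.
MAX_WORKGROUPS_PER_CU = 40         # workgroup figure from the text under review
MAX_WAVEFRONTS_PER_CU = 40         # total wavefront limit per CU
MAX_WAVEFRONTS_PER_WORKGROUP = 16  # wavefront limit per workgroup


def max_resident_workgroups(wavefronts_per_workgroup: int) -> int:
    """Return how many workgroups of the given size fit on one CU.

    Assumes whole workgroups only and that both limits apply at once.
    """
    if not 1 <= wavefronts_per_workgroup <= MAX_WAVEFRONTS_PER_WORKGROUP:
        raise ValueError("wavefronts_per_workgroup must be between 1 and 16")
    # The per-CU wavefront total caps how many workgroups of this size fit.
    by_wavefronts = MAX_WAVEFRONTS_PER_CU // wavefronts_per_workgroup
    return min(MAX_WORKGROUPS_PER_CU, by_wavefronts)


if __name__ == "__main__":
    for size in (1, 4, 16):
        print(size, max_resident_workgroups(size))
```

Under those assumptions, workgroups of 16 wavefronts come out at 2 per CU
rather than 40, which is why the qualification seems worth spelling out.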
Andrew