On 02/02/2022 15:39, Tobias Burnus wrote:
On 09.08.21 15:55, Tobias Burnus wrote:
Now that the GCN/OpenACC patches for this have been committed today,
I think it makes sense to add it to the documentation.
(I was told that some follow-up items are still pending, but as
the feature does work ...)

I think the follow-up patches have now been committed.
How about the attached patch?

We should probably add a qualification "(up to a limit of 40 wavefronts in total, per CU)" or else were suggesting that there can be 40 workgroups of 16 wavefronts, which the hardware will not do.

Andrew

Reply via email to