Now that the GCN/OpenACC patches for this have been committed today, I think it makes sense to add it to the documentation. (I was told that some follow-up items are still pending, but as the feature does work ...)
Cf. also Andrew's talk of last year, https://linuxplumbersconf.org/event/7/contributions/749/attachments/560/988/AMD_GCN_Update_-_LPC_2020.pdf which I utilized when writing the attached wwwdocs patch. Comments and/or suggestions? Tobias ----------------- Siemens Electronic Design Automation GmbH; Anschrift: Arnulfstraße 201, 80634 München; Gesellschaft mit beschränkter Haftung; Geschäftsführer: Thomas Heurung, Frank Thürauf; Sitz der Gesellschaft: München; Registergericht München, HRB 106955
gcc-12/changes.html (GCN): >1 workers per gang diff --git a/htdocs/gcc-12/changes.html b/htdocs/gcc-12/changes.html index 9c2799cf..05e24737 100644 --- a/htdocs/gcc-12/changes.html +++ b/htdocs/gcc-12/changes.html @@ -119,10 +119,14 @@ <h3 id="amdgcn">AMD Radeon (GCN)</h3> <ul> <li>Debug experience with ROCGDB has been improved.</li> <li>Support for the type <code>__int128_t</code>/<code>integer(kind=16)</code> was added.</li> + <li>When used as OpenACC device: the limitation of 1 worker per gang, 2 gangs + per CU has been lifted; now up to 16 workers per gang and 40 gangs per CU + are supported. (Except that the hardware limit of 40 workers total may + not be exceeded.)</li> </ul> <!-- <h3 id="arc">ARC</h3> --> <!-- <h3 id="arm">ARM</h3> -->