On 22/10/2024 17:29, Tobias Burnus wrote:
Andrew Stubbs wrote:
I'm going to push the base patch shortly.
… which happened in commit r15-4540-ga6b26e5ea09779.

Updated patch attached.

Some more testing showed that there was an issue with the builtin defines; this has been fixed, and the defines have been extended as well.

In particular, it now also includes the __GFX11__ macro define as architecture family.

Thanks, this looks OK now.


Tobias

PS: I find it surprising which out-of-bounds writes remain undetected on one machine and show up prominently on the other.

The sizeof/strlen thing? I never saw any trouble from that. :(


PPS: If more macros are desired, clang also defines __AMDGCN_WAVEFRONT_SIZE__ (i.e. 32 or 64) and __AMDGCN_CUMODE__ (1 = CU mode, 0 = WGP).

I don't know what these are for, but if we want to have them then I suppose it's harmless.
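
For illustration only, a minimal sketch of how device code might consume the macros mentioned above. The helper names (is_gfx11, wavefront_size_or, cu_mode_or) are hypothetical and not part of the patch. The sketch assumes only what the thread states: __GFX11__ is defined for the GFX11 architecture family, and clang defines __AMDGCN_WAVEFRONT_SIZE__ (32 or 64) and __AMDGCN_CUMODE__ (1 = CU mode, 0 = WGP). Every use is guarded with #ifdef, so the code also compiles where a macro is absent.

```c
/* Hypothetical helpers; none of these names come from the patch.  */

/* 1 when compiled for the GFX11 architecture family, else 0.  */
static int
is_gfx11 (void)
{
#ifdef __GFX11__
  return 1;
#else
  return 0;
#endif
}

/* Compile-time wavefront size if the compiler provides it (clang
   defines it as 32 or 64), otherwise the caller-supplied fallback.  */
static int
wavefront_size_or (int fallback)
{
#ifdef __AMDGCN_WAVEFRONT_SIZE__
  (void) fallback;
  return __AMDGCN_WAVEFRONT_SIZE__;
#else
  return fallback;
#endif
}

/* 1 for CU mode, 0 for WGP mode if the compiler provides the macro,
   otherwise the caller-supplied fallback.  */
static int
cu_mode_or (int fallback)
{
#ifdef __AMDGCN_CUMODE__
  (void) fallback;
  return __AMDGCN_CUMODE__;
#else
  return fallback;
#endif
}
```

With a compiler that defines none of these macros, is_gfx11 () returns 0 and the other two helpers return whatever fallback the caller passes in.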

Andrew
