jhuber6 wrote:

I think the current use of this intrinsic in 'generic' IR is sound so long as 
it's not guarding anything ABI related. Right now it's pretty much only used 
for loop bounds and array offsets. Long-term, though, I agree that it's 
probably most sound to just ship these as separate builds, but that would 
require all of our device runtime stuff going through my interface / using 
multilibs.
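
As a rough illustration of that distinction (a sketch, not code from the PR): `lane_count()` below is a hypothetical host-side stand-in for the intrinsic, and `sum_strided` is an invented helper. The point is only that a loop stride or array offset leaves the function signature and data layout unchanged, whereas a type whose size depends on the value would not.

```cpp
#include <cstddef>
#include <vector>

// Hypothetical stand-in for the target-dependent intrinsic discussed above.
// The fixed value 64 is an assumption so that the sketch compiles on a host.
static unsigned lane_count() { return 64; }

// Benign use: the value only steers the loop stride and the index.
// Whatever it later resolves to, the signature and data layout stay the same.
unsigned sum_strided(const std::vector<unsigned> &data, unsigned lane) {
  unsigned total = 0;
  for (std::size_t i = lane; i < data.size(); i += lane_count())
    total += data[i];
  return total;
}

// ABI-related use (left as a comment on purpose): the size of the type would
// depend on the value, so two builds could disagree about the layout.
//   struct PerWaveScratch { unsigned slots[/* lane count */]; };
```

With 128 elements holding their own index, `sum_strided(data, 0)` visits indices 0 and 64 under the assumed stride of 64.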

I think the only important thing here is that we don't fold it prematurely, 
i.e. `clang --target=amdgcn-amd-amdhsa -c -flto` does not fold it, since 
that's before codegen.

@JonChesterfield was working on 'generic' IR as well, so maybe he has some 
opinions.

https://github.com/llvm/llvm-project/pull/114481
_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
