lialan wrote:

> > I still think we need an intrinsic here because a load + an addtid store 
> > can be scheduled much different from the asynchronous "gather to LDS" - and 
> > because we don't want this load/store to not be optimized
> 
> IMO the intrinsic should only be added as a last resort if we really can't 
> get the pattern based codegen to work well enough.

Beg to differ in particularly this case. In downstream application, I want to 
fine control to use this particular instruction so this gets propagated down to 
LLVM IR, without being changed or modified along the way.

Well, actual reason: we need this instruction now. :-p

https://github.com/llvm/llvm-project/pull/137425
_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to