jhuber6 wrote:

> I think we would be better off teaching an IR optimizer pass to recognize the 
> divide pattern and remap it to the load from the new location, rather than 
> forcing the complexity into every frontend

That's fair. I would've argued that this version should've been the builtin and
the grid size the computed value, but it's definitely not ideal to have
multiple versions of this. I'll try to find a place to do this peephole
optimization.

https://github.com/llvm/llvm-project/pull/83927
_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits
