gtbercea added a comment.


> The downside of this approach is that LLVM doesn't recognize these function 
> calls and doesn't perform optimizations to fold libcalls. For example `pow(a, 
> 2)` is transformed into a multiplication but `__nv_pow(a, 2)` is not.

Doesn't CUDA have the same problem?


Repository:
  rC Clang

https://reviews.llvm.org/D47849



_______________________________________________
cfe-commits mailing list
cfe-commits@lists.llvm.org
http://lists.llvm.org/cgi-bin/mailman/listinfo/cfe-commits

Reply via email to