Issue: 122760
Summary: [NVPTX] atomicrmw on <4 x float> relies on __atomic_compare_exchange_16
Labels: backend:NVPTX
Assignees: Artem-B
Reporter: Artem-B

NVPTX currently lowers `atomicrmw` on `<4 x float>` to a call to `__atomic_compare_exchange_16`, which does not exist on the GPU:
https://godbolt.org/z/ovf4cqKK5
Newer GPUs do support vectorized atomic ops on some data types, but on older GPUs these operations must be lowered without relying on a runtime library call.
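
As a rough illustration of what "without relying on runtime" could mean, here is a minimal host-side C++ sketch that performs a float add on each 32-bit lane with an inline compare-exchange loop, so nothing 16 bytes wide is ever required. The helper names `atomic_fadd_lane` and `fadd_v4_per_lane` are hypothetical, and this is not necessarily the lowering the backend should adopt: each lane is updated atomically on its own, but the four lanes are NOT updated atomically as a unit, which is weaker than a single 128-bit atomic operation.

```cpp
#include <atomic>
#include <cstdint>
#include <cstring>

static_assert(sizeof(float) == sizeof(std::uint32_t), "sketch assumes 32-bit float");

// Hypothetical helper: atomically adds `v` to one float lane stored as raw
// 32-bit bits, using only a 32-bit compare-exchange loop.
// Returns the value the lane held before the add.
inline float atomic_fadd_lane(std::atomic<std::uint32_t>& lane, float v) {
    std::uint32_t old_bits = lane.load(std::memory_order_relaxed);
    for (;;) {
        float old_val;
        std::memcpy(&old_val, &old_bits, sizeof old_val);
        const float new_val = old_val + v;
        std::uint32_t new_bits;
        std::memcpy(&new_bits, &new_val, sizeof new_bits);
        // On failure, old_bits is refreshed with the current lane contents
        // and the loop retries with the updated value.
        if (lane.compare_exchange_weak(old_bits, new_bits,
                                       std::memory_order_seq_cst,
                                       std::memory_order_relaxed)) {
            return old_val;
        }
    }
}

// Hypothetical helper: applies the per-lane add to all four lanes.
// Assumption, stated loudly: this is NOT atomic across the whole vector;
// another thread may observe some lanes updated and others not.
inline void fadd_v4_per_lane(std::atomic<std::uint32_t> (&lanes)[4],
                             const float (&v)[4],
                             float (&old_out)[4]) {
    for (int i = 0; i < 4; ++i) {
        old_out[i] = atomic_fadd_lane(lanes[i], v[i]);
    }
}
```

Whether per-lane atomicity is acceptable for `atomicrmw` on a vector type is a semantic question this report does not settle; the sketch only shows that a 32-bit compare-exchange loop needs no library support.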
_______________________________________________
llvm-bugs mailing list
llvm-bugs@lists.llvm.org
https://lists.llvm.org/cgi-bin/mailman/listinfo/llvm-bugs