Re: [patch][rfc] libgomp: Add OpenMP interop support to nvptx + gcn plugin

Andrew Stubbs Tue, 27 Aug 2024 02:38:07 -0700

On 22/08/2024 19:26, Tobias Burnus wrote:

This patch adds OpenMP's interop support to the libgomp plugins (nvptx:cuda, cuda_driver, hip; gcn: hip, hsa).*
[The idea is that the user can ask OpenMP to return a foreign-runtimehandle (CUdevice, hipCtx_t, …) for to a specified OpenMP device number –and to create a stream (CUstream, hipStream_t, cudaStream_t,hsa_queue_t), where OpenMP can take care of dependencies, .e.g, via the'depend' clause.]
The attached patch comes on top of the interop routine patch,https://gcc.gnu.org/pipermail/gcc-patches/2024-August/661118.html (andthe associated .texi patch,https://gcc.gnu.org/pipermail/gcc-patches/2024-August/661072.html ).
The patch is more a WIP/RFC patch than a final patch as it is currentlynot wired up: while 'GOMP_interop' can be called manually, the properway will be OpenMP's 'interop' directive, currently unimplemented.Hence, this patch is not extensively tested, does not include testcases,and target.c's GOMP_interop will surely change to handle all clauses.
But except that target.c's GOMP_interop will change, the rest of thepatch should be be rather solid – and could in principle be applied.
Therefore:
(A) Any comments, suggestions regarding the patch in general and inparticular the plugin/ related parts?


The code all looks pretty reasonable to me.

The header file conditional includes worry me though: it is addingcomplexity in a way that hurts maintainability, and looks like it mightbreak somebody's hypothetical out-of-tree plugin. Is it not better for aplugin that supports interop to include omp.h itself?

(B) RFC: The *stream* *creation* (hsa_queue_t, cudaStream_t/hipStream_t)functions have tons of options. Thus:
(i) Does the chosen size/flags argument for the stream/queue generationfor GCN/HIP/CUDA make sense? – Or are other values that are more sensible?

I think we want to follow the principle of least surprise, so max sizequeues with the same type we normally use, and certainly no non-blocking.

(ii) Should the user be able to tweak the values?
I mean, the user could say:** 'prefer_type({fr("cuda"),attr("ompx_priority:-2,ompx_non_blocking")},{fr("hsa"),attr("ompx_queue_size:64"})'.
Do we want to permit this? If yes, which of the values should bechangeable?

Is there any prior art for this? It looks like it could be added infuture, without breaking backward compatibility, so I say "no" (at leastfor now).

Tobias
(*) For Nvidia, HIP is just a thin wrapper of defines, typedefs andinline functions around CUDA. Thus, hip, cuda and cuda_driver areeffectively all the same. / The HSA is a new proposal that is currentlyadded additional-definition document. (OpenMP spec Issue #4023.)
(**) The used syntax and in particular 'attr' are new in OpenMP 6.0 (newin TR13). Note that attr only takes string literals [while 'fr' takesstrings and (6.0) identifiers ["omp_ifr_cuda"] or constant integerexpressions (5.1)].

Re: [patch][rfc] libgomp: Add OpenMP interop support to nvptx + gcn plugin

Reply via email to