Hi Bernd and Thomas, Are you planning to support offloading from DSO in PTX/CUDA environment? If yes, how are you going to solve the problem of the collision of function names from different DSOs?
However, if we decide to use element-wise host-target address mapping, there are opportunities to make this approach more robust. E.g. we can store some hash(name) in the compiler-generated tables along with the address and size. When libgomp will perform device initialization, it will compare hashes from the host and target DSOs. This should reveal possible errors during the initialization, and will avoid hard-to-debug silent failures. -- Ilya