Hi, I just started to use openmpi and am trying to run a MPI/GPU code. My code compiles but when I run, I get this error: The library attempted to open the following supporting CUDA libraries, but each of them failed. CUDA-aware support is disabled. /usr/lib/libcuda.so.1: wrong ELF class: ELFCLASS32 /usr/lib/libcuda.so.1: wrong ELF class: ELFCLASS32 If you are not interested in CUDA-aware support, then run with --mca mpi_cuda_support 0 to suppress this message. If you are interested in CUDA-aware support, then try setting LD_LIBRARY_PATH to the location of libcuda.so.1 to get passed this issue.
I could not find a libcuda.so.1 in my system but I do find libcuda.so in /usr/local/cuda/lib64/stubs. Why is openmpi looking for libcuda.so.1 instead of libcuda.so? I created a symbolic link to libcuda.so, now I get CUDA error 35: CUDA driver version is insufficient for CUDA runtime version. I am not sure if this is related to libcuda.so or the driver since I could run this code using mvapich. Any input on the issue is really appreciated. My openmpi version is 1.8.4, my cuda version is 6.5, driver version is 340.65. Thanks. Jason