Re: [OMPI users] OpenMPI-5.0.6: -x LD_LIBRARY_PATH not able to load shared objects

2025-02-14 Thread Patrick Begou
Le 14/02/2025 à 13:22, Sangam B a écrit : Hi, OpenMPI-5.0.6 is compiled with ucx-1.18 and Intel 1api 2024 v2.1 compilers. An mpi program is compiled with this openmpi-5.0.6. While submitting job thru PBS on a Linux cluster, the intel compilers is sourced and the same is passed thru OpenMPI's

[OMPI users] OpenMPI-5.0.6: -x LD_LIBRARY_PATH not able to load shared objects

2025-02-14 Thread Sangam B
Hi, OpenMPI-5.0.6 is compiled with ucx-1.18 and Intel 1api 2024 v2.1 compilers. An mpi program is compiled with this openmpi-5.0.6. While submitting job thru PBS on a Linux cluster, the intel compilers is sourced and the same is passed thru OpenMPI's mpirun command option: " -x LD_LIBRARY_PAT

Re: [OMPI users] OpenMPI-5.0.6: -x LD_LIBRARY_PATH not able to load shared objects

2025-02-14 Thread Gilles Gouaillardet
Sangam, -x LD_LIBRARY_PATH won't do the trick here. mpirun spawns prted daemons on the other nodes (via the tm interface or whatever the latest PBS uses if support was built into Open MPI, or SSH otherwise), and the daemons fail to start because the intel runtime cannot be found. you can chrpath

Re: [OMPI users] OpenMPI-5.0.6: -x LD_LIBRARY_PATH not able to load shared objects

2025-02-14 Thread Patrick Begou
Hi Sangam could you check that the install location of the library is the same on all the nodes ?  May be checking LD_LIBRARY_PATH after sourcing the intel vars.sh file ? I'm using OpenMPI 5.0.6 but in a Slurm context and it works fine. Patrick Le 14/02/2025 à 19:00, Sangam B a écrit : Hi P

[OMPI users] Disable PMPI bindings?

2025-02-14 Thread Joshua Strodtbeck
Hi everyone, I am trying to use Open MPI built with IBM's Open XLF 17.x, and I get compile-time errors in the application (WRF) due to an apparent mismatch between the PMPI argument list and what was actually compiled into the module, e.g.: "mpif-sizeof.h", line 2463.6: 1514-699 (S) Procedure "p

Re: [OMPI users] OpenMPI-5.0.6: -x LD_LIBRARY_PATH not able to load shared objects

2025-02-14 Thread Sangam B
Hi Patrick, Thanks for your reply. Ofcourse, the intel vars.sh is sourced inside the pbs script and I've tried multiple ways to resolve this issue: -x LD_LIBRARY_PATH & -x LD_LIBRARY_PATH=/opt/intel/oneapi/2024/v2.1/compiler/2024.2/opt/compiler/lib:/opt/intel/oneapi/2024/v2.1/compiler/2024.2/lib:

Re: [OMPI users] OpenMPI-5.0.6: -x LD_LIBRARY_PATH not able to load shared objects

2025-02-14 Thread Patrick Begou
Bad answer, sorry I did not managed prted was part of OpenMPI stack. Le 14/02/2025 à 19:19, Patrick Begou a écrit : Hi Sangam could you check that the install location of the library is the same on all the nodes ?  May be checking LD_LIBRARY_PATH after sourcing the intel vars.sh file ? I'm u

Re: [OMPI users] OpenMPI-5.0.6: -x LD_LIBRARY_PATH not able to load shared objects

2025-02-14 Thread Sangam B
Thanks Gilles & Patrick. As Gilles mentioned, while OpenMPI spawns prted daemons on compute nodes, it fails to get launched, because Intel runtime is not available. To resolve this issue, I loaded the Intel runtime before job submission on the terminal session and used #PBS -V in the job script.