I'm using Open MPI 4.1.2 under Slurm 20.11.8. My 2-process job launches
successfully, but when the main process (rank 0)
attempts to create an intercommunicator with process rank 1 on the other node:
MPI_Comm intercom;
MPI_Intercomm_create(MPI_COMM_SELF, 0, MPI_COMM_WORLD, 1, , &intercom);
Op
I see similar issues on platforms with multiple IP addresses, if some of
them are not fully connected. In general, specifying which interface OMPI
can use (with --mca btl_tcp_if_include x.y.z.t/s) solves the problem.
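As a rough sketch of the above, the same MCA parameter can also be set through the environment, which may be handy when the job is started by a batch script rather than a hand-typed mpirun line. The subnet 192.168.1.0/24, the variable name OMPI_IF_SUBNET, and the binary name my_app are placeholders I made up for illustration; substitute the network that all of your nodes can actually reach each other on.

```shell
# Placeholder subnet in CIDR form (an assumption, not from this thread);
# replace it with the network that is fully connected between your nodes.
OMPI_IF_SUBNET="192.168.1.0/24"

# Open MPI reads MCA parameters from environment variables named
# OMPI_MCA_<param>, so this has the same effect as passing
# --mca btl_tcp_if_include on the mpirun command line.
export OMPI_MCA_btl_tcp_if_include="$OMPI_IF_SUBNET"

# Then launch as usual, for example (my_app is a hypothetical binary):
#   mpirun -np 2 ./my_app
# or, equivalently, without the environment variable:
#   mpirun --mca btl_tcp_if_include "$OMPI_IF_SUBNET" -np 2 ./my_app
```

Restricting the TCP BTL to one subnet keeps Open MPI from trying addresses that are not routable between the two nodes, which seems to be the failure mode described here.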
George.
On Wed, Mar 16, 2022 at 5:11 PM Mccall, Kurt E. (MSFC-EV41) via users
George,
Thanks, that was it!
Kurt
From: George Bosilca
Sent: Wednesday, March 16, 2022 4:38 PM
To: Open MPI Users
Cc: Mccall, Kurt E. (MSFC-EV41)
Subject: [EXTERNAL] Re: [OMPI users] MPI_Intercomm_create error