[OMPI users] MPI_Comm_accept() / MPI_Comm_connect() fail between two different machines

2015-07-13 Thread Audet, Martin
Hi OMPI_Developers, It seems that I am unable to establish an MPI communication between two independently started MPI programs using the simplest client/server call sequence I can imagine (see the two attached files) when the client and server processes are started on different machines. Note tha

Re: [OMPI users] MPI_Comm_accept() / MPI_Comm_connect() fail between two different machines

2015-07-13 Thread Ralph Castain
Try running it with "--mca oob_base_verbose 100" on both client and server - it will tell us why the connection was refused. > On Jul 13, 2015, at 2:14 PM, Audet, Martin wrote: > > Hi OMPI_Developers, > > It seems that I am unable to establish an MPI communication between two > independent

Re: [OMPI users] MPI_Comm_accept() / MPI_Comm_connect() fail between two different machines

2015-07-13 Thread Audet, Martin
Thanks Ralph for this quick response. In the two attachments you will find the output I got when running the following commands: [audet@fn1 mpi]$ mpiexec --mca oob_base_verbose 100 -n 1 ./simpleserver 2>&1 | tee server_out.txt [audet@linux15 mpi]$ mpiexec --mca oob_base_verbose 100 -n 1 ./sim

Re: [OMPI users] MPI_Comm_accept() / MPI_Comm_connect() fail between two different machines

2015-07-13 Thread Ralph Castain
I see the problem - it’s a race condition, actually. I’ll try to provide a patch for you to test, if you don’t mind. > On Jul 13, 2015, at 3:03 PM, Audet, Martin wrote: > > Thanks Ralph for this quick response. > > In the two attachments you will find the output I got when running the >