Can you check with an MPI_Finalize in the receiver ?
Jalel
Le 16/09/2015 16:06, marcin.krotkiewski a écrit :
I have run into a freeze / potential bug when using MPI_Comm_accept in
a simple client / server implementation. I have attached two simplest
programs I could produce:
1. mpi-receiver.c opens a port using MPI_Open_port, saves the port
name to a file
2. mpi-receiver enters infinite loop and waits for connections using
MPI_Comm_accept
3. mpi-sender.c connects to that port using MPI_Comm_connect, sends
one MPI_UNSIGNED_LONG, calls barrier and disconnects using
MPI_Comm_disconnect
4. mpi-receiver reads the MPI_UNSIGNED_LONG, prints it, calls barrier
and disconnects using MPI_Comm_disconnect and goes to point 2 -
infinite loop
All works fine, but only exactly 5 times. After that the receiver
hangs in MPI_Recv, after exit from MPI_Comm_accept. That is 100%
repeatable. I have tried with Intel MPI - no such problem.
I execute the programs using OpenMPI 1.10 as follows
mpirun -np 1 --mca mpi_leave_pinned 0 ./mpi-receiver
Do you have any clues what could be the reason? Am I doing sth wrong,
or is it some problem with internal state of OpenMPI?
Thanks a lot!
Marcin
_______________________________________________
users mailing list
us...@open-mpi.org
Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
Link to this post:
http://www.open-mpi.org/community/lists/users/2015/09/27585.php
--
*------------------------------------------------------------------------*
Jalel CHERGUI, LIMSI-CNRS, Bât. 508 - BP 133, 91403 Orsay cedex, FRANCE
Tél: (33 1) 69 85 81 27 ; Télécopie: (33 1) 69 85 80 88
Mél: jalel.cher...@limsi.fr ; Référence: http://perso.limsi.fr/chergui
*------------------------------------------------------------------------*