Noam, you might also want to try
mpirun --mca btl tcp,self ... to rule out btl (shared memory and/or infiniband) related issues. Once you rebuild Open MPI with --enable-debug, I recommend you first check the arguments of the MPI_Send() and MPI_Recv() functions and make sure - same communicator is used (in C, check comm->c_contextid) - same tag - double check the MPI tasks do wait for each other (in C, check comm->c_my_rank, source and dest) Cheers, Gilles On Fri, Apr 6, 2018 at 5:31 AM, George Bosilca <bosi...@icl.utk.edu> wrote: > Yes, you can do this by adding --enable-debug to OMPI configure (and make > sure your don't have the configure flag --with-platform=optimize). > > George. > > > On Thu, Apr 5, 2018 at 4:20 PM, Noam Bernstein <noam.bernst...@nrl.navy.mil> > wrote: >> >> >> On Apr 5, 2018, at 4:11 PM, George Bosilca <bosi...@icl.utk.edu> wrote: >> >> I attach with gdb on the processes and do a "call mca_pml_ob1_dump(comm, >> 1)". This allows the debugger to make a call our function, and output >> internal information about the library status. >> >> >> Great. But I guess I need to recompile ompi in debug mode? Is that just >> a flag to configure? >> >> thanks, >> Noam >> >> >> ____________ >> | >> | >> | >> U.S. NAVAL >> | >> | >> _RESEARCH_ >> | >> LABORATORY >> >> Noam Bernstein, Ph.D. >> Center for Materials Physics and Technology >> U.S. Naval Research Laboratory >> T +1 202 404 8628 F +1 202 404 7546 >> https://www.nrl.navy.mil >> >> >> _______________________________________________ >> users mailing list >> users@lists.open-mpi.org >> https://lists.open-mpi.org/mailman/listinfo/users > > > > _______________________________________________ > users mailing list > users@lists.open-mpi.org > https://lists.open-mpi.org/mailman/listinfo/users _______________________________________________ users mailing list users@lists.open-mpi.org https://lists.open-mpi.org/mailman/listinfo/users