Re: [OMPI users] Cannot run a simple MPI program

2016-04-25 Thread Jeff Squyres (jsquyres)
On Apr 24, 2016, at 8:12 PM, Gilles Gouaillardet wrote:
>
> fwiw, once in a while, i
> rm -rf /.../ompi_install_dir/lib/openmpi
> to get rid of the removed modules

If it helps, I usually install Open MPI into a tree all by itself, and then I can "rm -rf $ompi_install_dir" in its entirety
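A minimal sketch of the workflow Jeff describes, assuming Open MPI is built from source with its usual configure script. The prefix path and the -j4 job count are placeholders, not values from the thread.

```shell
# Hypothetical prefix: a directory that holds nothing but this Open MPI build.
ompi_install_dir="$HOME/ompi-install"

# Run from the top of the Open MPI source tree
# (a git checkout needs ./autogen.pl before configure).
./configure --prefix="$ompi_install_dir"
make -j4
make install

# Before installing a newer build, remove the whole tree so that no stale
# components survive. ${var:?} aborts if the variable is unset or empty.
rm -rf "${ompi_install_dir:?}"
```

Because nothing else lives under the prefix, removing it cannot touch unrelated software, which is the point of keeping the tree separate.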

Re: [OMPI users] Cannot run a simple MPI program

2016-04-24 Thread Gilles Gouaillardet
As far as i am concerned, i would consider that a bug : since the link is down, the psm component should simply disqualify itself. i will follow up on this on the devel ML

Cheers,
Gilles

On 4/25/2016 10:36 AM, dpchoudh . wrote:
Hello Gilles Thank you for finding the bug; it was not ther

Re: [OMPI users] Cannot run a simple MPI program

2016-04-24 Thread dpchoudh .
Hello George

Adding --mca pml ob1 does make the program run. I just wanted to make sure that was the expected behaviour (as opposed to a bug in mpirun).

Thanks
Durga

1% of the executables have 99% of CPU privilege! Userspace code! Unite!! Occupy the kernel!!!

On Sun, Apr 24, 2016 at 9:43 PM, Ge

Re: [OMPI users] Cannot run a simple MPI program

2016-04-24 Thread George Bosilca
Add --mca pml ob1 to your mpirun command.

George

On Sunday, April 24, 2016, dpchoudh . wrote:
> Hello Gilles
>
> Thank you for finding the bug; it was not there in the original code; I
> added it while trying to 'simplify' the code.
>
> With the bug fixed, the code now runs in the last scenario
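A minimal sketch of George's suggestion, assuming a two-process run of a program named ./my_mpi_program (both are placeholders, not taken from the thread). The second form sets the same MCA parameter through Open MPI's OMPI_MCA_ environment-variable prefix rather than on the command line.

```shell
# Select the ob1 PML for this run only.
# -np 2 and ./my_mpi_program are placeholders.
mpirun --mca pml ob1 -np 2 ./my_mpi_program

# Alternative: set the same MCA parameter in the environment,
# so every later mpirun in this shell picks it up.
export OMPI_MCA_pml=ob1
mpirun -np 2 ./my_mpi_program
```

The command-line form is easier to undo when the goal is only to test whether a different PML avoids the hang.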

Re: [OMPI users] Cannot run a simple MPI program

2016-04-24 Thread dpchoudh .
Hello Gilles

Thank you for finding the bug; it was not there in the original code; I added it while trying to 'simplify' the code.

With the bug fixed, the code now runs in the last scenario. But it still hangs with the following command line (even after updating to latest git tree, rebuilding and

Re: [OMPI users] Cannot run a simple MPI program

2016-04-24 Thread Gilles Gouaillardet
two comments :
- the program is incorrect : slave() should MPI_Recv(..., MPI_ANY_TAG, ...)
- current master uses pmix114, and your traces mention pmix120, so your master is out of sync, or pmix120 is an old module that was not manually removed.

fwiw, once in a while, i rm -rf /.../ompi_in