Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled

2016-05-12 Thread Jeff Squyres (jsquyres)
Keep in mind that -- in general -- Open MPI has two different kinds of traffic: 1. "Out of band" (OOB) traffic, used for launching/monitoring/killing the job 2. MPI traffic The OOB traffic generally uses TCP, and can be over whatever network you want. The MPI traffic you generally want to use yo

Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled

2016-05-12 Thread Jeff Squyres (jsquyres)
Keep in mind that -- in general -- Open MPI has two different kinds of traffic: 1. "Out of band" (OOB) traffic, used for launching/monitoring/killing the job 2. MPI traffic The OOB traffic generally uses TCP, and can be over whatever network you want. The MPI traffic you generally want to use yo

Re: [OMPI users] mpirun command won't run unless the firewalld daemon is disabled

2016-05-12 Thread Jeff Squyres (jsquyres)
Keep in mind that -- in general -- Open MPI has two different kinds of traffic: 1. "Out of band" (OOB) traffic, used for launching/monitoring/killing the job 2. MPI traffic The OOB traffic generally uses TCP, and can be over whatever network you want. The MPI traffic you generally want to use yo

Re: [OMPI users] No core dump in some cases

2016-05-12 Thread Gilles Gouaillardet
Thanks for the tests ! What was fixed in openmpi is handling disconnected infinipath port. Restoring signal handlers when libinfinipath.so is unloaded (when mca_mtl_psm.so is unloaded from our point of view) can only be fixed within libinfinipath.so. It might have already been fixed in the latest

Re: [OMPI users] OMPI users] No core dump in some cases

2016-05-12 Thread Gilles Gouaillardet
Thanks for the tests ! What was fixed in openmpi is handling disconnected infinipath port. Restoring signal handlers when libinfinipath.so is unloaded (when mca_mtl_psm.so is unloaded from our point of view) can only be fixed within libinfinipath.so. It might have already been fixed in the late

Re: [OMPI users] No core dump in some cases

2016-05-12 Thread dpchoudh .
If you configure with --disable-dlopen, then libinfinipath.so is slurped and hence the infinipath signal handler is always set, even if you disable the psm mtl or choose to only use the ob1 pml. if you do not configure with --disable-dlopen, then the infinipath signal handler is set when mca_mtl_

Re: [OMPI users] No core dump in some cases

2016-05-12 Thread Gilles Gouaillardet
If you configure with --disable-dlopen, then libinfinipath.so is slurped and hence the infinipath signal handler is always set, even if you disable the psm mtl or choose to only use the ob1 pml. if you do not configure with --disable-dlopen, then the infinipath signal handler is set when mca_m

Re: [OMPI users] No core dump in some cases

2016-05-12 Thread dpchoudh .
Hello Gilles I am not sure if I understand you correctly, but let me answer based on what I think you mean: the infinipath signal handler only dump the stack (into a .btr file, yeah !) so if your application crashes without it, you should examine the core file and see what is going wrong. If t

Re: [OMPI users] No core dump in some cases

2016-05-12 Thread Gilles Gouaillardet
Durga, the infinipath signal handler only dump the stack (into a .btr file, yeah !) so if your application crashes without it, you should examine the core file and see what is going wrong. note the infinipath signal handler is set in the constructor of libinfinipath.so, and used *not* to be remov