Re: [OMPI users] mpirun problem when running on more than three hosts with OpenMPI 1.8

2014-04-14 Thread Ralph Castain
On Apr 13, 2014, at 11:42 AM, Allan Wu wrote: > Thanks, Ralph! > > Adding MAC parameter 'plm_rsh_no_tree_spawn' solves the problem. > > If I understand correctly, the first layer of daemons are three nodes, and > when there are more than three nodes the second layer of daemons are spawn. >

Re: [OMPI users] mpirun problem when running on more than three hosts with OpenMPI 1.8

2014-04-13 Thread Allan Wu
Thanks, Ralph! Adding MAC parameter ' plm_rsh_no_tree_spawn' solves the problem. If I understand correctly, the first layer of daemons are three nodes, and when there are more than three nodes the second layer of daemons are spawn. So my problem is happened when MPI processes are launched by the

Re: [OMPI users] mpirun problem when running on more than three hosts with OpenMPI 1.8

2014-04-11 Thread Ralph Castain
The problem is with the tree-spawn nature of the rsh/ssh launcher. For scalability, mpirun only launches a first "layer" of daemons. Each of those daemons then launches another layer in a tree-like fanout. The default pattern is such that you first notice it when you have four nodes in your allo

[OMPI users] mpirun problem when running on more than three hosts with OpenMPI 1.8

2014-04-11 Thread Allan Wu
Hello everyone, I am running a simple helloworld program on several nodes using OpenMPI 1.8. Running commands on single node or small number of nodes are successful, but when I tried to run the same binary on four different nodes, problems occurred. I am using 'mpirun' command line like the follo