See these blog entries for information about Open MPI's tree-based launching:
http://blogs.cisco.com/performance/tree-based-launch-in-open-mpi http://blogs.cisco.com/performance/tree-based-launch-in-open-mpi-part-2 > On May 3, 2015, at 8:52 PM, XingFENG <xingf...@cse.unsw.edu.au> wrote: > > Thanks very much Ralph. It works fine now. > > Just some more questions to make it clearer. > > 1. Your respond confuses me. Especially for this one, The rsh launch is > typically done in a tree topology, and so you get ssh from one backend node > to another. > > 2. By adding -mca plm_rsh_no_tree_spawn 1, it seems that daemon process is > disabled because I could not find processes orted and sshd any more. Is rsh > used instead here? > > > > On Mon, May 4, 2015 at 12:54 AM, Ralph Castain <r...@open-mpi.org> wrote: > >> On May 3, 2015, at 5:54 AM, XingFENG <xingf...@cse.unsw.edu.au> wrote: >> >> By changing the order of hosts, the program now can run on 6 machines with 2 >> accounts. >> >> But it is very weird. Why the order of hosts would matter? Could any one >> refer something about how openmpi work, please? I have searched on the >> Internet but nothing related is found. > > If you add —mca plm_rsh_no_tree_spawn 1 to your cmd line, you’ll probably be > okay. The rsh launch is typically done in a tree topology, and so you get ssh > from one backend node to another. If you don’t have password-less ssh from > every machine to each other machine, then you can get the behavior you > describe. > > Adding that parameter will cause all the ssh commands to come directly from > mpirun. > > >> >> >> On Sun, May 3, 2015 at 10:03 PM, XingFENG <xingf...@cse.unsw.edu.au> wrote: >> I got some problems while running MPI programs with different account. >> I have done the following things. >> >> 1. I have set password-less connection between these machines. >> 2. I have specified user names in host file. >> 3. I have successfully run the program on two machines with two different >> accounts. >> >> But when I tried on other machines with the two accounts, openmpi stuck at >> the very beginning. There is no error information. It seems openmpi falls in >> some infinite loop. Any help would be greatly appreciated. >> >> On Wed, Apr 15, 2015 at 6:04 AM, Jeff Squyres (jsquyres) >> <jsquy...@cisco.com> wrote: >> You can also specify per-machine usernames in $HOME/.ssh/config. >> >> See ssh_config(5). >> >> >> >> > On Apr 13, 2015, at 9:19 PM, Ralph Castain <r...@open-mpi.org> wrote: >> > >> > >> >> On Apr 13, 2015, at 5:47 PM, XingFENG <xingf...@cse.unsw.edu.au> wrote: >> >> >> >> Thanks for all who joined the discussion. >> >> Yes, I meant to run the job under both accounts at the same time. >> >> >> >> Hi, Ralph, >> >> >> >> I have searched about specifying username in hostfile and got no related >> >> page. Could you give a sample of specifying username in hostfile, please? >> > >> > you just provide the host name as “username@host” >> > >> >> Another problem is do you also specify password in hostfile? >> > >> > >> > Good heavens, no! It has to be password-less ssh. Frankly, I doubt this >> > will work as I suspect ssh isn’t going to let you do it. >> > >> > >> >> >> >> On Tue, Apr 14, 2015 at 12:05 AM, Ralph Castain <r...@open-mpi.org> wrote: >> >> >> >>> On Apr 13, 2015, at 6:56 AM, Maxime Boissonneault >> >>> <maxime.boissonnea...@calculquebec.ca> wrote: >> >>> >> >>> Le 2015-04-13 09:54, Ralph Castain a écrit : >> >>>>> On Apr 13, 2015, at 6:52 AM, Maxime Boissonneault >> >>>>> <maxime.boissonnea...@calculquebec.ca> wrote: >> >>>>> >> >>>>> Just out of curiosity... how will OpenMPI start processes under >> >>>>> different accounts ? Through SSH while specifying different user names >> >>>>> ? >> >>>>> I am assuming that no resource manager or scheduler will allow this. >> >>>> I’m assuming he just plans to run the entire job as the other user. >> >>>> Essentially, it would be the same as if his friend ran the job for him. >> >>> From this comment : >> >>> >> >>> My problem is that my account is limited to use 4 machines (I need more >> >>> machines to process data). I can borrow my friend's account and thus >> >>> have access to another 4 machines but I am not sure whether it works. >> >>> >> >>> I assumed that he wants to run the job under _both_ accounts at the same >> >>> time. >> >> >> >> I honestly don’t know if that would work. We do provide a way in the >> >> hostfile for someone to specify the username for a remote host, but you’d >> >> have to get the ssh system to accept it. Assuming the other username also >> >> has your host in its authorized keys, then I suppose it might work - I’ve >> >> never tried it myself. >> >> >> >> >> >>> >> >>> >> >>>>> My recommendation would be to contact your sysadmin and ask for an >> >>>>> exception instead of going through with this insanity (forgive the >> >>>>> judgement here). >> >>>> Agreed! >> >>>> >> >>>>> Maxime >> >>>>> >> >>>>> Le 2015-04-13 09:47, Ralph Castain a écrit : >> >>>>>> Let’s hope you sys admin doesn’t find out about it - they tend to >> >>>>>> take a dim view of sharing accounts! >> >>>>>> >> >>>>>> So long as the path and library path are set correctly, we won’t care. >> >>>>>> >> >>>>>> >> >>>>>>> On Apr 12, 2015, at 10:33 PM, XingFENG <xingf...@cse.unsw.edu.au> >> >>>>>>> wrote: >> >>>>>>> >> >>>>>>> Hi all, >> >>>>>>> >> >>>>>>> I am wondering if it is possible that MPI programs can be run on >> >>>>>>> machines with different account? >> >>>>>>> >> >>>>>>> I am doing experiments with some MPI programs on a cluster. My >> >>>>>>> problem is that my account is limited to use 4 machines (I need more >> >>>>>>> machines to process data). I can borrow my friend's account and thus >> >>>>>>> have access to another 4 machines but I am not sure whether it works. >> >>>>>>> >> >>>>>>> -- >> >>>>>>> Best Regards. >> >>>>>>> --- >> >>>>>>> Xing FENG >> >>>>>>> PhD Candidate >> >>>>>>> Database Research Group >> >>>>>>> >> >>>>>>> School of Computer Science and Engineering >> >>>>>>> University of New South Wales >> >>>>>>> NSW 2052, Sydney >> >>>>>>> >> >>>>>>> Phone: (+61) 413 857 288 >> >>>>>>> _______________________________________________ >> >>>>>>> users mailing list >> >>>>>>> us...@open-mpi.org >> >>>>>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> >>>>>>> Link to this post: >> >>>>>>> http://www.open-mpi.org/community/lists/users/2015/04/26687.php >> >>>>>> _______________________________________________ >> >>>>>> users mailing list >> >>>>>> us...@open-mpi.org >> >>>>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> >>>>>> Link to this post: >> >>>>>> http://www.open-mpi.org/community/lists/users/2015/04/26690.php >> >>>>> >> >>>>> -- >> >>>>> --------------------------------- >> >>>>> Maxime Boissonneault >> >>>>> Analyste de calcul - Calcul Québec, Université Laval >> >>>>> Ph. D. en physique >> >>>>> >> >>>>> _______________________________________________ >> >>>>> users mailing list >> >>>>> us...@open-mpi.org >> >>>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> >>>>> Link to this post: >> >>>>> http://www.open-mpi.org/community/lists/users/2015/04/26691.php >> >>>> _______________________________________________ >> >>>> users mailing list >> >>>> us...@open-mpi.org >> >>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> >>>> Link to this post: >> >>>> http://www.open-mpi.org/community/lists/users/2015/04/26692.php >> >>> >> >>> >> >>> -- >> >>> --------------------------------- >> >>> Maxime Boissonneault >> >>> Analyste de calcul - Calcul Québec, Université Laval >> >>> Ph. D. en physique >> >>> >> >>> _______________________________________________ >> >>> users mailing list >> >>> us...@open-mpi.org >> >>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> >>> Link to this post: >> >>> http://www.open-mpi.org/community/lists/users/2015/04/26693.php >> >> >> >> >> >> _______________________________________________ >> >> users mailing list >> >> us...@open-mpi.org >> >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> >> Link to this post: >> >> http://www.open-mpi.org/community/lists/users/2015/04/26694.php >> >> >> >> >> >> >> >> -- >> >> Best Regards. >> >> --- >> >> Xing FENG >> >> PhD Candidate >> >> Database Research Group >> >> >> >> School of Computer Science and Engineering >> >> University of New South Wales >> >> NSW 2052, Sydney >> >> >> >> Phone: (+61) 413 857 288 >> >> _______________________________________________ >> >> users mailing list >> >> us...@open-mpi.org >> >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> >> Link to this post: >> >> http://www.open-mpi.org/community/lists/users/2015/04/26707.php >> > >> > _______________________________________________ >> > users mailing list >> > us...@open-mpi.org >> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> > Link to this post: >> > http://www.open-mpi.org/community/lists/users/2015/04/26708.php >> >> >> -- >> Jeff Squyres >> jsquy...@cisco.com >> For corporate legal information go to: >> http://www.cisco.com/web/about/doing_business/legal/cri/ >> >> _______________________________________________ >> users mailing list >> us...@open-mpi.org >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> Link to this post: >> http://www.open-mpi.org/community/lists/users/2015/04/26729.php >> >> >> >> -- >> Best Regards. >> --- >> Xing FENG >> PhD Candidate >> Database Research Group >> >> School of Computer Science and Engineering >> University of New South Wales >> NSW 2052, Sydney >> >> Phone: (+61) 413 857 288 >> >> >> >> -- >> Best Regards. >> --- >> Xing FENG >> PhD Candidate >> Database Research Group >> >> School of Computer Science and Engineering >> University of New South Wales >> NSW 2052, Sydney >> >> Phone: (+61) 413 857 288 >> _______________________________________________ >> users mailing list >> us...@open-mpi.org >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users >> Link to this post: >> http://www.open-mpi.org/community/lists/users/2015/05/26831.php > > > _______________________________________________ > users mailing list > us...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users > Link to this post: > http://www.open-mpi.org/community/lists/users/2015/05/26832.php > > > > -- > Best Regards. > --- > Xing FENG > PhD Candidate > Database Research Group > > School of Computer Science and Engineering > University of New South Wales > NSW 2052, Sydney > > Phone: (+61) 413 857 288 > _______________________________________________ > users mailing list > us...@open-mpi.org > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users > Link to this post: > http://www.open-mpi.org/community/lists/users/2015/05/26833.php -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/