Thanks very much Ralph. It works fine now.

Just some more questions to make it clearer.

1. Your respond confuses me. Especially for this one, *The rsh launch is
typically done in a tree topology, and so you get ssh from one backend node
to another.*

2. By adding -mca plm_rsh_no_tree_spawn 1, it seems that daemon process is
disabled because I could not find processes orted and sshd  any more. Is
rsh used instead here?



On Mon, May 4, 2015 at 12:54 AM, Ralph Castain <r...@open-mpi.org> wrote:

>
> On May 3, 2015, at 5:54 AM, XingFENG <xingf...@cse.unsw.edu.au> wrote:
>
> By changing the order of hosts, the program now can run on 6 machines with
> 2 accounts.
>
> But it is very weird. Why the order of hosts would matter? Could any one
> refer something about how openmpi work, please? I have searched on the
> Internet but nothing related is found.
>
>
> If you add —mca plm_rsh_no_tree_spawn 1 to your cmd line, you’ll probably
> be okay. The rsh launch is typically done in a tree topology, and so you
> get ssh from one backend node to another. If you don’t have password-less
> ssh from every machine to each other machine, then you can get the behavior
> you describe.
>
> Adding that parameter will cause all the ssh commands to come directly
> from mpirun.
>
>
>
>
> On Sun, May 3, 2015 at 10:03 PM, XingFENG <xingf...@cse.unsw.edu.au>
> wrote:
>
>> I got some problems while running MPI programs with different account.
>> I have done the following things.
>>
>> 1. I have set password-less connection between these machines.
>> 2. I have specified user names in host file.
>> 3. I have successfully run the program on two machines with two different
>> accounts.
>>
>> But when I tried on other machines with the two accounts, openmpi stuck
>> at the very beginning. There is no error information. It seems openmpi
>> falls in some infinite loop. Any help would be greatly appreciated.
>>
>> On Wed, Apr 15, 2015 at 6:04 AM, Jeff Squyres (jsquyres) <
>> jsquy...@cisco.com> wrote:
>>
>>> You can also specify per-machine usernames in $HOME/.ssh/config.
>>>
>>> See ssh_config(5).
>>>
>>>
>>>
>>> > On Apr 13, 2015, at 9:19 PM, Ralph Castain <r...@open-mpi.org> wrote:
>>> >
>>> >
>>> >> On Apr 13, 2015, at 5:47 PM, XingFENG <xingf...@cse.unsw.edu.au>
>>> wrote:
>>> >>
>>> >> Thanks for all who joined the discussion.
>>> >> Yes, I meant to run the job under both accounts at the same time.
>>> >>
>>> >> Hi, Ralph,
>>> >>
>>> >> I have searched about specifying username in hostfile and got no
>>> related page. Could you give a sample of specifying username in hostfile,
>>> please?
>>> >
>>> > you just provide the host name as “username@host”
>>> >
>>> >>   Another problem is do you also specify password in hostfile?
>>> >
>>> >
>>> > Good heavens, no! It has to be password-less ssh. Frankly, I doubt
>>> this will work as I suspect ssh isn’t going to let you do it.
>>> >
>>> >
>>> >>
>>> >> On Tue, Apr 14, 2015 at 12:05 AM, Ralph Castain <r...@open-mpi.org>
>>> wrote:
>>> >>
>>> >>> On Apr 13, 2015, at 6:56 AM, Maxime Boissonneault <
>>> maxime.boissonnea...@calculquebec.ca> wrote:
>>> >>>
>>> >>> Le 2015-04-13 09:54, Ralph Castain a écrit :
>>> >>>>> On Apr 13, 2015, at 6:52 AM, Maxime Boissonneault <
>>> maxime.boissonnea...@calculquebec.ca> wrote:
>>> >>>>>
>>> >>>>> Just out of curiosity... how will OpenMPI start processes under
>>> different accounts ? Through SSH while specifying different user names ?
>>> >>>>> I am assuming that no resource manager or scheduler will allow
>>> this.
>>> >>>> I’m assuming he just plans to run the entire job as the other user.
>>> Essentially, it would be the same as if his friend ran the job for him.
>>> >>> From this comment :
>>> >>>
>>> >>> My problem is that my account is limited to use 4 machines (I need
>>> more machines to process data). I can borrow my friend's account and thus
>>> have access to another 4 machines but I am not sure whether it works.
>>> >>>
>>> >>> I assumed that he wants to run the job under _both_ accounts at the
>>> same time.
>>> >>
>>> >> I honestly don’t know if that would work. We do provide a way in the
>>> hostfile for someone to specify the username for a remote host, but you’d
>>> have to get the ssh system to accept it. Assuming the other username also
>>> has your host in its authorized keys, then I suppose it might work - I’ve
>>> never tried it myself.
>>> >>
>>> >>
>>> >>>
>>> >>>
>>> >>>>> My recommendation would be to contact your sysadmin and ask for an
>>> exception instead of going through with this insanity (forgive the
>>> judgement here).
>>> >>>> Agreed!
>>> >>>>
>>> >>>>> Maxime
>>> >>>>>
>>> >>>>> Le 2015-04-13 09:47, Ralph Castain a écrit :
>>> >>>>>> Let’s hope you sys admin doesn’t find out about it - they tend to
>>> take a dim view of sharing accounts!
>>> >>>>>>
>>> >>>>>> So long as the path and library path are set correctly, we won’t
>>> care.
>>> >>>>>>
>>> >>>>>>
>>> >>>>>>> On Apr 12, 2015, at 10:33 PM, XingFENG <xingf...@cse.unsw.edu.au>
>>> wrote:
>>> >>>>>>>
>>> >>>>>>> Hi all,
>>> >>>>>>>
>>> >>>>>>> I am wondering if it is possible that MPI programs can be run on
>>> machines with different account?
>>> >>>>>>>
>>> >>>>>>> I am doing experiments with some MPI programs on a cluster. My
>>> problem is that my account is limited to use 4 machines (I need more
>>> machines to process data). I can borrow my friend's account and thus have
>>> access to another 4 machines but I am not sure whether it works.
>>> >>>>>>>
>>> >>>>>>>  --
>>> >>>>>>> Best Regards.
>>> >>>>>>> ---
>>> >>>>>>> Xing FENG
>>> >>>>>>> PhD Candidate
>>> >>>>>>> Database Research Group
>>> >>>>>>>
>>> >>>>>>> School of Computer Science and Engineering
>>> >>>>>>> University of New South Wales
>>> >>>>>>> NSW 2052, Sydney
>>> >>>>>>>
>>> >>>>>>> Phone: (+61) 413 857 288
>>> >>>>>>> _______________________________________________
>>> >>>>>>> users mailing list
>>> >>>>>>> us...@open-mpi.org
>>> >>>>>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> >>>>>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2015/04/26687.php
>>> >>>>>> _______________________________________________
>>> >>>>>> users mailing list
>>> >>>>>> us...@open-mpi.org
>>> >>>>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> >>>>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2015/04/26690.php
>>> >>>>>
>>> >>>>> --
>>> >>>>> ---------------------------------
>>> >>>>> Maxime Boissonneault
>>> >>>>> Analyste de calcul - Calcul Québec, Université Laval
>>> >>>>> Ph. D. en physique
>>> >>>>>
>>> >>>>> _______________________________________________
>>> >>>>> users mailing list
>>> >>>>> us...@open-mpi.org
>>> >>>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> >>>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2015/04/26691.php
>>> >>>> _______________________________________________
>>> >>>> users mailing list
>>> >>>> us...@open-mpi.org
>>> >>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> >>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2015/04/26692.php
>>> >>>
>>> >>>
>>> >>> --
>>> >>> ---------------------------------
>>> >>> Maxime Boissonneault
>>> >>> Analyste de calcul - Calcul Québec, Université Laval
>>> >>> Ph. D. en physique
>>> >>>
>>> >>> _______________________________________________
>>> >>> users mailing list
>>> >>> us...@open-mpi.org
>>> >>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> >>> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2015/04/26693.php
>>> >>
>>> >>
>>> >> _______________________________________________
>>> >> users mailing list
>>> >> us...@open-mpi.org
>>> >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> >> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2015/04/26694.php
>>> >>
>>> >>
>>> >>
>>> >> --
>>> >> Best Regards.
>>> >> ---
>>> >> Xing FENG
>>> >> PhD Candidate
>>> >> Database Research Group
>>> >>
>>> >> School of Computer Science and Engineering
>>> >> University of New South Wales
>>> >> NSW 2052, Sydney
>>> >>
>>> >> Phone: (+61) 413 857 288
>>> >> _______________________________________________
>>> >> users mailing list
>>> >> us...@open-mpi.org
>>> >> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> >> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2015/04/26707.php
>>> >
>>> > _______________________________________________
>>> > users mailing list
>>> > us...@open-mpi.org
>>> > Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> > Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2015/04/26708.php
>>>
>>>
>>> --
>>> Jeff Squyres
>>> jsquy...@cisco.com
>>> For corporate legal information go to:
>>> http://www.cisco.com/web/about/doing_business/legal/cri/
>>>
>>> _______________________________________________
>>> users mailing list
>>> us...@open-mpi.org
>>> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
>>> Link to this post:
>>> http://www.open-mpi.org/community/lists/users/2015/04/26729.php
>>
>>
>>
>>
>> --
>> Best Regards.
>> ---
>> Xing FENG
>> PhD Candidate
>> Database Research Group
>>
>> School of Computer Science and Engineering
>> University of New South Wales
>> NSW 2052, Sydney
>>
>> Phone: (+61) 413 857 288
>>
>
>
>
> --
> Best Regards.
> ---
> Xing FENG
> PhD Candidate
> Database Research Group
>
> School of Computer Science and Engineering
> University of New South Wales
> NSW 2052, Sydney
>
> Phone: (+61) 413 857 288
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post:
> http://www.open-mpi.org/community/lists/users/2015/05/26831.php
>
>
>
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post:
> http://www.open-mpi.org/community/lists/users/2015/05/26832.php
>



-- 
Best Regards.
---
Xing FENG
PhD Candidate
Database Research Group

School of Computer Science and Engineering
University of New South Wales
NSW 2052, Sydney

Phone: (+61) 413 857 288

Reply via email to