Dear Patrick and all,
Finally I solved the problem.
I need to mount -t nfs the home directory of host to the node/home
And then I can run in the cluster

Thank you for your time.
Best regards
Ha Chi

On Thu, 4 Jun 2020 at 17:09, Patrick Bégou <
patrick.be...@legi.grenoble-inp.fr> wrote:

> Ha Chi,
>
> first running MPI applications as root in not a good idea. You must create
> users in your rocks cluster without admin rights for all that is not system
> management.
>
> Let me know a little more about how you launch this:
> 1) Do you run "mpirun" from the rocks frontend or from a node ?
> 2) Ok from ssh from the frontend to the node but BETWEEN 2 nodes ?
>
> Patrick
>
> Le 04/06/2020 à 10:02, Hà Chi Nguyễn Nhật a écrit :
>
> Dear Patrick,
> Thanks so much for your reply,
> Yes, we use ssh to log on the node. From the frontend, we can ssh to the
> nodes without password.
> the mpirun --version in all 3 nodes are identical, openmpi 2.1.1, and same
> place when testing with "whereis mpirun"
> So is there any problem with mpirun causing it to not launch to other
> nodes?
>
> Regards
> HaChi
>
> On Thu, 4 Jun 2020 at 14:35, Patrick Bégou via users <
> users@lists.open-mpi.org> wrote:
>
>> Hi Ha Chi
>>
>> do you use a batch scheduler with Rocks Cluster or do you log on the node
>> with ssh ?
>> If ssh, can you check  that you can ssh from one node to the other
>> without password ?
>> Ping just says the network is alive, not that you can connect.
>>
>> Patrick
>>
>> Le 04/06/2020 à 09:06, Hà Chi Nguyễn Nhật via users a écrit :
>>
>> Dear Open MPI users,
>>
>> Please help me to find the solution for the problem using mpirun with a
>> ROCK cluster, 3 nodes. I use the command:
>> mpirun -np 12 --machinefile machinefile.txt --allow-run-as-root ./wrf.exe
>> But mpirun was unable to access other nodes (as the below photo). But
>> actually I checked the connection of three nodes by command "ping node's
>> IP", they are well connected.
>> [image: 2.png]
>> My machinefile.txt includes IP of three nodes (frontend and 2 connected
>> nodes), like this:
>> 10.1.85.1 slots=4
>> 10.1.85.254 slots=4
>> 10.1.85.253 slots=4
>>
>> My cluster is built by a ROCK cluster, with 3 nodes, CPUS 8 per each node.
>> *My question is: How can I connect 3 nodes to run together?*
>>
>> Please advise
>> Thanks
>> Ha Chi
>>
>> --
>> *Ms. Nguyen Nhat Ha Chi*
>> PhD student
>> Environmental Engineering and Management
>> Asian Institute of Technology (AIT)
>> Thailand
>>
>>
>>
>
> --
> *Ms. Nguyen Nhat Ha Chi*
> PhD student
> Environmental Engineering and Management
> Asian Institute of Technology (AIT)
> Thailand
>
>
>

-- 
*Ms. Nguyen Nhat Ha Chi*
PhD student
Environmental Engineering and Management
Asian Institute of Technology (AIT)
Thailand

Reply via email to