[OMPI users] error running mpirun command

2019-05-03 Thread Eric F. Alemany via users
Hello everyone,

I am new to Open MPI please forgive me for my beginner mistake. I read through 
the FAQ of open-mpi.org website and built a small cluster 
(9 nodes - including a master node).
I thought i followed the instructions accordingly but i am having issue running 
a simple mpirun.

$ mpirun -np 84  - -hostfile hostsfile ./openmpi_hello.c

mpirun: Forwarding signal 20 to job
--
ORTE does not know how to route a message to the specified daemon
located on the indicated node:

  my node:   phaser-manager
  target node:  radonc-phaser01

This is usually an internal programming error that should be
reported to the developers. In the meantime, a workaround may
be to set the MCA param routed=direct on the command line or
in your environment. We apologize for the problem.
—

I dont understand the meaning of the error message. I can share more of my 
configuration files if someone would be interested in helping me.

Thank you in advance for your help.


Best,
Eric

_

Eric F.  Alemany
System Administrator for Research

IRT
Division of Radiation & Cancer  Biology
Department of Radiation Oncology

Stanford University School of Medicine
Stanford, California 94305

Tel:1-650-498-7969  No Texting
Fax:1-650-723-7382





___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users

Re: [OMPI users] error running mpirun command

2019-05-03 Thread Gilles Gouaillardet via users
Eric,

which version of Open MPI are you using ? how many hosts in your hostsfile ?

The error message suggests this could be a bug within Open MPI, and a
potential workaround for you would be to try
mpirun -np 84  - -hostfile hostsfile --mca routed direct ./openmpi_hello.c

You might also want to double check all your hosts can access each
other with TCP/IP and on all ports (e.g. no firewall should be
running)

Cheers,

Gilles


On Sat, May 4, 2019 at 9:41 AM Eric F. Alemany via users
 wrote:
>
> Hello everyone,
>
> I am new to Open MPI please forgive me for my beginner mistake. I read 
> through the FAQ of open-mpi.org website and built a small cluster (9 nodes - 
> including a master node).
> I thought i followed the instructions accordingly but i am having issue 
> running a simple mpirun.
>
> $ mpirun -np 84  - -hostfile hostsfile ./openmpi_hello.c
>
> mpirun: Forwarding signal 20 to job
> --
> ORTE does not know how to route a message to the specified daemon
> located on the indicated node:
>
>   my node:   phaser-manager
>   target node:  radonc-phaser01
>
> This is usually an internal programming error that should be
> reported to the developers. In the meantime, a workaround may
> be to set the MCA param routed=direct on the command line or
> in your environment. We apologize for the problem.
> —
>
> I dont understand the meaning of the error message. I can share more of my 
> configuration files if someone would be interested in helping me.
>
> Thank you in advance for your help.
>
>
> Best,
> Eric
>
> _
>
> Eric F.  Alemany
> System Administrator for Research
>
> IRT
> Division of Radiation & Cancer  Biology
> Department of Radiation Oncology
>
> Stanford University School of Medicine
> Stanford, California 94305
>
> Tel:1-650-498-7969  No Texting
> Fax:1-650-723-7382
>
>
>
>
>
> ___
> users mailing list
> users@lists.open-mpi.org
> https://lists.open-mpi.org/mailman/listinfo/users
___
users mailing list
users@lists.open-mpi.org
https://lists.open-mpi.org/mailman/listinfo/users