Make sure you're using the "right" mpirun. If you have both MPICH and Open MPI installed, you need to install them into the different directories.

As for not finding the "orted" on the remote nodes, it simply means that your PATH / LD_LIBRARY_PATH is not set properly on the remote nodes. See:

http://www.open-mpi.org/faq/?category=running#run-prereqs
http://www.open-mpi.org/faq/?category=running#adding-ompi-to-path
http://www.open-mpi.org/faq/?category=running#mpirun-prefix

A few notes about your command line syntax:

- "-np X" specifies how many processes to launch (where X is an integer). If you don't specify this, OMPI defaults to as many slots are defined in your hostfile.

    mpirun -np 4 ....

- "--host LIST", where LIST is a comma-delimited list of hostnames to launch on (*instead* of specifying a hostfile).

    mpirun --host host1,host2,host3,host4 ....

- "--hostfile FILENAME", where FILENAME is a list of a hostfile as described in the FAQ.

    mpirun --hostfile my_hostfile.txt ...

You can't mix and match these params, like your example:

    mpirun --host -np 2 node0,node1 /bin/hostname



On Apr 22, 2008, at 4:40 AM, gildo....@gmx.de wrote:

Hi Jodi,

thanks for the quick answer!

Have you tried the command ompi_info?
What is its output?

No, I haven't. I can try this evening, when I'm back home from work.


Furthermore, the command you gave
 mpirun -np 2 --hostfile /home/stephan/mpd.hosts
is incomplete. There is no executable.

Sorry, it should correctly be

mpirun -np 2 --hostfile /home/stephan/mpd.hosts /bin/hostname

The output shows, that the program is only executed on the local machine. This is the same on each of the machines.


Try to make sure wether your host file has the correct
format. If i remember correctly  Open-MPI does not use
the same format for hostfiles as MPICH.

Yes, thats true according to the documentation. There is a possibility to define the number of slots and the max-number of slots in OpenMPI, which I haven't seen in MPICH so far (at least not within the hostfile). But when I use a hostfile without a specification regarding the slots, e.g.

shell$ cat my-hosts
node0
node1

(which is the same as in MPICH), it should interpret the default values, i.e. slots=1 and max_slots=unlimited). That should work for that simple test.

See the FAQ for more info
http://www.open-mpi.org/faq/?category=running#mpirun-scheduling

I also tried the way to specify the hosts in the command, e.g.

mpirun --host -np 2 node0,node1 /bin/hostname

but the result is the same. The program runs only on one host.

If you don't use a hostfile, mpirun will start
all processes on the local machine.

Yes, that's what it obviously does. It doesn't recognise the cluster partner in each direction...

Kind Regards

Stephan

On Tue, Apr 22, 2008 at 8:56 AM,  <gildo....@gmx.de> wrote:
Dear all,

I wanted to compare MPICH and OpenMPI. MPICH works fine. So I installed
OpenMPI the same way (configure, make, make install). The commands are
found in the OpenMPI installation directory.

When I tried to run programs I was a little bit confused, that there
seems not to be a default hosts-file like in MPICH. I included it in the
command with "--hostfile".

When I now want to run my first test with

 mpirun -np 2 --hostfile /home/stephan/mpd.hosts

I get the error-message:

 orted: command not found

The "orted"-executable resides as well as the "mpirun"- and
"mpiexec"-executables in the directory /home/stephan/openmpi- install. "orted" is also
found by "which orted".

What might be the problem? How does "orted" work? I'm not conscious
about anything equivalent in MPICH...

Thanks in advance for your help!

Kind Regards

Stephan

--
Psssst! Schon vom neuen GMX MultiMessenger gehört?
Der kann`s mit allen: http://www.gmx.net/de/go/multimessenger
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users


--
Jeff Squyres
Cisco Systems


Reply via email to