FWIW, you might not want to comment out this line, but rather move up
any OMPI setup to *before* this line (so that it is processed both
during interactive and non-interactive logins).
The check for a non/interactive login may well be there for a reason;
there may well be things past that line that are really only suitable
for interactive logins. YMMV.
On Apr 22, 2008, at 8:11 AM, Simon Hammond wrote:
Hi,
Have you checked the .bashrc or /etc/bash.bashrc file on the
machine. We have seen this problem before on both Debian and Ubuntu
systems. If you find the following line:
# If not running interactively, don't do anything
[ -z "$PS1" ] && return
It means that the remainder of the file is not processed (which may
include setting up your PATH/LD_LIBRARY_PATH) if you are connecting
via a non-interactive login. Since this command only works for non-
interactive logins the user falsely gets the idea that the PATH/
LD_LIBRARY_PATH when they log in over SSH.
We commented this line out and it worked for us, so just an idea.
Si Hammond
Performance Modelling and Analysis Team,
Department of Computer Science,
University of Warwick, UK
On 22/04/2008, Jeff Squyres <jsquy...@cisco.com> wrote: Make sure
you're using the "right" mpirun. If you have both MPICH and
Open MPI installed, you need to install them into the different
directories.
As for not finding the "orted" on the remote nodes, it simply means
that your PATH / LD_LIBRARY_PATH is not set properly on the remote
nodes. See:
http://www.open-mpi.org/faq/?category=running#run-prereqs
http://www.open-mpi.org/faq/?category=running#adding-ompi-to-path
http://www.open-mpi.org/faq/?category=running#mpirun-prefix
A few notes about your command line syntax:
- "-np X" specifies how many processes to launch (where X is an
integer). If you don't specify this, OMPI defaults to as many slots
are defined in your hostfile.
mpirun -np 4 ....
- "--host LIST", where LIST is a comma-delimited list of hostnames to
launch on (*instead* of specifying a hostfile).
mpirun --host host1,host2,host3,host4 ....
- "--hostfile FILENAME", where FILENAME is a list of a hostfile as
described in the FAQ.
mpirun --hostfile my_hostfile.txt ...
You can't mix and match these params, like your example:
mpirun --host -np 2 node0,node1 /bin/hostname
On Apr 22, 2008, at 4:40 AM, gildo....@gmx.de wrote:
> Hi Jodi,
>
> thanks for the quick answer!
>
>> Have you tried the command ompi_info?
>> What is its output?
>
> No, I haven't. I can try this evening, when I'm back home from work.
>
>>
>> Furthermore, the command you gave
>>> mpirun -np 2 --hostfile /home/stephan/mpd.hosts
>> is incomplete. There is no executable.
>
> Sorry, it should correctly be
>
> mpirun -np 2 --hostfile /home/stephan/mpd.hosts /bin/hostname
>
> The output shows, that the program is only executed on the local
> machine. This is the same on each of the machines.
>
>>
>> Try to make sure wether your host file has the correct
>> format. If i remember correctly Open-MPI does not use
>> the same format for hostfiles as MPICH.
>
> Yes, thats true according to the documentation. There is a
> possibility to define the number of slots and the max-number of
> slots in OpenMPI, which I haven't seen in MPICH so far (at least not
> within the hostfile).
> But when I use a hostfile without a specification regarding the
> slots, e.g.
>
> shell$ cat my-hosts
> node0
> node1
>
> (which is the same as in MPICH), it should interpret the default
> values, i.e. slots=1 and max_slots=unlimited). That should work for
> that simple test.
>
>> See the FAQ for more info
>> http://www.open-mpi.org/faq/?category=running#mpirun-scheduling
>
> I also tried the way to specify the hosts in the command, e.g.
>
> mpirun --host -np 2 node0,node1 /bin/hostname
>
> but the result is the same. The program runs only on one host.
>>
>> If you don't use a hostfile, mpirun will start
>> all processes on the local machine.
>
> Yes, that's what it obviously does. It doesn't recognise the cluster
> partner in each direction...
>
> Kind Regards
>
> Stephan
>
>> On Tue, Apr 22, 2008 at 8:56 AM, <gildo....@gmx.de> wrote:
>>> Dear all,
>>>
>>> I wanted to compare MPICH and OpenMPI. MPICH works fine. So I
>>> installed
>> OpenMPI the same way (configure, make, make install). The commands
>> are
>> found in the OpenMPI installation directory.
>>>
>>> When I tried to run programs I was a little bit confused, that
there
>> seems not to be a default hosts-file like in MPICH. I included it
>> in the
>> command with "--hostfile".
>>>
>>> When I now want to run my first test with
>>>
>>> mpirun -np 2 --hostfile /home/stephan/mpd.hosts
>>>
>>> I get the error-message:
>>>
>>> orted: command not found
>>>
>>> The "orted"-executable resides as well as the "mpirun"- and
>> "mpiexec"-executables in the directory /home/stephan/openmpi-
>> install. "orted" is also
>> found by "which orted".
>>>
>>> What might be the problem? How does "orted" work? I'm not
conscious
>> about anything equivalent in MPICH...
>>>
>>> Thanks in advance for your help!
>>>
>>> Kind Regards
>>>
>>> Stephan
>
> --
> Psssst! Schon vom neuen GMX MultiMessenger gehört?
> Der kann`s mit allen: http://www.gmx.net/de/go/multimessenger
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
--
Jeff Squyres
Cisco Systems
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
--
Jeff Squyres
Cisco Systems