Hi,

Have you checked the .bashrc or /etc/bash.bashrc file on the machine. We
have seen this problem before on both Debian and Ubuntu systems. If you find
the following line:

# If not running interactively, don't do anything
[ -z "$PS1" ] && return

It means that the remainder of the file is not processed (which may include
setting up your PATH/LD_LIBRARY_PATH) if you are connecting via a
non-interactive login. Since this command only works for non-interactive
logins the user falsely gets the idea that the PATH/LD_LIBRARY_PATH when
they log in over SSH.

We commented this line out and it worked for us, so just an idea.



Si Hammond

Performance Modelling and Analysis Team,
Department of Computer Science,
University of Warwick, UK


On 22/04/2008, Jeff Squyres <jsquy...@cisco.com> wrote:
>
> Make sure you're using the "right" mpirun.  If you have both MPICH and
> Open MPI installed, you need to install them into the different
> directories.
>
> As for not finding the "orted" on the remote nodes, it simply means
> that your PATH / LD_LIBRARY_PATH is not set properly on the remote
> nodes.  See:
>
> http://www.open-mpi.org/faq/?category=running#run-prereqs
> http://www.open-mpi.org/faq/?category=running#adding-ompi-to-path
> http://www.open-mpi.org/faq/?category=running#mpirun-prefix
>
> A few notes about your command line syntax:
>
> - "-np X" specifies how many processes to launch (where X is an
> integer).  If you don't specify this, OMPI defaults to as many slots
> are defined in your hostfile.
>
>      mpirun -np 4 ....
>
> - "--host LIST", where LIST is a comma-delimited list of hostnames to
> launch on (*instead* of specifying a hostfile).
>
>      mpirun --host host1,host2,host3,host4 ....
>
> - "--hostfile FILENAME", where FILENAME is a list of a hostfile as
> described in the FAQ.
>
>      mpirun --hostfile my_hostfile.txt ...
>
> You can't mix and match these params, like your example:
>
>
>      mpirun --host -np 2 node0,node1 /bin/hostname
>
>
>
>
> On Apr 22, 2008, at 4:40 AM, gildo....@gmx.de wrote:
>
> > Hi Jodi,
> >
> > thanks for the quick answer!
> >
> >> Have you tried the command ompi_info?
> >> What is its output?
> >
> > No, I haven't. I can try this evening, when I'm back home from work.
> >
> >>
> >> Furthermore, the command you gave
> >>>  mpirun -np 2 --hostfile /home/stephan/mpd.hosts
> >> is incomplete. There is no executable.
> >
> > Sorry, it should correctly be
> >
> > mpirun -np 2 --hostfile /home/stephan/mpd.hosts /bin/hostname
> >
> > The output shows, that the program is only executed on the local
> > machine. This is the same on each of the machines.
> >
> >>
> >> Try to make sure wether your host file has the correct
> >> format. If i remember correctly  Open-MPI does not use
> >> the same format for hostfiles as MPICH.
> >
> > Yes, thats true according to the documentation. There is a
> > possibility to define the number of slots and the max-number of
> > slots in OpenMPI, which I haven't seen in MPICH so far (at least not
> > within the hostfile).
> > But when I use a hostfile without a specification regarding the
> > slots, e.g.
> >
> > shell$ cat my-hosts
> > node0
> > node1
> >
> > (which is the same as in MPICH), it should interpret the default
> > values, i.e. slots=1 and max_slots=unlimited). That should work for
> > that simple test.
> >
> >> See the FAQ for more info
> >> http://www.open-mpi.org/faq/?category=running#mpirun-scheduling
> >
> > I also tried the way to specify the hosts in the command, e.g.
> >
> > mpirun --host -np 2 node0,node1 /bin/hostname
> >
> > but the result is the same. The program runs only on one host.
> >>
> >> If you don't use a hostfile, mpirun will start
> >> all processes on the local machine.
> >
> > Yes, that's what it obviously does. It doesn't recognise the cluster
> > partner in each direction...
> >
> > Kind Regards
> >
> > Stephan
> >
> >> On Tue, Apr 22, 2008 at 8:56 AM,  <gildo....@gmx.de> wrote:
> >>> Dear all,
> >>>
> >>> I wanted to compare MPICH and OpenMPI. MPICH works fine. So I
> >>> installed
> >> OpenMPI the same way (configure, make, make install). The commands
> >> are
> >> found in the OpenMPI installation directory.
> >>>
> >>> When I tried to run programs I was a little bit confused, that there
> >> seems not to be a default hosts-file like in MPICH. I included it
> >> in the
> >> command with "--hostfile".
> >>>
> >>> When I now want to run my first test with
> >>>
> >>>  mpirun -np 2 --hostfile /home/stephan/mpd.hosts
> >>>
> >>> I get the error-message:
> >>>
> >>>  orted: command not found
> >>>
> >>> The "orted"-executable resides as well as the "mpirun"- and
> >> "mpiexec"-executables in the directory /home/stephan/openmpi-
> >> install. "orted" is also
> >> found by "which orted".
> >>>
> >>> What might be the problem? How does "orted" work? I'm not conscious
> >> about anything equivalent in MPICH...
> >>>
> >>> Thanks in advance for your help!
> >>>
> >>> Kind Regards
> >>>
> >>> Stephan
> >
> > --
> > Psssst! Schon vom neuen GMX MultiMessenger gehört?
> > Der kann`s mit allen: http://www.gmx.net/de/go/multimessenger
> > _______________________________________________
> > users mailing list
> > us...@open-mpi.org
> > http://www.open-mpi.org/mailman/listinfo.cgi/users
>
>
>
> --
> Jeff Squyres
> Cisco Systems
>
>
>
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users
>

Reply via email to