On May 18, 2007, at 5:01 PM, Steven Truong wrote:
So my shell might have exited when it detected that I was running
non-interactively. But then again, how does this parameter
MCA pls: parameter "pls_rsh_agent" (current value: "ssh :rsh")
affect my outcome?
It means that OMPI is going to first look for ssh, and if it doesn't
find it, it will use rsh.
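
To make the lookup order concrete, here is a minimal bash sketch of a "first agent found wins" search over a colon-separated list. It only illustrates the behavior described above and is not Open MPI's actual code; the function name pick_agent is hypothetical.

```shell
#!/bin/bash
# Hypothetical helper (not part of Open MPI): print the first agent from a
# colon-separated list that can be found in the current PATH.
pick_agent() {
    local list="$1" agent
    local IFS=':'                       # split the list on colons only
    for agent in $list; do
        agent="${agent//[[:space:]]/}"  # drop the spaces around each entry
        [ -n "$agent" ] || continue     # skip empty entries
        if command -v "$agent" >/dev/null 2>&1; then
            echo "$agent"
            return 0
        fi
    done
    return 1                            # nothing in the list was found
}

pick_agent "ssh : rsh" || echo "no launch agent found"
```

If ssh is installed, ssh is chosen and rsh is never tried. The value can be overridden per run with mpirun's generic MCA syntax, e.g. "--mca pls_rsh_agent rsh".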
How am I going to set PATH and LD_LIBRARY_PATH to
be like those in .bash_profile in my Torque job files?
Err... I don't know. It depends on what is set in your Torque job
files and how it is set.
Normally, Torque jobs copy the environment from where you invoked
qsub. So if you have PATH / LD_LIBRARY_PATH set properly in the
shell where you invoke qsub, it should automatically propagate out to
where your script runs.
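
One way to check what actually reaches the job is to print the environment from inside the job script. The following is a sketch only: the function name report_env is hypothetical, and the "#PBS -V" directive (which asks qsub to export the submitting shell's environment to the job) is included on the assumption that your site does not already do this by default.

```shell
#!/bin/bash
#PBS -V
# Hypothetical diagnostic for a Torque job script: show the variables that
# Open MPI depends on and whether orted can be found with them.
report_env() {
    echo "PATH=$PATH"
    echo "LD_LIBRARY_PATH=${LD_LIBRARY_PATH:-<unset>}"
    if command -v orted >/dev/null 2>&1; then
        echo "orted: $(command -v orted)"
    else
        echo "orted: not found in PATH"
    fi
}

report_env
```

Comparing this output with the same function run in the shell where you invoke qsub shows whether the environment was propagated.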
Could you give me some tips here?
[struong@neptune ~]$ cat .bash_profile | grep -v ^#
According to the FAQ entry that I cited to you, you need to set the
PATH / LD_LIBRARY_PATH in .bashrc, not .bash_profile.
http://www.open-mpi.org/faq/?category=running#adding-ompi-to-path
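
As a rough sketch of what such a .bashrc fragment could look like: the install prefix below is taken from the mpirun path used later in this mail, the lib subdirectory is an assumption about your installation layout, and prepend_once is a hypothetical helper. The lines need to come before any "exit if not interactive" test in .bashrc, otherwise non-interactive ssh logins never reach them.

```shell
# ~/.bashrc fragment (bash). Place it above any early return for
# non-interactive shells.

# Hypothetical helper: prepend a directory to a colon-separated variable,
# unless the directory is already listed there.
prepend_once() {
    local var="$1" dir="$2"
    local cur="${!var}"                 # current value of the named variable
    case ":$cur:" in
        *":$dir:"*) ;;                  # already present, leave unchanged
        *) printf -v "$var" '%s' "$dir${cur:+:$cur}" ;;
    esac
}

OMPI_PREFIX=/usr/local/openmpi-1.2.1    # assumed install prefix
prepend_once PATH "$OMPI_PREFIX/bin"
prepend_once LD_LIBRARY_PATH "$OMPI_PREFIX/lib"   # assumed library directory
export PATH LD_LIBRARY_PATH
```

The "already present" check keeps PATH from growing each time a nested shell sources the file.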
[struong@neptune ~]$ ssh node07 which orted
which: no orted in (/usr/kerberos/bin:/usr/local/bin:/bin:/usr/bin)
If you're not setting the PATH / LD_LIBRARY_PATH in your .bashrc,
this is to be expected.
[struong@neptune ~]$ /usr/local/openmpi-1.2.1/bin/mpirun --host node07 node07 hostname
Note that you listed "node07" twice. Hence, mpirun thought that the
2nd "node07" was your executable name, resulting in the predictable
error:
------------------------------------------------------------------------
Failed to find the following executable:
Host: node07.nanostellar.com
Executable: node07
Cannot continue.
------------------------------------------------------------------------
Because there is no executable named "node07".
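
The split can be illustrated with a deliberately simplified model of the command line. This is not Open MPI's real parser, and split_args is a hypothetical name. It assumes only that --host consumes exactly one value and that the first remaining token is taken as the executable.

```shell
#!/bin/bash
# Simplified model of how the command line is divided (illustration only).
split_args() {
    local hosts=""
    while [ $# -gt 0 ]; do
        case "$1" in
            --host)
                [ $# -ge 2 ] || return 1   # --host needs a value
                hosts="$2"
                shift 2
                ;;
            *) break ;;                    # first non-option token ends parsing
        esac
    done
    echo "hosts=$hosts executable=${1:-} args=${*:2}"
}

split_args --host node07 node07 hostname
# prints: hosts=node07 executable=node07 args=hostname
```

Dropping the duplicate, as in "--host node07 hostname", leaves hostname as the executable.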
--
Jeff Squyres
Cisco Systems