On May 18, 2007, at 5:01 PM, Steven Truong wrote:

So my shell might have exited when it detect that I ran
non-interactively.   But then again, how this parameter
MCA pls: parameter "pls_rsh_agent" (current value: "ssh :rsh")
affect my outcome?

It means that OMPI is going to first look for ssh, and if it doesn't find it, it will use rsh.

How am I going to set PATH and LD_LIBRARY_PATH to
be like those in .bash_profile in my Torque job files?

Err... I don't know. It depends on what is set in your Torque job files and how it is set.

Normally, Torque jobs copy the environment from where you invoked qsub. So if you have PATH / LD_LIBRARY_PATH set properly in the shell where you invoke qsub, it should automatically propagate out to where your script runs.

Could you give me some tips here?

[struong@neptune ~]$ cat .bash_profile | grep -v ^#

According to the FAQ entry that I cited to you, you need to set the PATH / LD_LIBRARY_PATH in .bashrc, not .bash_profile.

http://www.open-mpi.org/faq/?category=running#adding-ompi-to-path

[struong@neptune ~]$  ssh node07 which orted
which: no orted in (/usr/kerberos/bin:/usr/local/bin:/bin:/usr/bin)

If you're not setting the PATH / LD_LIBRARY_PATH in your .bashrc, this is to be expected.

[struong@neptune ~]$ /usr/local/openmpi-1.2.1/bin/mpirun --host node07
node07 hostname

Note that you listed "node07" twice. Hence, mpirun thought that the 2nd "node07" was your executable name, resulting in the predictable error:

---------------------------------------------------------------------- --
Failed to find the following executable:

Host:       node07.nanostellar.com
Executable: node07

Cannot continue.
---------------------------------------------------------------------- --

Because there is no executable named "node07".

--
Jeff Squyres
Cisco Systems

Reply via email to