On Sep 17, 2008, at 5:49 AM, Paul Kapinos wrote:

But the setting of the environtemt variable OPAL_PREFIX to an appropriate value (assuming PATH and LD_LIBRARY_PATH are setted too) is not enough to let the OpenMPI rock&roll from the new lokation.
Hmm.  It should be.

(update) it works with "truly" OpenMPI, but it works *not* with SUN Cluster Tools 8.0 (which is also an OpenMPI). So, it seems be an SUN problem and not general problem of openMPI. Sorry for false relating the problem.

Ah, gotcha. I guess my Sun colleagues on this list will need to address that. ;-)

The only trouble we have now are the error messages like

--------------------------------------------------------------------------
Sorry!  You were supposed to get help about:
   no hca params found
from the file:
   help-mpi-btl-openib.txt
But I couldn't find any file matching that name.  Sorry!
--------------------------------------------------------------------------

(the job still runs without problems! :o)

if running openmpi from new location, and the old location being removed. (if the old location being also persistense there is no error, so it seems to be an attempt to access to an file on old path).

Doh; that's weird.

Maybe we have to explicitly pass the OPAL_PREFIX environment variable to all processes?

Hmm. I don't need to do this in my 1.2.7 installation. I do something like this (I assume you're using rsh/ssh as a launcher?):

# OMPI installed to /home/jsquyres/bogus, then mv'ed to /home/jsquyres/ bogus/foo
tcsh% set path = (/home/jsquyres/bogus/foo/bin $path)
tcsh% setenv LD_LIBRARY_PATH /home/jsquyres/bogus/foo/lib: $LD_LIBRARY_PATH
tcsh% setenv OPAL_PREFIX /home/jsquyres/bogus/foo
tcsh% mpirun --hostfile whatever hostname
...works fine
tcsh% mpicc ring.c -o ring
tcsh% mpirun --hostfile whatever --mca btl openib,self ring
...works fine

Is this different for you?

Note one of configure files contained in Sun ClusterMPI 8.0 (see attached file). The paths are really hard-coded in instead of usage of variables; this makes the package really not relocable without parsing the configure files.

Did you (or anyone reading this message) have any contact to SUN developers to point to this circumstance? *Why* do them use hard- coded paths? :o)

I don't know -- this sounds like an issue with the Sun CT 8 build process. It could also be a by-product of using the combined 32/64 feature...? I haven't used that in forever and I don't remember the restrictions. Terry/Rolf -- can you comment?

--
Jeff Squyres
Cisco Systems

Reply via email to