• Few Additionnal Informations about my Network configuration

/opt is a share point it uses NFS

/Network/opt

is the point where /opt can be found accross the Network

I declared OPAL_PREFIX because openmpi was built with prefix /opt and runs it 
directory /Network/opt


•  If a copy the directory /opt/openmpi-1.4.4 on all my nodes

scp -r /opt/openmpi-1.4.4 root@node2:/opt/.
scp -r /opt/openmpi-1.4.4 root@node3:/opt/.
scp -r /opt/openmpi-1.4.4 root@node4:/opt/.
....

This time my program runs.

• A Question :  is OPAL_PREFIX declaration enought to use /Network/opt rather 
than /opt ?


Christophe


> The problem is that the prefix you configured with doesn't match the prefix 
> you are providing: 
> 
> configure: prefix = /opt/openmpi-1.4.4 
> 
> running: prefix = /Network/opt/openmpi-1.4.4 
> 
> The two have to match in order for the libraries to be found. 
> 
> 
> On Nov 8, 2011, at 6:01 AM, Christophe Peyret wrote: 
> 
> > Hello, 
> > 
> > 
> > I am trying to run a program on a cluster composed with Apple Xserve 
> > running 10.5.8 (Leopard). 
> > 
> > 
> > 1) I am using openmpi-1.4.4 compiled with Intel ifort and icc (V12) 
> > (/opt is a share point mounted in /Network/opt with NFS) 
> > 
> > ./configure --prefix=/opt/openmpi-1.4.4 \ 
> > F77=/Network/opt/intel/composerxe/bin/ifort F77FLAGS="-arch x86_64" \ 
> > FC=/Network/opt/intel/composerxe/bin/ifort FCFLAGS="-arch x86_64" \ 
> > CC=/Network/opt/intel/composerxe/bin/icc CFLAGS="-arch x86_64" \ 
> > CXX=/Network/opt/intel/composerxe/bin/icpc CXXFLAGS="-arch x86_64" 
> > 
> > make 
> > sudo make install 
> > 
> > 
> > Each /etc/profile of my nodes contains : 
> > 
> > export COMP_HOME=/Network/opt/intel/composerxe 
> > export PATH=$COMP_HOME/bin:$COMP_HOME/man:$PATH 
> > export DYLD_LIBRARY_PATH=$COMP_HOME/lib/:$DYLD_LIBRARY_PATH 
> > 
> > export MPI_HOME=/Network/opt/openmpi-1.4.4 
> > export OPAL_PREFIX=/Network/opt/openmpi-1.4.4 
> > 
> > export PATH=${MPI_HOME}/bin:${MPI_HOME}/man:$PATH 
> > export DYLD_LIBRARY_PATH=$MPI_HOME/lib/:$DYLD_LIBRARY_PATH 
> > export LD_LIBRARY_PATH=$MPI_HOME/lib/:$LD_LIBRARY_PATH 
> > 
> > 2) when I lauch mpirun on several nodes, the MPI connections fails and I 
> > have the error message : 
> > 
> > mpirun --prefix /Network/opt/openmpi-1.4.4/ -H node1,node2 -n 2 space64 -f 
> > Test/Euler/eulerRigid.def 
> > dyld: lazy symbol binding failed: Symbol not found: _orte_daemon 
> > Referenced from: /Network/opt/openmpi-1.4.4/bin/orted 
> > Expected in: /usr/lib/libopen-rte.0.dylib 
> > 
> > dyld: Symbol not found: _orte_daemon 
> > Referenced from: /Network/opt/openmpi-1.4.4/bin/orted 
> > Expected in: /usr/lib/libopen-rte.0.dylib 
> > 
> > bash: line 1: 2973 Trace/BPT trap /Network/opt/openmpi-1.4.4/bin/orted 
> > --daemonize -mca ess env -mca orte_ess_jobid 1644560384 -mca orte_ess_vpid 
> > 1 -mca orte_ess_num_procs 2 --hnp-uri 
> > "1644560384.0;tcp://10.0.0.1:50782;tcp://125.1.4.55:50782" 
> > -------------------------------------------------------------------------- 
> > A daemon (pid 41667) died unexpectedly with status 133 while attempting 
> > to launch so we are aborting. 
> > 
> > There may be more information reported by the environment (see above). 
> > 
> > This may be because the daemon was unable to find all the needed shared 
> > libraries on the remote node. You may set your LD_LIBRARY_PATH to have the 
> > location of the shared libraries on the remote nodes and this will 
> > automatically be forwarded to the remote nodes. 
> > -------------------------------------------------------------------------- 
> > -------------------------------------------------------------------------- 
> > mpirun noticed that the job aborted, but has no info as to the process 
> > that caused that situation. 
> > -------------------------------------------------------------------------- 
> > mpirun: clean termination accomplished 
> > 
> > 
> > 3) Does anyone have an idea ? 
> > 
> > 
> > -- 
> > Christophe Peyret 
> > ONERA - DSNA - PS3A 
> > 29 ave de la Division Leclerc 
> > F92320 Chatillon 
> > Tel. : +331 4673 4778 
> > Fax : +331 4673 4166 
> > 
> > http://www.onera.fr/dsna/couplage-methodes-aeroacoustiques 
> > 
> 
> 
--
Christophe Peyret
ONERA - DSNA - PS3A
29 ave de la Division Leclerc
F92320 Chatillon
Tel. : +331 4673 4778
Fax : +331 4673 4166

http://www.onera.fr/dsna/couplage-methodes-aeroacoustiques





Reply via email to