Re: [O-MPI users] Question about support for finding MPI processes from a tool

2005-08-05 Thread David Daniel

On Aug 5, 2005, at 7:05 AM, Jeff Squyres wrote:


On Aug 5, 2005, at 8:43 AM, Jim Galarowicz wrote:



We are in the process of implementing a portion of the Etnus MPI
interface to attach to all the processes of a MPI job.  The interface
does automatic process
acquisition via the MPIR_Breakpoint based method that is described in
the Finding Processes section of this URL:
http://www-unix.mcs.anl.gov/mpi/mpi-debug/

We were wondering if your project will use this interface or if you
are supplying an alternative method.



Yes, Open MPI will support the Etnus (MPIR_Breakpoint) interface to
automatically attach to parallel jobs.


We will also consider supporting other interfaces... if publicly  
documented.


David
--
David Daniel 
Advanced Computing Laboratory, LANL, MS-B287, Los Alamos NM 87545, USA




Re: [OMPI users] Building OMPI-1.0.2 on OS X v10.3.9 with IBM XLC +XLF

2006-04-10 Thread David Daniel

Perhaps this is a bug in xlc++.  Maybe this one...

http://www-1.ibm.com/support/docview.wss?uid=swg1IY78555

My (untested) guess is that removing the const_cast will allow it to  
compile, i.e. in ompi/mpi/cxx/group_inln.h replace

const_cast(ranges)
by
ranges

David


On Apr 10, 2006, at 12:17 PM, Warner Yuen wrote:

I'm running Mac OS X v 10.3.9 Panther and tried to get OpenMPI to  
compile with IBM XLC and XLF. The compilation failed, any ideas  
what might be going wrong? I used the following settings:


export CC=/opt/ibmcmp/vacpp/6.0/bin/xlc
export CXX=/opt/ibmcmp/vacpp/6.0/bin/xlc++
export CFLAGS="-O3"
export CXXFLAGS="-O3"
export FFLAGS="-O3"
./configure --with-gm=/opt/gm --prefix=/home/warner/mpi_src/ompi102

ranlib .libs/libmpi_c_mpi.a
creating libmpi_c_mpi.la
(cd .libs && rm -f libmpi_c_mpi.la && ln -s ../libmpi_c_mpi.la  
libmpi_c_mpi.la)

Making all in cxx
source='mpicxx.cc' object='mpicxx.lo' libtool=yes \
DEPDIR=.deps depmode=none /bin/sh ../../.././config/depcomp \
/bin/sh ../../../libtool --tag=CXX --mode=compile /opt/ibmcmp/vacpp/ 
6.0/bin/xlc++ -DHAVE_CONFIG_H -I. -I. -I../../../include -I../../../ 
include   -I../../../include -I../../.. -I../../.. -I../../../ 
include -I../../../opal -I../../../orte -I../../../ompi  - 
D_REENTRANT  -DNDEBUG -O3  -c -o mpicxx.lo mpicxx.cc

mkdir .libs
/opt/ibmcmp/vacpp/6.0/bin/xlc++ -DHAVE_CONFIG_H -I. -I. -I../../../ 
include -I../../../include -I../../../include -I../../.. -I../../..  
-I../../../include -I../../../opal -I../../../orte -I../../../ompi - 
D_REENTRANT -DNDEBUG -O3 -c mpicxx.cc  -qnocommon -DPIC -o .libs/ 
mpicxx.o
"../../../ompi/mpi/cxx/group_inln.h", line 100.66: 1540-0216 (S) An  
expression of type "const int [][3]" cannot be converted to type  
"int (*)[3]".
"../../../ompi/mpi/cxx/group_inln.h", line 108.66: 1540-0216 (S) An  
expression of type "const int [][3]" cannot be converted to type  
"int (*)[3]".

make[3]: *** [mpicxx.lo] Error 1
make[2]: *** [all-recursive] Error 1
make[1]: *** [all-recursive] Error 1
make: *** [all-recursive] Error 1


-Thanks and have an OpenMPI day!

Warner Yuen
Apple Computer
email: wy...@apple.com
Tel: 408.718.2859
Fax: 408.715.0133


___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users




Re: [OMPI users] Building 32-bit OpenMPI package for 64-bit Opteron platform

2006-04-11 Thread David Daniel
I suspect that to get this to work for bproc, then we will have to  
build mpirun as 64-bit and the library as 32-bit.  That's because a  
32-bit compiled mpirun calls functions in the 32-bit /usr/lib/ 
libbroc.so which don't appear to function when the system is booted  
64-bit.


Of course that would mean we need heterogeneous support to run on a  
single homogeneous system!  Will this work on the 1.0 branch?


An alternative worth thinking about is to bypass the library calls  
and start processes using a system() call to invoke the bpsh  
command.  This is a 64-bit executable linked with /usr/lib64/ 
libbproc.so and which successfully launches both 32- and 64-bit  
executables.


I'm currently trying to solve the same issue for LA-MPI :(

David


On Apr 10, 2006, at 9:18 AM, Brian Barrett wrote:


On Apr 10, 2006, at 11:07 AM, David Gunter wrote:


(flashc 105%) mpiexec -n 4 ./send4
[flashc.lanl.gov:09921] mca: base: component_find: unable to open: /
lib/libc.so.6: version `GLIBC_2.3.4' not found (required by /net/
scratch1/dog/flash64/openmpi/openmpi-1.0.2-32b/lib/openmpi/
mca_paffinity_linux.so) (ignored)
[flashc.lanl.gov:09921] mca: base: component_find: unable to open:
libbproc.so.4: cannot open shared object file: No such file or
directory (ignored)
[flashc.lanl.gov:09921] mca: base: component_find: unable to open:
libbproc.so.4: cannot open shared object file: No such file or
directory (ignored)
[flashc.lanl.gov:09921] mca: base: component_find: unable to open:
libbproc.so.4: cannot open shared object file: No such file or
directory (ignored)
[flashc.lanl.gov:09921] mca: base: component_find: unable to open:
libbproc.so.4: cannot open shared object file: No such file or
directory (ignored)
[flashc.lanl.gov:09921] mca: base: component_find: unable to open:
libbproc.so.4: cannot open shared object file: No such file or
directory (ignored)
mpiexec: relocation error: /net/scratch1/dog/flash64/openmpi/
openmpi-1.0.2-32b/lib/openmpi/mca_soh_bproc.so: undefined symbol:
bproc_nodelist

The problem now looks like /lib/libc.so.6 is not longer available.
Indeed, it is available on the compiler nodes but it cannot be found
on the backend nodes - whoops!


Well, that's interesting.  Is this on a bproc platform?  If so, you
might be best off configuring with either --enable-static or --
disable-dlopen.  Either one will prevent components from loading,
which doesn't seem to always work well.

Also, it looks like at least one of the components has a different
libc its linked against than the others.  This makes me think that
perhaps you have some old components from a previous build in your
tree.  You might want to completely remove your installation prefix
(or lib/openmpi in your installation prefix) and run make install  
again.


Are you no longer seeing the errors about epoll?


The other problem is that the -m32 flag didn't make it into mpicc for
some reason.


This is expected behavior.  There are going to be more and more cases
where Open MPI provides one wrapper compiler that does the right
thing whether the user passes in -m32 / -m64 (or any of the vendor
options for doing the same thing).  So it will be increasingly
impossible for us to know what to add (but as Jeff said, you can tell
configure to always add -m32 to the wrapper compilers if you want).

Brian
___
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users