These errors indicate a mismatching between the components and the
mpi library. Please remove the installation directory and do a make
install again in the open mpi directory. Somewhere around the
revision 9100 we remove some of the components (teg included). This
error say that there is still a .so for the removed component in your
installation directory.
Thanks,
george.
On Mar 7, 2006, at 3:23 PM, Michael Kluskens wrote:
Per the mpi_comm_spawn issues with the 1.0.x releases I started using
1.1r9212, with my sample code I'm getting a messages of
[-:13327] mca: base: component_find: unable to open: dlopen(/usr/
local/lib/openmpi/mca_pml_teg.so, 9): Symbol not found:
_mca_ptl_base_recv_request_t_class
Referenced from: /usr/local/lib/openmpi/mca_pml_teg.so
Expected in: flat namespace
(ignored)
...
[-:13323] [0,0,0] ORTE_ERROR_LOG: GPR data corruption in file base/
soh_base_get_proc_soh.c at line 100
[-:13323] [0,0,0] ORTE_ERROR_LOG: GPR data corruption in file base/
oob_base_xcast.c at line 108
[-:13323] [0,0,0] ORTE_ERROR_LOG: GPR data corruption in file base/
rmgr_base_stage_gate.c at line 276
[-:13323] [0,0,0] ORTE_ERROR_LOG: GPR data corruption in file base/
soh_base_get_proc_soh.c at line 100
[-:13323] [0,0,0] ORTE_ERROR_LOG: GPR data corruption in file base/
oob_base_xcast.c at line 108
[-:13323] [0,0,0] ORTE_ERROR_LOG: GPR data corruption in file base/
rmgr_base_stage_gate.c at line 276
OS X 10.4.5 with g95 from current fink install for FC & F77. Running
on a single machine and launching a single spawned subprocess as a
test case for now. Also on Debian Sarge on Operton built using "./
configure --with-gnu-ld F77=pgf77 FFLAGS=-fastsse FC=pgf90 FCFLAGS=-
fastsse" with PG 6.1.
Are these diagnostic messages of errors in OpenMPI 1.1r9212 or
related to errors in my test code?
Is this information helpful for development purposes?
Michael.
On Mar 4, 2006, at 9:29 AM, Jeff Squyres wrote:
Michael --
Sorry for the delay in replying.
Many thanks for your report! You are exactly right -- our types are
wrong and will not match in the F90 bindings. I have committed a fix
to the trunk for this (it involved changing some types in mpif.h and
adding another interface function for MPI_COMM_SPAWN_MULTIPLE so that
MPI_ARGVS_NULL could be unambiguously matched).
It was also just pointed out to us on this list the other day that we
are missing all the places where MPI choice buffers could be of type
CHARACTER (e.g., MPI_SEND). We're working on fixing that -- it's
just a bunch of menial labor to fix.
I'm hesitant to put these fixes in the 1.0.x series simply because
we're trying to finish that series and advance towards 1.1. Would
you be amenable to using a 1.1.x snapshot? My commit should show up
in any 1.1 snapshot >= r9198.
On Mar 1, 2006, at 12:30 PM, Michael Kluskens wrote:
On Mar 1, 2006, at 9:56 AM, George Bosilca wrote:
Now I look into this problem more and your right it's a missing
interface. Somehow, it didn't get compiled.
From "openmpi-1.0.1/ompi/mpi/f90/mpi-f90-interfaces.h" the
interface
says:
subroutine MPI_Comm_spawn(command, argv, maxprocs, info, root, &
comm, intercomm, array_of_errcodes, ierr)
use mpi_kinds
character(len=*), intent(in) :: command
character(len=*), dimension(*), intent(in) :: argv
integer, intent(in) :: maxprocs
integer, intent(in) :: info
integer, intent(in) :: root
integer, intent(in) :: comm
integer, intent(out) :: intercomm
integer, dimension(*), intent(out) :: array_of_errcodes
integer, intent(out) :: ierr
end subroutine MPI_Comm_spawn
My call is (mostly from the Using MPI-2 book):
call MPI_Comm_spawn('subprocess', MPI_ARGV_NULL, universe_size-1,
MPI_INFO_NULL, 0, &
MPI_COMM_WORLD, slavecomm, MPI_ERRCODES_IGNORE, ierr )
looking at "mpif.h" included by mpi_kinds.f90:
double complex MPI_ARGV_NULL
integer MPI_INFO_NULL
integer MPI_COMM_WORLD
double complex MPI_ERRCODES_IGNORE
What I don't understand how the "double complex" MPI_ARGV_NULL could
work with the "character(len=*), dimension(*), intent(in) :: argv"
interface or how the "double complex" MPI_ERRCODES_IGNORE could work
with the "integer, dimension(*), intent(out) :: array_of_errcodes"
interface.
I have the following for my variables:
integer :: ierr,slavecomm
integer (kind=MPI_ADDRESS_KIND) :: universe_size
My usage of MPI_ADDRESS_KIND and MPI_Comm_spawn is based on pages
236 and 244 of "Using MPI-2"
I'd to resolve the specific error involving the f90 interfaces so I
can continue to "USE MPI" in order to check my interface errors
quickly as I move forward on my project.
Michael
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
--
{+} Jeff Squyres
{+} The Open MPI Project
{+} http://www.open-mpi.org/
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
"Half of what I say is meaningless; but I say it so that the other
half may reach you"
Kahlil Gibran