Hi Terry,
unfortunately I haven't got a stack trace.
OS: Mac OS X 10.4.7 Server on the Xgrid-server and Mac OS X 10.4.7
Client on every node (G4 and G5). For testing-purposes I've installed
OpenMPI 1.1 on a Dual-G4-node and on a Dual-G5-node with my Xgrid
consisting of only either the Dual-G4
Hi Eric (and all),
don't know if this really messes things up, but you have set up lam-mpi
in your path-variables, too:
[enterprise:24786] pls:rsh: reset LD_LIBRARY_PATH:
/export/lca/home/lca0/etudiants/ac38820/openmpi_sun4u/lib:/export/lca/appl/Forte/SUNWspro/WS6U2/lib:/usr/local/lib:*/usr/l
ack would help me try and determine if this is an
OpenMPI issue
or possibly some type of platform problem.
There is another thread with Eric Thibodeau that I am unsure if it is
the same issue
as either of our situation.
--td
>
>Message: 3
>Date: Wed, 28 Jun 2006 14:30:12 +0200
&
@ Terry (and All)!
Enclose you'll find a (minor) bugfix with respect to the BUS_ADRALN I've
reported recently when submitting jobs to the XGrid with OpenMPI 1.1.
The BUS_ADRALN error on SPARC systems might be caused by a similar code
segment. For the "bugfix" see line 55ff of the attached code
Hi All,
when the nodes belong to different subnets the following error messages
pop up:
[powerbook.2-net:20826] *** An error occurred in MPI_Allreduce
[powerbook.2-net:20826] *** on communicator MPI_COMM_WORLD
[powerbook.2-net:20826] *** MPI_ERR_INTERN: internal error
[powerbook.2-net:20826] **
What's the proper setup for using kerberos-single-sign-on with OpenMPI? For now
I've added the two environment variable XGRID_CONTROLLER_HOSTNAME and
XGRID_CONTROLLER_PASSWORD to submit jobs to the grid.
Yours,
Frank