Do your different subnets violate the assumptions listed here?
http://www.open-mpi.org/faq/?category=tcp#tcp-routability
We have not implemented any workarounds to say "subnet X is routable
to subnet Y" because no one had asked for them. Such workarounds are
possible, of course, but I don't know what kind of timeframe we would
be able to implement them in. Contributions would always be
accepted! :-)
Probably the easiest workaround would be a top-level MCA parameter
that effectively tells OMPI to assume that *all* TCP addresses are
routable to each other. That might not be too difficult to implement.
On Apr 3, 2007, at 4:11 PM, Xie, Hugh wrote:
Hi,
I got the follow error message while running: 'mpirun -v -np 2
-machinefile hosts.txt testc.x'
Process 0.1.1 is unable to reach 0.1.0 for MPI communication.
If you specified the use of a BTL component, you may have
forgotten a component (such as "self") in the list of
usable components.
----------------------------------------------------------------------
--
--
----------------------------------------------------------------------
--
--
Process 0.1.0 is unable to reach 0.1.1 for MPI communication.
If you specified the use of a BTL component, you may have
forgotten a component (such as "self") in the list of
usable components.
----------------------------------------------------------------------
--
--
----------------------------------------------------------------------
--
--
It looks like MPI_INIT failed for some reason; your parallel
process is
likely to abort. There are many reasons that a parallel process can
fail during MPI_INIT; some of which are due to configuration or
environment
problems. This failure appears to be an internal failure; here's some
additional information (which may only be relevant to an Open MPI
developer):
PML add procs failed
--> Returned "Unreachable" (-12) instead of "Success" (0)
----------------------------------------------------------------------
--
--
The same commands works if the content in hosts.txt is in same subnet.
Once I switch to hosts in different subnet, it stop working. I am
using
ompi 1.2.
Please help.
- - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - - -
- - - - - - - - - -
This message is intended only for the personal and confidential use
of the designated recipient(s) named above. If you are not the
intended recipient of this message you are hereby notified that any
review, dissemination, distribution or copying of this message is
strictly prohibited. This communication is for information
purposes only and should not be regarded as an offer to sell or as
a solicitation of an offer to buy any financial product, an
official confirmation of any transaction, or as an official
statement of Lehman Brothers. Email transmission cannot be
guaranteed to be secure or error-free. Therefore, we do not
represent that this information is complete or accurate and it
should not be relied upon as such. All information is subject to
change without notice.
--------
IRS Circular 230 Disclosure:
Please be advised that any discussion of U.S. tax matters contained
within this communication (including any attachments) is not
intended or written to be used and cannot be used for the purpose
of (i) avoiding U.S. tax related penalties or (ii) promoting,
marketing or recommending to another party any transaction or
matter addressed herein.
_______________________________________________
users mailing list
us...@open-mpi.org
http://www.open-mpi.org/mailman/listinfo.cgi/users
--
Jeff Squyres
Cisco Systems