I need to make that error string google-able -- I'll add it to the FAQ. :-)

The problem is likely that you have multiple IP addresses, some of which are not routable to each other (and therefore violate OMPI's routability assumptions). Check out these FAQ entries:

http://www.open-mpi.org/faq/?category=tcp#tcp-routability
http://www.open-mpi.org/faq/?category=tcp#tcp-selection
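
As a quick sanity check on that diagnosis, you can decode the errno value from the quoted error. The following is only a minimal Python sketch and is not part of Open MPI; describe_errno is a hypothetical helper name. Errno numbers are platform-specific, so run it on the node that printed the error. On most Linux systems 113 is EHOSTUNREACH ("No route to host"), which would be consistent with a routability problem.

```python
import errno
import os


def describe_errno(code: int) -> str:
    """Return a readable description of an errno value on this platform."""
    # errno.errorcode maps numeric values to symbolic names for the local OS;
    # fall back to "UNKNOWN" if this platform does not define the value.
    name = errno.errorcode.get(code, "UNKNOWN")
    # os.strerror returns the local C library's message for the value.
    return f"errno={code} ({name}): {os.strerror(code)}"


# 113 is the value reported in the connect() failure quoted below.
print(describe_errno(113))
```

If the name that comes back is not EHOSTUNREACH on your system, the routability explanation may not apply and the FAQ entries above are less likely to be the whole story.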

Does this help?


On Apr 19, 2007, at 11:07 AM, Babu Bhai wrote:

I have migrated from LAM/MPI to OpenMPI. I am not able to execute simple MPI code in which the master sends an integer to the slave. If I execute the code on a single machine, i.e., start 2 instances on the same machine (mpirun -np 2 hello), it works fine.

If I execute it on the cluster using mpirun --prefix /usr/local -np 2 --host 199.63.34.154,199.63.34.36 hello, it gives the following error: "btl_tcp_endpoint.c:572:mca_btl_tcp_endpoint_complete_connect] connect() failed with errno=113"

I am using openmpi-1.2

regards,
Abhishek


--
Jeff Squyres
Cisco Systems
