On Mar 26, 2009, at 10:59 AM, Alessandro Surace wrote:
Hi Ralph,
what do you mean by creating/defining a direct interface?
The 3 hosts are connected over the network and have ssh public key
login enabled. Every host can see the others, but they are not all on
the same directly connected network. In more detail:
grid01 and grid04 are on the same network
grid03 is on a different network.
This is the problem. If grid03 is on a different network, then there
is no way that an MPI process on that node can directly communicate
with one on grid04 or grid01. Grid03 must have a common network
interface with each of the machines, though it can be different for
each one.
In other words, grid03 and grid01 -must- have at least one network in
common. And grid04 and grid03 must also share at least one network,
though it can be different from the one that grid03 and grid01 share.
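
To make the pairwise rule concrete, here is a minimal sketch in Python
that checks which pairs of hosts share no network. The interface
addresses are hypothetical (the real ones were not posted in this
thread); they are chosen only so that the result matches the reported
symptom. The helper names are illustrative and are not part of Open MPI.

```python
import ipaddress
from itertools import combinations

# Hypothetical interface addresses (assumption: not taken from the thread).
# grid01 has two interfaces here, so it shares a network with both others.
HOSTS = {
    "grid01": ["192.168.1.11/24", "10.0.0.11/24"],
    "grid03": ["10.0.0.13/24"],
    "grid04": ["192.168.1.14/24"],
}

def networks(cidrs):
    """Return the set of IP networks that a list of interface addresses belongs to."""
    return {ipaddress.ip_interface(c).network for c in cidrs}

def shared_networks(a, b, hosts=HOSTS):
    """Return the networks that hosts a and b have in common."""
    return networks(hosts[a]) & networks(hosts[b])

def unreachable_pairs(hosts=HOSTS):
    """Return every pair of hosts with no network in common."""
    return [
        (a, b)
        for a, b in combinations(sorted(hosts), 2)
        if not shared_networks(a, b, hosts)
    ]

if __name__ == "__main__":
    for a, b in combinations(sorted(HOSTS), 2):
        common = shared_networks(a, b)
        status = ", ".join(str(n) for n in sorted(common)) if common else "NO common network"
        print(f"{a} <-> {b}: {status}")
```

With these assumed addresses, only the grid03/grid04 pair has no common
network, which is the pair that would need a shared interface added.
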
Does that help clarify?
Ralph
Despite this difference, jobs on grid01 and grid03 run properly, as
do jobs on grid01 and grid04. But jobs that include grid03 and grid04
at the same time fail.
Alex