Open MPI uses different heuristics depending on whether IP addresses are public 
or private.

All your IP addresses are technically "public" -- they're not in 10.x.x.x or 
192.168.x.x, for example.

So Open MPI assumes that they are all routable to each other.

You might want to change your 3 networks to be 10.1.x.x/16, 10.2.x.x/16, and 
10.3.x.x/16.  See if that makes it work.


> On Sep 17, 2015, at 12:31 PM, Shang Li <shawn.li.x...@gmail.com> wrote:
> 
> Hi all,
> 
> I wanted to setup a 3-node ring network, each connects to the other 2 using 2 
> Ethernet ports directly without a switch/router.
> 
> The interface configurations could be found in the following picture.
> 
> https://www.dropbox.com/s/g75i51rrjs51b21/mpi-graph%20-%20New%20Page.png?dl=0
> 
> I've used ifconfig on each node to configure each port, and made sure I can 
> ssh from each node to the other 2 nodes.
> 
> But a simple ring_c example doesn't work... So I turn on  --mca 
> btl_base_verbose 30, I could see that node1 was trying to use 23.0.0.2  
> (linke between node2 and 3) to get to node2 though there is a direct link to 
> node 2. 
> 
> The output log is like:
> 
> [node1:01828] btl: tcp: attempting to connect() to [[19529,1],1] address 
> 23.0.0.2 on port 1024
> [[19529,1],0][btl_tcp_endpoint.c:606:mca_btl_tcp_endpoint_start_connect] from 
> node1 to: node2 Unable to connect to the peer 23.0.0.2  on port 4: Network is 
> unreachable
> 
> I've read the following posts and FAQs but still couldn't understand this 
> kind of behavior.
> 
> http://www.open-mpi.org/faq/?category=tcp#tcp-routability-1.3
> http://www.open-mpi.org/faq/?category=tcp#tcp-selection
> http://www.open-mpi.org/community/lists/users/2014/11/25810.php
> 
> 
> Any pointers would be appreciated! Thanks in advance!
> 
> My open-mpi info:
> 
>  Package: Open MPI gtbldadm@ubuntu-12 Distribution
>                 Open MPI: 1.0.0.22
>   Open MPI repo revision: git714842d
>    Open MPI release date: May 27, 2015
>                 Open RTE: 1.0.0.22
>   Open RTE repo revision: git714842d
>    Open RTE release date: May 27, 2015
>                     OPAL: 1.0.0.22
>       OPAL repo revision: git714842d
>        OPAL release date: May 27, 2015
>                  MPI API: 2.1
> 
> 
> Best,
> Shawn
> 
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> Subscription: http://www.open-mpi.org/mailman/listinfo.cgi/users
> Link to this post: 
> http://www.open-mpi.org/community/lists/users/2015/09/27612.php


-- 
Jeff Squyres
jsquy...@cisco.com
For corporate legal information go to: 
http://www.cisco.com/web/about/doing_business/legal/cri/

Reply via email to