Check out this FAQ entry: http://www.open-mpi.org/faq/?category=tcp#tcp-selection
Note that there are btl_tcp_if_include / btl_tcp_if_exclude: these control MPI-level communications. There's also oob_tcp_if_include / oob_tcp_if_exclude (that take the same kinds of values as btl_tcp_if_include/exclude) that control OMPI's run-time environment communications. On Oct 5, 2011, at 12:01 PM, (.-=Kiwi=-.) wrote: > "OMPI always tries to use the lowest numbered address first - just a natural > ordering." > > That doesn't seem to be the reason. We changed the private IPs to 212... (a > higher number than the public 210... IPs) and still MPI tries to go to 212 > afterwards. > > We're reading the oob_tcp and btl_tcp parameters but we're not sure how to do > it. > > "But if hello world doesn't even run, then try running with "mpirun --mca > oob_tcp_if_include <the interface(s) you want to use> ...", per Ralph's > suggestion. If *that* doesn't work, also add "--mca btl_tcp_if_include ..." > as well." > > We tried doing from Computer 1: > > orterun -mca oob_tcp_debug 1 -np 1 -host 212...3 ifconfig > > and everything was ok > > We tried doing from Computer 1: > > orterun -mca oob_tcp_debug 1 -np 1 -host 210...101 ifconfig > > and it says: > > There are no allocated resources for the application > ifconfig > that match the requested mapping: > > > Verify that you have mapped the allocated resources properly using the > --host or --hostfile specification. > -------------------------------------------------------------------------- > -------------------------------------------------------------------------- > A daemon (pid unknown) died unexpectedly on signal 1 while attempting to > launch so we are aborting. [...] > > Any other ideas? > > > On Wed, Oct 5, 2011 at 1:54 AM, Ralph Castain <rhc.open...@gmail.com> wrote: > OMPI always tries to use the lowest numbered address first - just a natural > ordering. You need to tell it to use just the public ones for this topology. > Use the oob_tcp and btl_tcp parameters to do this. See "ompi_info --param oob > tcp" and "ompi_info --param btl tcp" for the exact syntax. > > > Sent from my iPad > > On Oct 4, 2011, at 10:21 AM, "(.-=Kiwi=-.)" <heffe...@gmail.com> wrote: > >> We are constructing a set of computers with Open MPI and there's a small >> problem with mixing public and private IPs. >> >> We aren't sure about what's causing the problem or how to solve it. >> >> The files are shared thanks to NFS and we have a couple computers with >> private IPs and public IPs that we want them to send MPI work to some >> machines that have public IPs. >> >> I'm going to try to describe with example IPs. >> >> Computer 1 sees itself as eth0: 172...2 but has a public IP assigned: >> 210...2 >> Computer 2 sees itself as eth0: 172...3 but has a public IP assigned: >> 210...3 >> Computers outside the subnet directly have public IPs assigned: 210...100+ >> >> The computers outside see Computer 1 and 2 only with 210... they can't see >> the 172... internal IPs. >> >> If an outside computer launches mpirun to Computer 1, it works ok. >> If Computer 1 tries to launch mpirun to Computer 2 (with 172...) it also >> works ok (not with 210... because they don't know that that's their public >> IP, but that's not an issue). >> >> The problem comes when Computer 1 or 2 try to launch mpirun to outside >> computers. >> >> We tried to check out what was happening and installed wireshark on an >> outside computer and it seems that the ssh part works ok (the ssh talk >> between 210...2 and 210...101 is ok), but after that the outside computer >> tries to send a TCP SYN package to 172...2 instead of 210...2 and the rest >> of the packets onward the same. >> >> Is there a way to solve this problem? >> >> I've read this ( >> http://www.open-mpi.org/community/lists/users/2009/11/11184.php ) but I'm >> not really sure what he's doing there. >> >> We have the option of plugging Computer 1 and Computer 2 directly to the >> switch that the outside computers are on, but we'd rather not because we'd >> prefer the computers to stay on the private network, but if there's no other >> way, I guess we can. >> >> Can it be done without having to change the network topology? >> >> Thanks in advance. >> _______________________________________________ >> users mailing list >> us...@open-mpi.org >> http://www.open-mpi.org/mailman/listinfo.cgi/users > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users > > _______________________________________________ > users mailing list > us...@open-mpi.org > http://www.open-mpi.org/mailman/listinfo.cgi/users -- Jeff Squyres jsquy...@cisco.com For corporate legal information go to: http://www.cisco.com/web/about/doing_business/legal/cri/