Re: [OMPI users] Problems with IPoIB and Openib

2017-05-29 Thread Gilles Gouaillardet
Allan, note that you do not have to use the *-ib hostnames in your host_file. These are only used for SSH, so since oob/tcp is running on your Ethernet network, I guess you really want to use the sm3 and sm4 hostnames. Did you also run the same netcat test in the other direction? Do you run 'mpi

Re: [OMPI users] Problems with IPoIB and Openib

2017-05-28 Thread Gilles Gouaillardet
Allan, the "No route to host" error indicates that something is going wrong with IPoIB on your cluster (Open MPI is not involved whatsoever in that). On sm3 and sm4, you can run
/sbin/ifconfig
brctl show
iptables -L
iptables -t nat -L
We might be able to figure out what is going wro

Re: [OMPI users] Problems with IPoIB and Openib

2017-05-27 Thread gilles
Allan, about IPoIB, the error message (no route to host) is very puzzling. Did you double-check that IPoIB is OK between all nodes? This error message suggests IPoIB is not working between sm3 and sm4, which could be caused by the subnet manager or a firewall. ping is the first tool you should use to
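
As a rough illustration of the ping check mentioned above, here is a minimal sketch. The helper names check_reach and check_all are hypothetical, and the -c and -W options assume the Linux iputils version of ping.

```shell
#!/bin/sh
# Hypothetical helper: report whether one host answers a single ping.
check_reach() {
    # -c 1: send one echo request; -W 2: wait up to 2 seconds for a reply
    # (option meanings as in Linux iputils ping; an assumption about the platform)
    if ping -c 1 -W 2 "$1" >/dev/null 2>&1; then
        echo "$1: reachable"
    else
        echo "$1: NOT reachable"
        return 1
    fi
}

# Hypothetical helper: check every host given as an argument.
# Returns non-zero if at least one host did not answer.
check_all() {
    status=0
    for host in "$@"; do
        check_reach "$host" || status=1
    done
    return "$status"
}
```

After sourcing the file, something like `check_all sm3-ib sm4-ib` could be run on both sm3 and sm4 so that each direction is covered. The names sm3-ib and sm4-ib are assumed here from the *-ib naming pattern; substitute the actual IPoIB hostnames or addresses.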