Re: [OMPI users] Error message related to infiniband

2014-01-20 Thread Syed Ahsan Ali
My email was a mixture of error messages and warnings. The IB card on compute-01-10 shows as faulty in ibstatus. On the other nodes, including compute-01-15, ibstat reports the status of two ports, so those cards are dual-port. The firewall is not a problem, I am sure about it. How can I check for a bad Ethernet port? I

Re: [OMPI users] Error message related to infiniband

2014-01-19 Thread Gustavo Correa
Is your IB card in compute-01-10.private.dns.zone working? Did you check it with ibstat? Do you have a dual port IB card in compute-01-15.private.dns.zone? Did you connect both ports to the same switch on the same subnet? TCP "no route to host": If it is not a firewall problem, could it be bad Ether

Re: [OMPI users] Error message related to infiniband

2014-01-19 Thread Syed Ahsan Ali
I agree with you and am still struggling with the subnet ID settings because I couldn't find the /var/cache/opensm/opensm.opts file. Secondly, if OMPI is going for TCP then it should be able to find the hosts, since the compute nodes are reachable via ping and ssh. On Sun, Jan 19, 2014 at 9:38 PM, Ralph Castain wrote: > If

Re: [OMPI users] Error message related to infiniband

2014-01-19 Thread Ralph Castain
If OMPI finds infiniband support on the node, it will attempt to use it. In this case, it would appear you have an incorrectly configured IB adaptor on the node, so you get the additional warning about that fact. OMPI then falls back to look for another transport, in this case TCP. However, the
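The transport selection described here can be pinned explicitly rather than left to the fallback. Below is a minimal sketch, assuming an Open MPI release of that era in which the InfiniBand BTL is named `openib`; the helper name `build_mpirun_cmd` is hypothetical, and the helper only prints the command line (a dry run) rather than launching anything. The process count, hostfile, and binary are taken from the original post.

```shell
#!/bin/sh
# Dry-run helper: prints an mpirun command line instead of executing it.
# build_mpirun_cmd is a hypothetical name for illustration, not part of Open MPI.
build_mpirun_cmd() {
    transport="$1"   # "tcp" or "ib"
    case "$transport" in
        tcp) btl="tcp,self" ;;      # TCP between processes, "self" for loopback
        ib)  btl="openib,self" ;;   # assumes the openib BTL exists in this release
        *)   echo "unknown transport: $transport" >&2; return 1 ;;
    esac
    # --mca btl <list> restricts Open MPI to the listed byte-transfer layers.
    echo "mpirun --mca btl $btl -np 72 -hostfile hostlist ../bin/regcmMPI regcm.in"
}

build_mpirun_cmd tcp
```

Restricting the list to `tcp,self` keeps Open MPI from probing the misconfigured adapter at all, which separates the InfiniBand warning from any TCP routing problem.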

[OMPI users] Error message related to infiniband

2014-01-19 Thread Syed Ahsan Ali
Dear All, I am getting InfiniBand errors while running mpirun applications on the cluster. I get these errors even when I don't include InfiniBand usage flags in the mpirun command. Please guide. mpirun -np 72 -hostfile hostlist ../bin/regcmMPI regcm.in