Re: [OMPI users] mca_oob_tcp_peer_complete_connect: connection failed

2006-03-16 Thread Charles Wright
That works!! Thanks!! George Bosilca wrote: >Sorry I wasn't clear enough on my previous post. The error messages that >you get are comming from the OOB which is the framework we're using to >setup the MPI run. The options that you use (btl_tcp_if_include) are only >used for MPI communications.

Re: [OMPI users] mca_oob_tcp_peer_complete_connect: connection failed

2006-03-16 Thread George Bosilca
Sorry I wasn't clear enough on my previous post. The error messages that you get are comming from the OOB which is the framework we're using to setup the MPI run. The options that you use (btl_tcp_if_include) are only used for MPI communications. Please add "--mca oob_tcp_include eth0" to force

Re: [OMPI users] mca_oob_tcp_peer_complete_connect: connection failed

2006-03-16 Thread Charles Wright
Thanks for the tip. I see that both number 1 and 2 are true. Openmpi is insisting on using my eth0 (I know this by watching the firewall log on the node it is trying to go to) This is despite the fact that I have the first dns entry go to eth1, normally that is all pbs would need to do the right

Re: [OMPI users] mca_oob_tcp_peer_complete_connect: connection failed

2006-03-16 Thread George Bosilca
I see only 2 possibilities: 1. your trying to run Open MPI on nodes having multiple IP addresses. 2. your nodes are behind firewalls and Open MPI is unable to pass through. Please check the FAQ on http://www.open-mpi.org/faq/ to find out the full answer to your question. Thanks, george.

[OMPI users] mca_oob_tcp_peer_complete_connect: connection failed

2006-03-16 Thread Charles Wright
Hello, I'm just compiled open-mpi and tried to run my code which just measures bandwidth from one node to another. (Code compile fine and runs under other mpi implementations) When I did I got this. uahrcw@c275-6:~/mpi-benchmarks> cat openmpitcp.o15380 c317-6 c317-5 [c317-5:24979] [0,0,2]

Re: [OMPI users] Using Multiple Gigabit Ethernet Interface

2006-03-16 Thread Jayabrata Chakrabarty
Thanks Brian, Thanks Michael I wanted to benchmark the communcation throughput and latency using multiple using gigabit eithernet controller. So here are the results which i want share with you all I used . OpenMPI version 1.0.2a10r9275 Hpcbench Two Dell Precision 650 workstation. The Dell Pre

Re: [OMPI users] Performance of ping-pong using OpenMPI over Infiniband

2006-03-16 Thread George Bosilca
On Thu, 16 Mar 2006, Jean Latour wrote: > My questions are : > a) Is OpenMPI doing in this case TCP/IP over IB ? (I guess so) If the path to the mvapi library is correct then Open MPI will use mvapi not TCP over IB. There is a simple way to check. "ompi_info --param btl mvapi" will print all

Re: [OMPI users] Performance of ping-pong using OpenMPI over Infiniband

2006-03-16 Thread Galen M. Shipman
Hi Jean, Take a look here: http://www.open-mpi.org/faq/?category=infiniband#ib- leave-pinned This should improve performance for micro-benchmarks and some applications. Please let mw know if this doesn't solve the issue. Thanks, Galen On Mar 16, 2006, at 10:34 AM, Jean Latour wrote: Hel

[OMPI users] Performance of ping-pong using OpenMPI over Infiniband

2006-03-16 Thread Jean Latour
Hello, Testing performance of OpenMPI over Infiniband I have the following result : 1) Hardware is : SilversStorm interface 2) Openmpi version is : (from ompi_info) Open MPI: 1.0.2a9r9159 Open MPI SVN revision: r9159 Open RTE: 1.0.2a9r9159 Open RTE SVN revisi

Re: [OMPI users] Memory allocation issue with OpenIB

2006-03-16 Thread Galen M. Shipman
Emanuel, Thanks for the tip on this issue, we will be adding it to the FAQ shortly. - Galen On Mar 15, 2006, at 4:29 PM, Emanuel Ziegler wrote: Hi Davide! You are using the -prefix option. I guess this is due to the fact that You cannot set the paths appropriately. Most likely You are

Re: [OMPI users] problems with OpenMPI-1.0.1 on SunOS 5.9; problems on heterogeneous cluster

2006-03-16 Thread Ravi Manumachu
Hi Brian, I have installed OpenMPI-1.1a1r9260 on my SunOS machines. It has solved the problems. However there is one more issue that I found in my testing and that I failed to report. This concerns Linux machines too. My host file is hosts.txt - csultra06 csultra02 csultra05 csultra08