Re: [OMPI users] problems with MPI_Waitsome/MPI_Allstart and OpenMPI on gigabit and IB networks

2008-07-20 Thread Joe Landman
update 2: (its like I am talking to myself ... :) must start using decaf ...) Joe Landman wrote: Joe Landman wrote: [...] ok, fixed this. Turns out we have ipoib going, and one adapter needed to be brought down and back up. Now the tcp version appears to be running, though I do get the

Re: [OMPI users] problems with MPI_Waitsome/MPI_Allstart and OpenMPI on gigabit and IB networks

2008-07-20 Thread Joe Landman
Joe Landman wrote: 3) using btl to turn off sm and openib, generates lots of these messages: [c1-8][0,1,4][btl_tcp_endpoint.c:572:mca_btl_tcp_endpoint_complete_connect] connect() failed with errno=113 [...] No route to host at -e line 1. This is wrong, all the nodes are visible from all

[OMPI users] problems with MPI_Waitsome/MPI_Allstart and OpenMPI on gigabit and IB networks

2008-07-20 Thread Joe Landman
Hi folks: This is a deeper dive into the code that was giving me fits over the last two weeks. It uses MPI_Waitsome and MPI_Allstart to launch/monitor progress. More on that in a moment. The testing I have done to date on this platform suggests that OpenMPI is working fine, though I