Re: [OMPI users] Large TCP cluster timeout issue

2011-09-21 Thread Henderson, Brent
: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Tuesday, September 20, 2011 6:55 PM To: Open MPI Users Subject: Re: [OMPI users] Large TCP cluster timeout issue Truly am sorry about that - we were just talking today about the need to update an

Re: [OMPI users] Large TCP cluster timeout issue

2011-09-20 Thread Ralph Castain
e. Anyway, if I get access to another large TCP > cluster, I’ll give it a try. > > Thanks, > > brent > > From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On > Behalf Of Ralph Castain > Sent: Tuesday, September 20, 2011 4:15 PM > To: Open MPI U

Re: [OMPI users] Large TCP cluster timeout issue

2011-09-20 Thread Henderson, Brent
ther large TCP cluster, I'll give it a try. Thanks, brent From: users-boun...@open-mpi.org [mailto:users-boun...@open-mpi.org] On Behalf Of Ralph Castain Sent: Tuesday, September 20, 2011 4:15 PM To: Open MPI Users Subject: Re: [OMPI users] Large TCP cluster timeout issue Hmmmper

Re: [OMPI users] Large TCP cluster timeout issue

2011-09-20 Thread Ralph Castain
Hmmmperhaps you didn't notice the mpi_preconnect_all option? It does precisely what you described - it pushes zero-byte messages around a ring to force all the connections open at MPI_Init. On Sep 20, 2011, at 3:06 PM, Henderson, Brent wrote: > I recently had access to a 200+ node Magny Co

[OMPI users] Large TCP cluster timeout issue

2011-09-20 Thread Henderson, Brent
I recently had access to a 200+ node Magny Cours (24 ranks/host) 10G Linux cluster. I was able to use OpenMPI v1.5.4 with hello world, IMB and HPCC, but there were a couple of issues along the way. After setting some system tunables up a little bit on all of the nodes a hello_world program wor