[OMPI users] crashing on MPI_SEND -- program exited with code 021, when ~1200 cores

2010-02-01 Thread Timothy G Thompson
Hello, I've been using openmpi (version 1.3.2) for some time, but recently have had greater than 1000 cores available. My code runs fine with 1000 cores but fails when attempting to use 1200 cores. The only information at the time of the crash is: . Within the debugger I know the crash is occ

[OMPI users] Cluster Configuration Issue???

2010-02-01 Thread Lee Manko
This is my first attempt at configuring a Beowulf cluster running MPI. ALL of the nodes are PS3s running Yellow Dog Linux 6.2 and the host (server) is a Dell i686 Quad-core running Fedora Core 12. Thanks to a couple of members on this forum (in a previous question), I learned that I needed to dow

Re: [OMPI users] Checkpoint/Restart error

2010-02-01 Thread Josh Hursey
Thanks for the bug report. There are a couple of places in the code that, in a sense, hard code '/tmp' as the temporary directory. It shouldn't be to hard to fix since there is a common function used in the code to discovery the 'true' temporary directory (which defaults to /tmp). Of course

Re: [OMPI users] [ompi-1.4.1] compiling without openib, running with openib + ompi141 and gcc3

2010-02-01 Thread Mathieu Gontier
Thank you for all these detailled information. My configuration options are: ./configure --prefix=$PREFIX --enable-static --enable-shared --enable-cxx-exceptions --enable-mpi-f77 --disable-mpi-f90 --disable-mpi-cxx --disable-mpi-cxx-seek --enable-mpi-profile --enable-binaries --enable-mpi-th