[OMPI users] OpenMPI+PGI errors

2008-05-20 Thread Jim Kusznir
Hello all: I've got a user on our ROCKS 4.3 cluster that's having some strange errors. I have other users using the cluster without any such errors reported, but this user also runs this code on other clusters without any problems, so I'm not really sure where the problem lies. They are getting

[OMPI users] ORTE_ERROR_LOG Timeout

2008-05-20 Thread Rudd, James
I have been trying to compile a molecular dynamics program with the Openmpi 1.2.5 included in OFED 1.3. I am running Fedora Core 6; the output of uname -r is 2.6.18-1.2798.fc6. I've traced the problems I've been having back to openmpi because I'm unable to run the test programs such as glob on

Re: [OMPI users] "Sorry! You were supposed to get help about..."

2008-05-20 Thread Alex L.
"rpm -q -f ..." tells me that the files came from RedHat's openmpi-1.2.3-1 RPM package which is distributed with this RHEL 4 U6 release. Of course i can (and will) report this to RedHat, and waite for their bug fix. But for the fast way: without moving any files through the filesystem is there

Re: [OMPI users] Memory manager

2008-05-20 Thread Brian Barrett
Terry - Would you be willing to do an experiment with the memory allocator? There are two values we change to try to make IB run faster (at the cost of corner cases you're hitting). I'm not sure one is strictly necessary, and I'm concerned that it's the one causing problems. If you don

Re: [OMPI users] Memory manager

2008-05-20 Thread Gleb Natapov
On Tue, May 20, 2008 at 12:17:02PM +1000, Terry Frankcombe wrote: > To tell you all what noone wanted to tell me, yes, it does seem to be > the memory manager. Compiling everything with > --with-memory-manager=none returns the vmem use to the more reasonable > ~100MB per process (down from >8GB).