[OMPI users] rcu_sched stalls on CPU

2013-02-25 Thread Simon DeDeo
I have been having some trouble tracing the source of a CPU stall with Open MPI on Gentoo. My code is very simple: each process does a Monte Carlo run, saves some data to disk, and sends back a single MPI_DOUBLE to node zero, which picks the best value from all the computations (including the o

Re: [OMPI users] MPI_Abort under slurm

2013-02-25 Thread Ralph Castain
On Feb 25, 2013, at 10:38 AM, Bokassa wrote: > Hi, I noticed that MPI_Abort() does not abort the tasks if the MPI program is started using srun. I call MPI_Abort() from rank 0; this process exits, but the other ranks keep running or waiting for IO on the other nodes. The only way

[OMPI users] MPI_Abort under slurm

2013-02-25 Thread Bokassa
Hi, I noticed that MPI_Abort() does not abort the tasks if the MPI program is started using srun. I call MPI_Abort() from rank 0; this process exits, but the other ranks keep running or waiting for IO on the other nodes. The only way to kill the job is to use scancel. However, if I use mpirun unde

Re: [OMPI users] uDAPL status

2013-02-25 Thread Dan Beatty
Greetings Jeff, Sounds like another case of the "No Bucks - No Buck Rogers" story. As much as we want Open Source and Open Standards to solve problems, it still comes down to a matter of funding. At least, with Open Standards and Open Source combined, if one company goes belly up the product can be c

Re: [OMPI users] MPI send recv confusion

2013-02-25 Thread Jeff Squyres (jsquyres)
In general, though, MPI says that they are different types. You should use MPI_INTEGER in Fortran, just to be more correct/safe. Sent from my phone. No type good. On Feb 21, 2013, at 10:56 AM, "Pradeep Jha" <prad...@ccs.engg.nagoya-u.ac.jp> wrote: 2013/2/21 Gus Correa <g...@ldeo.