Re: [OMPI users] Proper way to throw an error to all nodes?

2008-06-03 Thread David Singleton
This is exactly what MPI_Abort is for. David Terry Frankcombe wrote: Calling MPI_Finalize in a single process won't ever do what you want. You need to get all the processes to call MPI_Finalize for the end to be graceful. What you need to do is have some sort of special message to tell everyo

Re: [OMPI users] Proper way to throw an error to all nodes?

2008-06-03 Thread Terry Frankcombe
Calling MPI_Finalize in a single process won't ever do what you want. You need to get all the processes to call MPI_Finalize for the end to be graceful. What you need to do is have some sort of special message to tell everyone to die. In my codes I have a rather dynamic master-slave model with fl

Re: [OMPI users] File download sizes

2008-06-03 Thread Jeff Squyres
Oops; I goofed in some of the PHP when adding the videos to the site. Fixed now; thanks for the heads-up. On May 30, 2008, at 10:57 AM, Ashley Pittman wrote: I notice on the download page all file sizes are listed as 0KB, this is presumably an error somewhere. http://www.open-mpi.org/s

Re: [OMPI users] Open MPI instructional videos

2008-06-03 Thread Jeff Squyres
On May 30, 2008, at 9:55 AM, Andreas Schäfer wrote: I've never really dig into Open MPI's guts, not because I wasn't interested, but mainly because the time required to get my bearings seemed just too much. Until now. I've watched a couple of the videos while coding and it was pretty awesome. Ea

[OMPI users] OpenMPI scaling > 512 cores

2008-06-03 Thread Scott Shaw
Hi, I hope this is the right forum for my questions. I am running into a problem when scaling >512 cores on a infiniband cluster which has 14,336 cores. I am new to openmpi and trying to figure out the right -mca options to pass to avoid the "mca_oob_tcp_peer_complete_connect: connection failed:"

[OMPI users] Proper way to throw an error to all nodes?

2008-06-03 Thread 8mj6tc902
So I'm working on this program which has many ways it might possibly die at runtime, but one of them that happens frequently is the user types a wrong (non-existant) filename on the command prompt. As it is now, the node looking for the file notices the file doesn't exist and tries to terminate the