Re: [OMPI users] check point restart

2013-07-19 Thread Erik Nelson
Thanks Lloyd, Ralph . . regarding Ralph's comment, >I don't understand the comment about printing and recompiling. Usually, people just have the app >write its intermediate results to a file, and provide a cmd line option .. right, I shouldn't have written compile. It probably wouldn't increase t

[OMPI users] check point restart

2013-07-19 Thread Erik Nelson
state to start again. I seem to remember a simpler approach (check point restart?) in which the state of the .exe code is saved and then simply restarted from its current position. Is there something like this for restarting an mpi program? Thanks, Erik -- Erik Nelson Howard Hughes Medical

Re: [OMPI users] qsub error

2013-02-16 Thread Erik Nelson
yep, runs well now. On Sat, Feb 16, 2013 at 6:50 AM, Jeff Squyres (jsquyres) wrote: > Glad you got it working! > > On Feb 15, 2013, at 6:53 PM, Erik Nelson wrote: > > > I may have deleted any responses to this message. In either case, we > appear to have fixed the proble

Re: [OMPI users] qsub error

2013-02-15 Thread Erik Nelson
I may have deleted any responses to this message. In either case, we appear to have fixed the problem by installing a more current version of openmpi. On Thu, Feb 14, 2013 at 2:27 PM, Erik Nelson wrote: > > I'm encountering an error using qsub that none of us can figure out. MPI >

[OMPI users] qsub error

2013-02-14 Thread Erik Nelson
qsub for the "hello world" program. These don't look like MPI errors, but rather some conflict with, maybe, secure communication across nodes. Is there something simple I can do to fix this? Thanks, Erik Nelson Howard Hughes Medical Institute 6001 Forest Park Blvd., Room ND10.124

Re: [OMPI users] restricting a job to a set of hosts

2012-07-28 Thread Erik Nelson
system around to manage the > resources. > > -- Reuti > > > I believe SGE doesn't do that - and so the allocation won't include the > submit host, in which case you don't need -nolocal. > > > > > > On Jul 26, 2012, at 5:58 PM, Erik Nelso

Re: [OMPI users] restricting a job to a set of hosts

2012-07-26 Thread Erik Nelson
you don't need -nolocal. > > > On Jul 26, 2012, at 5:58 PM, Erik Nelson wrote: > > I was under the impression that the -nolocal option keeps processes off > the submit > host (since there may be hundreds or thousands of jobs submitted at any > time, > and we don'

Re: [OMPI users] restricting a job to a set of hosts

2012-07-26 Thread Erik Nelson
s, I automatically send all processes (parent and child, or master and slave if you prefer) to the specified list of hosts. Reading your email below, it looks like this was the correct understanding. On Thu, Jul 26, 2012 at 5:20 PM, Reuti wrote: > Am 26.07.2012 um 23:58 schrieb Erik Nelson: >

Re: [OMPI users] restricting a job to a set of hosts

2012-07-26 Thread Erik Nelson
s the idea? Erik On Thu, Jul 26, 2012 at 4:48 PM, Reuti wrote: > Am 26.07.2012 um 23:33 schrieb Erik Nelson: > > > I have a purely parallel job that runs ~100 processes. Each process has > ~identical > > overhead so the speed of the program is dominated by the slowest >

[OMPI users] restricting a job to a set of hosts

2012-07-26 Thread Erik Nelson
mpirun -np 101 -nolocal ./executable I would like to restrict the job to nodes compute-5-1 to compute-5-32 on our machine, each containing 8 CPUs (slots). How do I go about this? Thanks, Erik -- Erik Nelson Howard Hughes Medical Institute 6001 Forest Park Blvd., Room ND10.124 Da
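
For what it's worth, one way this kind of restriction is often expressed with Open MPI is a hostfile listing only the allowed nodes. The sketch below is not taken from the thread's replies; it assumes mpirun is launched directly rather than through a scheduler, and the file name hosts.txt is a placeholder chosen for illustration. It writes one line per node for compute-5-1 through compute-5-32 with 8 slots each:

```shell
#!/bin/sh
# Hypothetical hostfile name (an assumption, not from the original post).
HOSTFILE=hosts.txt

# Start from an empty file, then add one "<host> slots=<n>" line per node.
: > "$HOSTFILE"
i=1
while [ "$i" -le 32 ]; do
    echo "compute-5-$i slots=8" >> "$HOSTFILE"
    i=$((i + 1))
done

# Launch against that list. Left commented out so the sketch runs
# on a machine without Open MPI installed:
# mpirun -np 101 --hostfile "$HOSTFILE" ./executable
```

With 32 nodes at 8 slots each the file describes 256 slots, enough for the 101 processes in the original command. Under a scheduler such as SGE, the host restriction would normally belong in the job submission instead, since mpirun then works from the allocation the scheduler hands it.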