Thanks Lloyd, Ralph . . regarding Ralph's comment,
>I don't understand the comment about printing and recompiling. Usually,
people just have the app
>write its intermediate results to a file, and provide a cmd line option ..
right, I shouldn't have written compile. It probably wouldn't increase t
state to start
again.
I seem to remember a simpler approach (check point restart?) in which the
state of the .exe
code is saved and then simply restarted from its current position.
Is there something like this for restarting an mpi program?
Thanks, Erik
--
Erik Nelson
Howard Hughes Medical
yep, runs well now.
On Sat, Feb 16, 2013 at 6:50 AM, Jeff Squyres (jsquyres) wrote:
> Glad you got it working!
>
> On Feb 15, 2013, at 6:53 PM, Erik Nelson wrote:
>
> > I may have deleted any responses to this message. In either case, we
> appear to have fixed the proble
I may have deleted any responses to this message. In either case, we appear
to have fixed the problem
by installing a more current version of openmpi.
On Thu, Feb 14, 2013 at 2:27 PM, Erik Nelson wrote:
>
> I'm encountering an error using qsub that none of us can figure out. MPI
>
qsub for the
"hello world" program.
These don't look like MPI errors, but rather some conflict with, maybe,
secure communication
accross nodes.
Is there something simple I can do to fix this?
Thanks,
Erik Nelson
Howard Hughes Medical Institute
6001 Forest Park Blvd., Room ND10.124
system around to manage the
> resources.
>
> -- Reuti
>
> > I believe SGE doesn't do that - and so the allocation won't include the
> submit host, in which case you don't need -nolocal.
> >
> >
> > On Jul 26, 2012, at 5:58 PM, Erik Nelso
you don't need -nolocal.
>
>
> On Jul 26, 2012, at 5:58 PM, Erik Nelson wrote:
>
> I was under the impression that the -nolocal option keeps processes off
> the submit
> host (since there may be hundreds or thousands of jobs submitted at any
> time,
> and we don
s, I
automatically send all processes (parent and child, or master and slave if
you
prefer) to the specified list of hosts.
Reading your email below, it looks like this was the correct understanding.
On Thu, Jul 26, 2012 at 5:20 PM, Reuti wrote:
> Am 26.07.2012 um 23:58 schrieb Erik Nelson:
>
s the idea?
Erik
On Thu, Jul 26, 2012 at 4:48 PM, Reuti wrote:
> Am 26.07.2012 um 23:33 schrieb Erik Nelson:
>
> > I have a purely parallel job that runs ~100 processes. Each process has
> ~identical
> > overhead so the speed of the program is dominated by the slowest
>
mpirun -np 101 -nolocal ./executable
I would like to restrict the job to nodes compute-5-1 to compute-5-32 on
our machine,
each containing 8 cpu's (slots). How do I go about this?
Thanks, Erik
--
Erik Nelson
Howard Hughes Medical Institute
6001 Forest Park Blvd., Room ND10.124
Da
10 matches
Mail list logo