Hi Prakash
I can't really test this solution as the Torque dynamic host allocator
appears to be something you are adding to that system (so it isn't part of
the released code). However, the attached code should cleanly add any nodes
to any existing allocation known to OpenRTE.
I hope to resume wo
On Apr 2, 2007, at 12:53 PM, Prakash Velayutham wrote:
prakash@wins04:~/thesis/CS/Samples>mpirun -np 4 --bynode --hostfile
machinefile ./parallel.laplace
[wins01:17699] *** An error occurred in MPI_Comm_spawn
[wins01:17699] *** on communicator MPI_COMM_WORLD
[wins01:17699] *** MPI_ERR_ARG: in
Thanks Ralph. I will wait for your Torque dynamic host addition solution.
Prakash
>>> r...@lanl.gov 04/02/07 1:00 PM >>>
Hi Prakash
This is telling you that you have an error in the comm_spawn command itself.
I am no expert there, so I'll have to let someone else identify it for you.
There are
Hi Prakash
This is telling you that you have an error in the comm_spawn command itself.
I am no expert there, so I'll have to let someone else identify it for you.
There are no limits to launching on nodes in a hostfile - they are all
automatically considered "allocated" when the file is read. If
Hello,
Thanks for the patch. I still do not know the internals of Open MPI, so I can't
test this right away. But here is another test I ran, and it failed too.
I have now removed Torque from the equation. I am NOT requesting nodes through
Torque. I SSH to a compute node and start up the code as
No offense, but I would definitely advise against that path. There are
other, much simpler solutions to dynamically add hosts.
We *do* allow dynamic allocation changes - you just have to know how to do
them. Nobody asked before... ;-) Future variations will include an even
simpler, single API sol
Ralph Castain wrote:
> The runtime underneath Open MPI (called OpenRTE) will not allow you to spawn
> processes on nodes outside of your allocation. This is for several reasons,
> but primarily because (a) we only know about the nodes that were allocated,
> so we have no idea how to spawn a proc
Thanks for the info, Ralph. It is as I thought, but I was hoping it wouldn't
be that way.
I am requesting more nodes from the resource manager from inside of my
application code using the RM's API. When I know they are available
(allocated by the RM), I am trying to split the application data across
the
The runtime underneath Open MPI (called OpenRTE) will not allow you to spawn
processes on nodes outside of your allocation. This is for several reasons,
but primarily because (a) we only know about the nodes that were allocated,
so we have no idea how to spawn a process anywhere else, and (b) most