Re: [OMPI users] Open MPI error when using MPI_Comm_spawn

2007-04-04 Thread Ralph H Castain
Hi Prakash I can't really test this solution as the Torque dynamic host allocator appears to be something you are adding to that system (so it isn't part of the released code). However, the attached code should cleanly add any nodes to any existing allocation known to OpenRTE. I hope to resume wo

Re: [OMPI users] Open MPI error when using MPI_Comm_spawn

2007-04-03 Thread Jeff Squyres
On Apr 2, 2007, at 12:53 PM, Prakash Velayutham wrote: prakash@wins04:~/thesis/CS/Samples>mpirun -np 4 --bynode --hostfile machinefile ./parallel.laplace [wins01:17699] *** An error occurred in MPI_Comm_spawn [wins01:17699] *** on communicator MPI_COMM_WORLD [wins01:17699] *** MPI_ERR_ARG: in

Re: [OMPI users] Open MPI error when using MPI_Comm_spawn

2007-04-02 Thread Prakash Velayutham
Thanks Ralph. I will wait for your Torque dynamic host addition solution. Prakash >>> r...@lanl.gov 04/02/07 1:00 PM >>> Hi Prakash This is telling you that you have an error in the comm_spawn command itself. I am no expert there, so I'll have to let someone else identify it for you. There are

Re: [OMPI users] Open MPI error when using MPI_Comm_spawn

2007-04-02 Thread Ralph Castain
Hi Prakash This is telling you that you have an error in the comm_spawn command itself. I am no expert there, so I'll have to let someone else identify it for you. There are no limits to launching on nodes in a hostfile - they are all automatically considered "allocated" when the file is read. If

Re: [OMPI users] Open MPI error when using MPI_Comm_spawn

2007-04-02 Thread Prakash Velayutham
Hello, Thanks for the patch. I still do not know the internals of Open MPI, so can't test this right away. But here is another test I ran and that failed too. I have now removed Torque from the equation. I am NOT requesting nodes through Torque. I SSH to a compute node and start up the code as

Re: [OMPI users] Open MPI error when using MPI_Comm_spawn

2007-04-02 Thread Ralph Castain
No offense, but I would definitely advise against that path. There are other, much simpler solutions to dynamically add hosts. We *do* allow dynamic allocation changes - you just have to know how to do them. Nobody asked before... ;-) Future variations will include an even simpler, single API sol

Re: [OMPI users] Open MPI error when using MPI_Comm_spawn

2007-04-02 Thread Jeremy Buisson
Ralph Castain a écrit : > The runtime underneath Open MPI (called OpenRTE) will not allow you to spawn > processes on nodes outside of your allocation. This is for several reasons, > but primarily because (a) we only know about the nodes that were allocated, > so we have no idea how to spawn a proc

Re: [OMPI users] Open MPI error when using MPI_Comm_spawn

2007-04-02 Thread Prakash Velayutham
Thanks for the info, Ralph. It is as I thought, but was hoping wouldn't be that way. I am requesting more nodes from the resource manager from inside of my application code using the RM's API. when I know they are available (allocated by the RM), I am trying to split the application data across the

Re: [OMPI users] Open MPI error when using MPI_Comm_spawn

2007-04-02 Thread Ralph Castain
The runtime underneath Open MPI (called OpenRTE) will not allow you to spawn processes on nodes outside of your allocation. This is for several reasons, but primarily because (a) we only know about the nodes that were allocated, so we have no idea how to spawn a process anywhere else, and (b) most