On Apr 6, 2010, at 4:59 PM, David Turner wrote:

> Hi Ralph,
> 
>> Are you using a scheduler of some kind? If so, you can add this to your 
>> default mca param file:
> 
> Yes, we are running torque/moab.
> 
>> orte_allocation_required = 1
>> This will prevent anyone running without having an allocation. You can also 
>> set
> 
> Ah.  An "allocation".  Not much info on this on the open-mpi website.
> I believe this is what we will want, to prevent mpirun on login nodes.

Yes - it was added specifically to solve similar problems we had on 
moab-scheduled clusters. The motivator was when someone forgot to get an 
allocation and ran a 256-process job - since we allow oversubscription, that 
hammered the login node into the ground.

With the above mca param set, mpirun will tell you "you need an allocation" and 
cleanly abort.

> 
>> rmaps_base_no_schedule_local = 1
>> which tells mpirun not to schedule any MPI procs on the local node.
> 
> In our batch environment, mpirun will be executing on one of the
> compute nodes.  That is, we don't have dedicated MOM nodes.
> Therefore, I think we will want to schedule (at least) one MPI
> task on the same node.  Actually, when somebody wants to run
> (for example) 256 tasks packed on 32 8-core nodes, I think we'll
> need mpirun to share a *core* with one of the MPI tasks.  The above
> option would prevent that, correct?

Yeah, you don't want to set this one for a torque environment.

> 
>> Does that solve the problem?
> 
> I'll give it a try and let you know.  Thanks!
> 
>> Ralph
>> On Apr 6, 2010, at 3:28 PM, David Turner wrote:
>>> Hi,
>>> 
>>> Our cluster has a handful of login nodes, and then a bunch of
>>> compute nodes.  OpenMPI is installed in a global file system
>>> visible from both sets of nodes.  This means users can type
>>> "mpirun" from an interactive prompt, and quickly oversubscribe
>>> the login node.
>>> 
>>> So, is there a way to explicitly exclude hosts from consideration
>>> for mpirun?  To prevent (what is usually accidental) running
>>> MPI apps on our login nodes?  Thanks!
>>> 
>>> -- 
>>> Best regards,
>>> 
>>> David Turner
>>> User Services Group        email: dptur...@lbl.gov
>>> NERSC Division             phone: (510) 486-4027
>>> Lawrence Berkeley Lab        fax: (510) 486-4316
>>> _______________________________________________
>>> users mailing list
>>> us...@open-mpi.org
>>> http://www.open-mpi.org/mailman/listinfo.cgi/users
>> _______________________________________________
>> users mailing list
>> us...@open-mpi.org
>> http://www.open-mpi.org/mailman/listinfo.cgi/users
> 
> 
> -- 
> Best regards,
> 
> David Turner
> User Services Group        email: dptur...@lbl.gov
> NERSC Division             phone: (510) 486-4027
> Lawrence Berkeley Lab        fax: (510) 486-4316
> _______________________________________________
> users mailing list
> us...@open-mpi.org
> http://www.open-mpi.org/mailman/listinfo.cgi/users


Reply via email to