Hi,

On 16.08.2016 at 22:59, Chester Langin wrote:

> Hello,
> 
> We implemented a new 40-unit HPCC in October 2015 and have been using the SGE 
> scheduler with the default configuration.  The faculty have been submitting 
> jobs to the scheduler without specifying a queue, and the queue defaults to 
> all.q.  I would like to add more queues and force the faculty to specify one 
> of the new queues.  The new queues are gpu.q with just the GPU nodes, hiram.q 
> with just the high-RAM nodes, and standard.q for the remaining nodes.  By 
> using @gpu_hosts, @hiram_hosts, and @standard_hosts, I have this running on a 
> development cluster.  I am ok with the faculty using the honor system in 
> determining which queue to use, so long as they do not use all.q.

Instead of specifying a queue, it would be better to request resources and let 
SGE choose an appropriate queue for the job. E.g. for a GPU you could define a 
boolean complex, and users who need a GPU would then request it with something 
like "-l gpu". If you don't want non-GPU jobs to run on such a machine, the 
complex can also be defined as "forced" (see `man complex`).
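
A minimal sketch of what this could look like, assuming a boolean complex 
named "gpu" and the gpu.q queue from the original question. The job script 
name job.sh is hypothetical, and the exact column values should be checked 
against `man complex` for your installation:

```shell
# Line added to the table in the editor opened by `qconf -mc`
#name  shortcut  type  relop  requestable  consumable  default  urgency
gpu    gpu       BOOL  ==     FORCED       NO          0        0

# Attach the complex to the GPU queue (it could also be attached per exec host)
qconf -aattr queue complex_values gpu=TRUE gpu.q

# A job that needs a GPU requests the resource instead of naming a queue
qsub -l gpu job.sh
```

With the complex marked FORCED, jobs that do not request "-l gpu" should not 
be scheduled into gpu.q at all.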

BTW: all.q is nothing special, it's only created for convenience. In case you 
don't need all.q any longer, you can delete it without harm.


> The problem is that if a faculty member does not specify a queue, then all.q 
> is used by the scheduler, and a node becomes over-subscribed.

Why? Do you have more than one queue per node now? Did you define more slots 
than installed cores?

-- Reuti


>  What is the best/easiest way to prevent this from happening?
> 
> --
> Chet Langin
> SIU
> 
> _______________________________________________
> users mailing list
> users@gridengine.org
> https://gridengine.org/mailman/listinfo/users

