On 3/12/2019 11:43 AM, Reuti wrote:
Now a follow-up question. I implemented this rule to ensure that no single user
takes more than 90% of our available slots:
{
name limit90percent
description NONE
enabled TRUE
limit users {*} to slots=536
}
(Our cluster has a total of 596 slots.) This worked fine until someone tried to
submit a parallel environment job with the -pe option. On 16 out of our 24
nodes, it still worked. But if they sent a job hard-queued to one of the upper
nodes 17–24, it would never run, with this in the scheduling info:
What was the submission command? A plain '-q upper'? There was/is an issue where, for a
hostgroup named @upper, you have to specify '-q "*@@upper"' instead. Alternatively, you can
set up a dedicated PE for the upper nodes only and request that PE (i.e., in the queue
configuration, "pe_list …,[@upper=upper]").
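A minimal sketch of how the two suggestions above might look on the command line. It only prints the commands instead of submitting them. The hostgroup @upper and a PE named "upper" are assumptions taken from the example in the reply; "threads" and anyscript.sh are the names used elsewhere in this thread.

```shell
#!/bin/sh
# Sketch only: prints the commands rather than calling qsub.
# Assumed names: hostgroup @upper, PEs "threads" and "upper", anyscript.sh.

# Variant 1: address every queue instance on hosts in @upper.
# The quotes keep the shell from expanding the '*'.
cmd_q='qsub -q "*@@upper" -pe threads 1 anyscript.sh'

# Variant 2: request a dedicated PE (assumed to be named "upper") that the
# queue configuration attaches only to the @upper hosts via pe_list.
cmd_pe='qsub -pe upper 1 anyscript.sh'

printf '%s\n' "$cmd_q" "$cmd_pe"
```

Whether either variant avoids the problem depends on the Grid Engine version and on the rest of the configuration.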
The following will fail:
    qsub -l vf=1G \
         -l h="mynode17|mynode18|mynode19|mynode20|mynode21|mynode22|mynode23|mynode24" \
         -pe threads 1 anyscript.sh
It'll work if you remove "-pe threads 1".
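To compare the two cases side by side, something like the following dry-run sketch could be used. It builds the host expression for nodes 17 to 24 and prints both submissions without calling qsub. All names and resource requests are the ones from the command above; nothing here is specific to a particular Grid Engine version.

```shell
#!/bin/sh
# Dry run: prints the two submissions rather than calling qsub.
# Host names, the vf=1G request, the "threads" PE and anyscript.sh
# are taken from the command quoted in this message.

# Build the host expression mynode17|...|mynode24.
hosts=""
for n in 17 18 19 20 21 22 23 24; do
    hosts="${hosts:+$hosts|}mynode$n"
done

# Reported above to never run while the 536-slot rule is enabled.
with_pe="qsub -l vf=1G -l h=\"$hosts\" -pe threads 1 anyscript.sh"
# Reported above to work.
without_pe="qsub -l vf=1G -l h=\"$hosts\" anyscript.sh"

printf '%s\n' "$with_pe" "$without_pe"
```

Pasting the printed lines into a shell on a submit host, once with the rule enabled and once with it disabled, would help confirm that the resource quota is what blocks the PE job.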