Dear Lachlan,
thank you very much for your suggestions. I will try to experiment a little
bit with those settings.

Have you a nice day,
Emanuele

2017-10-10 0:02 GMT+02:00 Lachlan Musicman <data...@gmail.com>:

> On 9 October 2017 at 22:06, cyberseawolf . <ebre...@gmail.com> wrote:
>
>> Hello everybody,
>> I'm a young system administrator that is moving from Torque/MAUI to
>> Slurm. I set up a pretty peculiar resource management in the previous queue
>> system and I would like to port it in the new one.
>>
>> - I have the following two partitions that are totally independent to
>> each others (like having to separate queues):
>>
>> Part A  -->  has 24 cores per node at higher speed, 16 nodes in total;
>>
>> Part B -->  has 4 cores per node at lower speed, 11 nodes in total.
>>
>>
>> - There are two kinds of accounts (I hope that this is the right word...):
>>
>> Acc A   -->  every user can request up to 24 cores/6 nodes (i.e. 144
>> total CPUs) for all his/her jobs belonged to Part A, up to 4 cores/11 nodes
>> (i.e. 44 total CPUs) for all his/her jobs belonged to part B, all jobs have
>> very low priority;
>>
>> Acc B  -->  each user can request up to 12 cores/1 node (i.e. 12 total
>> CPUs) per each job in Part A, up to 4 cores/3 nodes (i.e. 12 total CPUs)
>> per each job in Part B, all jobs have high priority, only 10 jobs for all
>> users can be executed at the same time in Part A, only 12 jobs can be
>> queued for each user in Part A, no such limits in Part B.
>>
>>
>> - There are no time limit for all jobs.
>>
>>
>> - I did not use any database to track cluster usage in the past. If
>> needed, I would like to use a very simple one since I have no experience
>> with it.
>>
>>
>> - The purpose of this set up is to give more resources to users of Acc A
>> since they're doing a massive usage of the cluster. This being said, all
>> jobs of Acc B must be executed as soon as resources are available since
>> they are much quicker.
>>
>> Could you please suggest me which keywords I should use in slurm.conf
>> file? And what about the manual, are there any pages I have to check in
>> order to let this set up to work?
>> I would like to use the last version of Slurm to get rid of all bugs and
>> take advantage of the new features that could help me.
>>
>> Thank you very much for your kindness,
>>
>>
>
> Welcome to SLURM Emanuele
>
> You will need to track cluster usage for the prioritisation. Luckily, it's
> essentially built into slurm. Make sure that the head or master node (any
> node really, but I use master) has the slurm-slurmdbd package installed.
> This will do your accounting for you, coupled with a db (mysql works well).
>
> https://slurm.schedmd.com/accounting.html
>
> The Resource Limits page is handy for working out priorities, QoS and how
> they intersect with Partitions, Accounts, and etc:
>
> https://slurm.schedmd.com/resource_limits.html
>
> Keywords:
>
> SelectType=select/cons_res
> SelectTypeParameters=CR_CPU
> PriorityFlags=FAIR_TREE
> PriorityType=priority/multifactor
> PriorityDecayHalfLife=10
> AccountingStorageEnforce=qos,limits,safe
>
>
> It's realtively complex - as you would hope/imagine/fear. But I've found
> that by tackling one thing at a time, you can get a working system
> relatively quickly. As always, working out the syntax is the hardest part.
>
> Cheers
> L.
>
>
> ------
> "The antidote to apocalypticism is *apocalyptic civics*. Apocalyptic
> civics is the insistence that we cannot ignore the truth, nor should we
> panic about it. It is a shared consciousness that our institutions have
> failed and our ecosystem is collapsing, yet we are still here — and we are
> creative agents who can shape our destinies. Apocalyptic civics is the
> conviction that the only way out is through, and the only way through is
> together. "
>
> *Greg Bloom* @greggish https://twitter.com/greggish/
> status/873177525903609857
>

Reply via email to