Re: [slurm-users] Multifactor priority configuration

2020-01-22 Thread Killian Murphy
Thanks for the replies - they will help me think about configuration changes.
On Wed, 22 Jan 2020 at 14:05, Loris Bennett wrote:
> Hi,
>
> We have
>
> PriorityDecayHalfLife=7-0
> PriorityMaxAge=7-0
>
> which are the defaults.
>
> I don't quite understand the point below about the busyness of

Re: [slurm-users] sbatch sending the working directory from the controller to the node

2020-01-22 Thread Christopher Samuel
On 1/21/20 11:27 AM, Dean Schulze wrote:
> The sbatch docs say nothing about why the node gets the pwd from the controller. Why would slurm send a directory to a node that may not exist on the node and expect it to use it?
That's a pretty standard expectation from a cluster, that the filesyste

[slurm-users] Spoofing a GPU on a slurm node virtual machine

2020-01-22 Thread Dean Schulze
I'm trying to spoof a GPU on a CentOS 7.7 virtual machine that is a Slurm node. I just want Slurm to see that this node has a GPU. I'm not going to execute any code that uses a GPU. I created a character device with:
mknod nvidia0 c 1 1
Here's what it looks like:
[root@liqidos-dean-node1 dev]#

[slurm-users] Preemption within same QOS

2020-01-22 Thread Relu Patrascu
We're having a bit of a problem setting up Slurm to achieve this:
1. Two QOSs, 'high' and 'normal'.
2. Preemption type: requeue.
3. Any job has a guarantee of running 60 minutes before being preempted.
4. Any job submitted with --qos=high can preempt jobs with --qos=normal if no resources availabl

Re: [slurm-users] SLURM starts new job before CG finishes

2020-01-22 Thread Lyn Gerner
James, you might take a look at CompleteWait and KillWait.
Regards, Lyn
On Fri, Jan 3, 2020 at 12:27 PM Erwin, James wrote:
> Hello,
>
> I’ve recently updated a cluster to SLURM 19.05.4 and notice that new jobs are starting on nodes still in the CG state. In an epilog I am running node

Re: [slurm-users] Multifactor priority configuration

2020-01-22 Thread Loris Bennett
Hi,
We have
PriorityDecayHalfLife=7-0
PriorityMaxAge=7-0
which are the defaults.
I don't quite understand the point below about the busyness of the cluster. If the cluster isn't busy, the jobs won't need to compete and jobs belonging to users with zero shares will still start. For me the

[slurm-users] Build datawarp plugin on non-Cray machine

2020-01-22 Thread Taras Shapovalov
Hey guys, Do you know if there is a way to build Slurm with the datawarp plugin on a regular RHEL7 machine without a Cray environment (without DataWarp installed)? Best regards, Taras

Re: [slurm-users] Multifactor priority configuration

2020-01-22 Thread Hadrian Djohari
Hi Killian,
We chose to penalize users only a little for their previous busy jobs, so we chose short turnarounds.
PriorityDecayHalfLife=1-0
PriorityMaxAge=4-0
The busier the cluster, the longer these parameters should be, so that a user's previous jobs restrict their "future" ones more. These s
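As a rough illustration of the trade-off discussed in this thread, the Python sketch below compares how quickly past usage stops counting under the two half-life settings quoted above (7-0 and 1-0), and how fast a waiting job's age factor saturates under the two PriorityMaxAge values (7-0 and 4-0). The function names are hypothetical and not part of Slurm. The model is deliberately simplified: it assumes plain exponential half-life decay and a linear age factor capped at 1.0, and it only parses the days-hours time form used in these messages. Slurm itself applies the decay in periodic steps and combines the age factor with other weighted factors.

```python
def parse_days_hours(value: str) -> int:
    """Convert a 'days-hours' setting such as '7-0' into hours.

    Only the days-hours form seen in this thread is handled (assumption).
    """
    days, hours = value.split("-")
    return int(days) * 24 + int(hours)


def usage_weight(age_hours: float, half_life_hours: float) -> float:
    """Fraction of past usage that still counts after age_hours.

    Simplified model: the weight halves once per half-life.
    """
    return 0.5 ** (age_hours / half_life_hours)


def age_factor(wait_hours: float, max_age_hours: float) -> float:
    """Age factor of a queued job: grows linearly, capped at 1.0."""
    return min(wait_hours / max_age_hours, 1.0)


if __name__ == "__main__":
    four_days = 4 * 24
    for half_life in ("7-0", "1-0"):
        weight = usage_weight(four_days, parse_days_hours(half_life))
        print(f"PriorityDecayHalfLife={half_life}: "
              f"usage from 4 days ago counts at {weight:.3f}")
    for max_age in ("7-0", "4-0"):
        factor = age_factor(four_days, parse_days_hours(max_age))
        print(f"PriorityMaxAge={max_age}: "
              f"age factor after waiting 4 days is {factor:.3f}")
```

Under this model a shorter half-life forgives heavy users sooner, while a shorter maximum age lets waiting jobs reach their full age contribution earlier.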

[slurm-users] Implementation of generic plugin

2020-01-22 Thread subodhp
Dear all, I wish to have stage-in and stage-out functionality for my burst buffer. For this I have used the burst buffer generic plugin provided by Slurm. But I am a bit confused about how to add this plugin to my Slurm configuration file and call it at the time of job scheduling. Regards, Subodh Pandey -

[slurm-users] Multifactor priority configuration

2020-01-22 Thread Killian Murphy
Hi all.
I’m interested to learn what people are using for the following configuration items:
- PriorityDecayHalfLife
- PriorityMaxAge
and why they have chosen to set these as they have. I believe we haven’t got these set quite right on our cluster (3-0 for both items), and some understandi