[slurm-users] CfP for VHPC '18 - Papers due May 15 (extended) for the 13th Virtualization in High-Performance Cloud Computing Workshop

2018-04-12 Thread VHPC 18
*Please accept our apologies if you receive multiple copies of this Call for Papers CALL FOR PAPERS 13th Workshop on Virtualization in High-Performance Cloud Computing (VHPC '18) held in conjunction with the International Supercomp

Re: [slurm-users] job_submit.lua script

2018-04-12 Thread sysadmin.caos
My purpose with the job_submit.lua script is to limit a "srun" with more than one node and more than one CPU; in other words, "srun -N 1 -n 1". For this reason, in my future script I use an "if" to compare those values: function slurm_job_submit(job_desc, part_list

[slurm-users] Jobs escaping cgroup device controls after some amount of time.

2018-04-12 Thread Shawn Bobbin
Hi, We’re running slurm 17.11.5 on RHEL 7 and have been having issues with jobs escaping their cgroup controls on GPU devices. For example, we have the following steps running: # ps auxn | grep [s]lurmstepd 0 2380 0.0 0.0 538436 3700 ? Sl 07:22 0:02 slurmstepd: [46609.0]

Re: [slurm-users] job_submit.lua script

2018-04-12 Thread Bjørn-Helge Mevik
Christopher Samuel writes: > On 12/04/18 01:47, Bjørn-Helge Mevik wrote: > >> "sysadmin.caos" writes: >> >>> srun: error: slurm_job_submit: parameter error 65534 4294967294 1 >> >> 4294967294 is the special value slurm.NO_VAL, meaning the parameter >> was not specified. It is for 32 bit paramet