Re: [slurm-users] MaxJobs-limits

2020-01-28 Thread zz
Hi Michael, thanks for the quick response. What if we submit multiple jobs without specifying cores or threads in them: all jobs will run in parallel depending on the CPUs available in the node, and when no resources are available the job will go to the queue as a pending job. If I have 10 CPUs in the system, if I submit

Re: [slurm-users] Virtual memory size requested by slurm

2020-01-28 Thread Mahmood Naderan
> If you want the virtual memory size to be unrestricted by slurm, set VSizeFactor to 0 in slurm.conf, which according to the documentation disables virtual memory limit enforcement.
>
> https://slurm.schedmd.com/slurm.conf.html#OPT_VSizeFactor
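As a minimal sketch of the setting described in the quoted reply, the relevant slurm.conf line might look like the following. Per the linked documentation, VSizeFactor expresses a job's virtual memory limit as a percentage of its real memory limit, and 0 disables enforcement. The comments and the placement in the file are illustrative assumptions, not part of the original message.

```conf
# slurm.conf (fragment, illustrative)
# VSizeFactor is a percentage of the job's real memory limit;
# 0 disables virtual memory limit enforcement.
VSizeFactor=0
```

After editing slurm.conf, the daemons typically need to re-read the configuration, for example via `scontrol reconfigure`; whether that alone is sufficient may depend on the Slurm version in use.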

Re: [slurm-users] Virtual memory size requested by slurm

2020-01-28 Thread Renfro, Michael
On this part, I don’t think that’s always the case. On a node with 384 GB (with 2 GB reserved for the OS), we’ve got several jobs running under mem=32000:

$ grep 'NodeName=gpunode\[00' /etc/slurm/slurm.conf
NodeName=gpunode[001-003] CoresPerSocket=14 RealMemory=382000 Sockets=2 ThreadsPe

Re: [slurm-users] MaxJobs-limits

2020-01-28 Thread Renfro, Michael
For the first question: you should be able to define each node’s core count, hyperthreading, or other details in slurm.conf. That would allow Slurm to schedule (well-behaved) tasks to each node without anything getting overloaded. For the second question about jobs that aren’t well-behaved (a jo

Re: [slurm-users] Virtual memory size requested by slurm

2020-01-28 Thread Sean Maxwell
Hi Mahmood, If you want the virtual memory size to be unrestricted by slurm, set VSizeFactor to 0 in slurm.conf, which according to the documentation disables virtual memory limit enforcement. https://slurm.schedmd.com/slurm.conf.html#OPT_VSizeFactor -Sean On Mon, Jan 27, 2020 at 11:47 PM Mahmo

[slurm-users] MaxJobs-limits

2020-01-28 Thread zz
Hi, I am testing Slurm for a small cluster. I just want to know whether there is any way I could set a max job limit per node; I have nodes with different specs running under the same QOS. Please ignore if it is a stupid question. Also, I would like to know what will happen when a process which is running