Re: [slurm-users] Simple question but I can't find the answer

2019-01-10 Thread Marcus Wagner
Hi, there is some more information from the same manpage about nodestates: NODE STATE CODES    Node state codes are shortened as required for the field size.  These node states may be followed by a special character to identify state flags associated with the node.  The following node suff

[slurm-users] slurm, memory accounting and memory mapping

2019-01-10 Thread Sergey Koposov
Hi,  I've recently migrated to slurm from pbs on our cluster. Because of that, now the job memory limits are strictly enforced and that causes my code to get killed.  The trick is that my code uses memory mapping (i.e. mmap) of one single large file (~12 Gb) in each thread on each node. With thi

Re: [slurm-users] QoS settings in sacctmgr requires restarting slurmctld to take effect

2019-01-10 Thread Chris Samuel
On 10/1/19 6:15 pm, Jianwen Wei wrote: I am using SLURM 17.11 on CentOS 7. I find that QoS settings, say GrpNodes for an account, in sacctmgr requires restarting slurmcltd to take effect. This sounds like a configuration issue, and usually it's because slurmdbd cannot connect back to the con

[slurm-users] QoS settings in sacctmgr requires restarting slurmctld to take effect

2019-01-10 Thread Jianwen Wei
Hi, I am using SLURM 17.11 on CentOS 7. I find that QoS settings, say GrpNodes for an account, in sacctmgr requires restarting slurmcltd to take effect. Restarting slurmctld may hang slurm service for 15secs~60secs, which is annoying if QoS settings are configured several times a day dy

Re: [slurm-users] Simple question but I can't find the answer

2019-01-10 Thread Jeffrey R. Lang
Yes, I missed the mark here, yes it is after the partition. From: slurm-users On Behalf Of Andy Riebs Sent: Thursday, January 10, 2019 10:22 AM To: slurm-users@lists.schedmd.com Subject: Re: [slurm-users] Simple question but I can't find the answer ◆ This message was sent from a non-UWYO addre

Re: [slurm-users] [Slurm 18.08.4] sacct/seff Inaccurate usercpu on Job Arrays

2019-01-10 Thread Christopher Benjamin Coffey
We've attempted setting JobAcctGatherFrequency=task=0 and there is no change. We have settings: ProctrackType=proctrack/cgroup TaskPlugin=task/cgroup,task/affinity JobAcctGatherType=jobacct_gather/cgroup Odd ... wonder why we don't see it help. Here is how we verify: === #!/bin/bash #SBATCH --

Re: [slurm-users] Larger jobs tend to get starved out on our cluster

2019-01-10 Thread Christopher Benjamin Coffey
Hi D.J., I noticed you have: PriorityFlags=DEPTH_OBLIVIOUS,SMALL_RELATIVE_TO_TIME,FAIR_TREE I'm pretty sure it does not makes sense to have depth oblivious, and fair tree set at the same time. You'll want to choose one of them. That’s not going to be reason for the issue however, but you are l

Re: [slurm-users] Simple question but I can't find the answer

2019-01-10 Thread DENOU, MEHDI
smime.p7m Description: S/MIME encrypted message

Re: [slurm-users] Simple question but I can't find the answer

2019-01-10 Thread Andy Riebs
Is it following a host name, or a partition name? If the latter, it just means that it's the default partition. *From:* Jeffrey R. Lang *Sent:* Thursday, January 10, 2019 11:13AM *To:* Slurm-users *Cc:* *Subject:* [slurm-

[slurm-users] Simple question but I can't find the answer

2019-01-10 Thread Jeffrey R. Lang
Guys When I run sinfo some of the nodes in the list show there hostname with a following asterisk. I've looked through the man pages and what I can find on the web but nothing provides an answer. So what does the asterisk after the hostname mean? Jeff