[slurm-users] changing JobAcctGatherType w/running jobs

2021-09-07 Thread Paul Brunk
Hi all: Running Slurm 20.11.8. I missed a chance at a recent outage to change our JobAcctGatherType from 'linux' to 'cgroup'. Our ProctrackType has been 'cgroup' for a long time. In short, I'm thinking it would harmless for me to do this now, with running jobs, and below I discuss the caveat

Re: [slurm-users] [External] Node utilization for 24 hours

2021-09-07 Thread Sid Young
I use this to get "|" separated data that gets pumped into a dashboard: sreport -t percent -T ALL cluster utilization Start=${YESTERDAY}T${QNOW} End=now -P | grep trihpc #output format #trihpc|cpu|14.62%|4.78%|0.00%|77.08%|3.52%|100.00% #trihpc|mem|18.77%|5.57%|0.00%|75.67%|0.00%|100.00% #trihpc

Re: [slurm-users] [External] Node utilization for 24 hours

2021-09-07 Thread Florian Zillner
Hi, you can run sreport like this: sreport cluster AccountUtilizationByUser Start=$(date -d "last month" +%D) End=$(date -d "this month" +%D) or sreport cluster Utilization Start=$(date -d "last month" +%D) End=$(date -d "this month" +%D) and script something around it, to show what you're look

[slurm-users] Node utilization for 24 hours

2021-09-07 Thread Hemanta Sahu
Hello , Is there any commands or script available to see particular Node utilization percentage in terms of (allocated CPU CoreMins/available CPU Coremins) for certain time period(lets say for last 1 week). Appreciate any help in this regard Thanks Hemanta

[slurm-users] sacct got the job record without partition info

2021-09-07 Thread 刘文晓
Dear all, Now, I found when I submitted a job with error "Requested node configuration is not available" on Slurm 19.05.5, sometimes with "sacct" command i get no partition record. like below: First use command get the job: Wait several seconds: In my local lab, I use the accounting_