-03-15T01:21:21 juju-65df3d-2
Mike
From: slurm-users on behalf of Hanby,
Mike
Date: Wednesday, February 15, 2023 at 1:51 PM
To: slurm-users@lists.schedmd.com
Subject: [slurm-users] Running Containerized Slurmctld and Slurmdb in
Production?
Howdy,
Just wondering if any sites are running contain
Howdy,
Just wondering if any sites are running containerized Slurmctld and Slurmdbd in
production?
We are in the process of planning migrating from a single host running
slurmctld, slurmdbd, and MySQL (and other HPC services) to separate OpenStack
VMs. Our site averages less than 1000’s runnin
mctld Restart
In slurm.conf, we just add the Features to the node description. Is that what
you were looking for?
NodeName=compute-4-4 … Weight=15 Feature=gen10
Jeff
UH IT - HPC
From: slurm-users [mailto:slurm-users-boun...@lists.schedmd.com] On Behalf Of
Hanby, Mike
Sent: Thursday, June 2, 2
Howdy,
I can’t seem to find a solution in ‘man slurm.conf’ for this. How can I make
the following persist a slurmctld restart:
scontrol update NodeName="c001" AvailableFeatures=hi_mem,data,scratch
NodeName=c001 Arch=x86_64 CoresPerSocket=12
CPUAlloc=2 CPUTot=48 CPULoad=6.08
AvailableFeatu
ing reboot"
scontrol cancel_reboot c01
From: "Hanby, Mike"
Date: Friday, August 7, 2020 at 11:43 AM
To: Slurm User Community List
Subject: Cancel "reboot ASAP" for a node
Howdy, (Slurm 18.08)
We have a bunch of node that we've updated to "scontrol reboot A
Howdy, (Slurm 18.08)
We have a bunch of node that we've updated to "scontrol reboot ASAP".
We'd like to cancel a few of those. From the man page, it's suggested that
either of the following should work, however both report the same error "
slurm_update error: Invalid node state specified":
sco
Howdy,
We are running Slurm 18.08. We have a user who has, twice, submitted over 15
thousand jobs to the cluster (the queue normally has a couple thousand jobs at
any given time).
This results in Slurm being unresponsive to user requests / job submits. I
suspect the scheduler is getting bogged
Howdy,
Running Slurm 18.08.8
We have a request to create a 2 node reservation for a class that will meet
every Tues and Thus this semester from 8AM to 9:15AM.
Is there a way to create a reservation match that, or is the closest we can do
is create a weekday reservation for that timeframe, i.e.
.conf with FirstJobId
-b
On 12/24/2018 1:09 AM, Sean Caron wrote:
On Mon, Dec 24, 2018 at 12:13 AM Hanby, Mike
mailto:mha...@uab.edu>> wrote:
Howdy,
We installed a new server to take over the duties of the Slurm master. I
imported our accounting database into MySQL, copied config files etc
Howdy,
We installed a new server to take over the duties of the Slurm master. I
imported our accounting database into MySQL, copied config files etc..
Apparently I missed the “file” that contains the last (or is it next) JOBID to
assign to the next job. The first job submitted to the new master
Thanks, Ole, that's perfect.
Mike Hanby
mhanby @ uab.edu
Systems Analyst II - Enterprise
IT Research Computing Services
The University of Alabama at Birmingham
On 6/13/18, 4:22 AM, "slurm-users on behalf of Ole Holm Nielsen"
wrote:
On 06/12/2018
Howdy,
Is anyone aware of any existing job completion email scripts that provide a
summary of the jobs resource utilization? For example, something like:
Job ID: 123456
Cluster: HPC
User/Group: jdoe/jdoe
State: COMPLETED (exit code 0)
Cores: 1
CPU Utilization: 00:18:45
CPU Efficiency: 98.60% of
12 matches
Mail list logo