Dear all,
thank you for your fast feedback. My initial idea was to run slurmctld and
slurmdb in respective KVMs and running while keeping the worker nodes
physical. From what I see that is a setup that works without problem.
However, I also find interesting some of the suggestions that you
mentio
Well, technically I have run several clusters all in VMs because it is
all in the cloud.
I think the main issue would be how resources are allocated and the
need. Given the choice, I would not run nodes in VMs because the
hypervisor inherently adds overhead that could be used for compute.
How
I built a cluster with Login Node, slurmctld, slurmdbd and MariaDb all on
VMs, and the compute nodes all physical. Works fine. Having a VM as login
node has the added benefit that anyone who tries to run an application
there interactively soon finds that it will not run in small RAM, and in
fact
Hi Jose,
I run my slumrctld (and the database) in a VM. Some of my
test/development nodes are VMs, as well. Actual worker nodes are
hardware, for performance reasons :)
Is it the SLURM controller that you're planning to run as a VM, or the
whole cluster?
Tina
On 12/09/2019 15:23, Jose A wrot
Dear José,
what exactly do you want to put into a VM?
Computes? Slurmctld/Slurmdbd?
We are running slurmctld/slurmdbd in a VM for some time now, but only for a
small Cluster (14 nodes).
The computes are still bare metal.
Until now we do not see issues, also live-migration of the VM to another h
Dear all,
In the expansion of our Cluster we are considering to install SLURM within a
virtual machine in order to simplify updates and reconfigurations.
Does any of your have experience running SLURM in VMs? I would really
appreciate if you could share your ideas and experiences.
Thanks a lo
Hello,
First issue: I have a couple dozen users that show up for an account but
outside of the hierarchical structure in sacctmgr:
sacctmgr show assoc account=
format=Account,User,Cluster,ParentName%20 Tree
When I execute that on a given account, I see that one user resides outside
account where
Hi Chris,
I'm not sure how this works. I'm not very experienced in QoS objects.
Have I to create two QoS objects a and b with UsageThreshold=0.1,Flags=
EnforceUsageThreshold / UsageThreshold=0.9? And I need two different accounts
A and B like Daniel suggested? Or can I use a single account?
Al
Greetings everyone,
I have an issue with jobs I'm submiting, I have no idea how to solve
it.
I submit the following script using sbatch:
#!/bin/bash
#SBATCH -t 1-00:00:00
#SBATCH --mem 8000
#SBATCH -n 512
#SBATCH -p all # partition
#SBATCH -J 1day8GB # job name
ulimit -s unlimited
module pur