[slurm-users] Alias for SlurmctldHost

2021-04-27 Thread Rupert Madden-Abbott
Hi, Is it possible to set SlurmctldHost to something other than the hostname? For example, can I configure /etc/hosts or DNS in some way to allow me to use an alias? I have tried both but on starting slurmctld I get this: slurmctld: error: This host (hostname/hostname) not a valid controller Wh

Re: [slurm-users] Cloud nodes remain in state "alloc#"

2020-10-25 Thread Rupert Madden-Abbott
me an invalid state transition error from ALLOCATION to RESUME * Setting the hostname of the node via scontrol update because the node hostname doesn't match the nodename and I have placed the nodename as an alias in /etc/hosts on the slurmd node. This has no impact. On Sat, 24 Oct

[slurm-users] Cloud nodes remain in state "alloc#"

2020-10-24 Thread Rupert Madden-Abbott
Hi, I'm using Slurm's elastic compute functionality to spin up nodes in the cloud, alongside a controller which is also in the cloud. When executing a job, Slurm correctly places a node into the state "alloc#" and calls my resume program. My resume program successfully provisions the cloud node a

[slurm-users] Cloud Scheduling Cluster Size Limit

2020-09-18 Thread Rupert Madden-Abbott
Hi, The Cloud Scheduling Guide [1] recommends setting the TreeWidth to the maximum cluster size to disable hierarchical communications. The maximum TreeWidth value is 65533. Does this effectively mean that cloud clusters are limited to 65533 nodes? What is the expected behaviour if I run a cloud c