Re: [slurm-users] Partition Hold/Release

2023-03-15 Thread Nicolas Sonoda
Hi Marcus and Kevin, I'm sorry, I forgot to set DefMemPerCPU on my partitions, so preemption was not working. After I set it, preemption worked fine and my low-priority jobs can now be suspended. Thank you very much! Regards, Nícolas From: slurm-users on behalf of

Re: [slurm-users] Running Containerized Slurmctld and Slurmdb in Production?

2023-03-15 Thread Rigoberto Corujo
Hi Mike, If you run the Slurm daemons in a container, but the Slurm commands are run from the host, you need to make sure that the Slurm commands on the host and the Slurm daemons in the container are running similar versions of Slurm. Otherwise, the commands may not be able to communicate with

Re: [slurm-users] Running Containerized Slurmctld and Slurmdb in Production?

2023-03-15 Thread Hanby, Mike
FYI, after more internet sleuthing (searching for “juju slurm”) I came across this outstanding-looking project: Omnivector Slurm Distribution (OSD): https://omnivector-solutions.github.io/osd-documentation/master/index.html This project uses Juju (Canonical project) to deploy, configure and mana

Re: [slurm-users] Partition Hold/Release

2023-03-15 Thread Kevin Broch
Nicolas, It looks like for the partition named "test" you still have *PreemptMode=off*? On Wed, Mar 15, 2023 at 7:35 AM Wagner, Marcus wrote: > Hi Nicolas, > > > sorry to say, but we have no experience with preemption. > > > Best > > Marcus > > > On 14.03.2023 at 22:07, Nicolas Sonoda wrote:

Re: [slurm-users] Partition Hold/Release

2023-03-15 Thread Wagner, Marcus
Hi Nicolas, sorry to say, but we have no experience with preemption. Best Marcus On 14.03.2023 at 22:07, Nicolas Sonoda wrote: Hi Marcus, Thank you very much for the response. I set the PriorityTier for my partitions and also set PreemptType=preempt/partition_prio and PreemptMode=SUSPE

Re: [slurm-users] Regarding Multi-Cluster Accounting Information

2023-03-15 Thread Shaghuf Rahman
Hi Yair, Thank you for the clarification. Could you please tell me which way is better for accounting-related reports? Thanks & Regards, Shaghuf On Wed, 15 Mar 2023 at 15:08, Yair Yarom wrote: > Hi, > > We have several clusters on the same database. There are some entities > which are per cluster

[slurm-users] batched and efficient job status queries by snakemake using sacct

2023-03-15 Thread David Laehnemann
Hi again, everybody, based on the feedback here on the mailing list and on GitHub, and lots of digging into the docs, I have now changed snakemake's behaviour to do far fewer status queries and to do them in database-optimised batches via sacct. As there is a lot of user and admin knowledge aroun

Re: [slurm-users] Regarding Multi-Cluster Accounting Information

2023-03-15 Thread Yair Yarom
Hi, We have several clusters on the same database. There are some entities which are per cluster and some which are per database. accounts - per cluster (you can have the same account name with a different account hierarchy, and different limits per cluster) association - per cluster qos - per databas

Re: [slurm-users] sbatch does not work with Debian image

2023-03-15 Thread Sorin Draga
Hello everyone and thank you for the feedback, @Shunran Zang: Indeed, for some reason there is no slurm.conf file in the /etc/slurm/ directory. We previously used SchedMD's CentOS images and things worked fine. Could this be due to the Debian GCP image? Many thanks! S On Tue, Mar 14, 2023 a