Re: [slurm-users] SLURM in Virtual Machine

2019-09-12 Thread Jose A.
Dear all, thank you for your fast feedback. My initial idea was to run slurmctld and slurmdb in respective KVMs and running while keeping the worker nodes physical. From what I see that is a setup that works without problem. However, I also find interesting some of the suggestions that you mentio

Re: [slurm-users] SLURM in Virtual Machine

2019-09-12 Thread Brian Andrus
Well, technically I have run several clusters all in VMs because it is all in the cloud. I think the main issue would be how resources are allocated and the need. Given the choice, I would not run nodes in VMs because the hypervisor inherently adds overhead that could be used for compute. How

Re: [slurm-users] SLURM in Virtual Machine

2019-09-12 Thread William Brown
I built a cluster with Login Node, slurmctld, slurmdbd and MariaDb all on VMs, and the compute nodes all physical. Works fine. Having a VM as login node has the added benefit that anyone who tries to run an application there interactively soon finds that it will not run in small RAM, and in fact

Re: [slurm-users] SLURM in Virtual Machine

2019-09-12 Thread Tina Friedrich
Hi Jose, I run my slumrctld (and the database) in a VM. Some of my test/development nodes are VMs, as well. Actual worker nodes are hardware, for performance reasons :) Is it the SLURM controller that you're planning to run as a VM, or the whole cluster? Tina On 12/09/2019 15:23, Jose A wrot

Re: [slurm-users] SLURM in Virtual Machine

2019-09-12 Thread von St. Vieth, Benedikt
Dear José, what exactly do you want to put into a VM? Computes? Slurmctld/Slurmdbd? We are running slurmctld/slurmdbd in a VM for some time now, but only for a small Cluster (14 nodes). The computes are still bare metal. Until now we do not see issues, also live-migration of the VM to another h

[slurm-users] SLURM in Virtual Machine

2019-09-12 Thread Jose A
Dear all, In the expansion of our Cluster we are considering to install SLURM within a virtual machine in order to simplify updates and reconfigurations. Does any of your have experience running SLURM in VMs? I would really appreciate if you could share your ideas and experiences. Thanks a lo

[slurm-users] oddity with users showing in sacctmgr and sreport

2019-09-12 Thread David Rhey
Hello, First issue: I have a couple dozen users that show up for an account but outside of the hierarchical structure in sacctmgr: sacctmgr show assoc account= format=Account,User,Cluster,ParentName%20 Tree When I execute that on a given account, I see that one user resides outside account where

Re: [slurm-users] Usage splitting

2019-09-12 Thread Stefan Staeglich
Hi Chris, I'm not sure how this works. I'm not very experienced in QoS objects. Have I to create two QoS objects a and b with UsageThreshold=0.1,Flags= EnforceUsageThreshold / UsageThreshold=0.9? And I need two different accounts A and B like Daniel suggested? Or can I use a single account? Al

[slurm-users] Jobs stop after 1:05:11 with segmentation faul.

2019-09-12 Thread Zacarias Benta
Greetings everyone, I have an issue with jobs I'm submiting, I have no idea how to solve it. I submit the following script using sbatch: #!/bin/bash #SBATCH -t 1-00:00:00 #SBATCH --mem 8000 #SBATCH -n 512 #SBATCH -p all # partition #SBATCH -J 1day8GB # job name ulimit -s unlimited module pur