[slurm-users] Re: Annoying canonical question about converting SLURM_JOB_NODELIST to a host list for mpirun

2024-08-12 Thread Jeffrey Layton via slurm-users
de.html >> >> The canonical way to do it if you need to would be the scontrol show >> hostnames command against the $SLURM_JOB_NODELIST ( >> https://slurm.schedmd.com/scontrol.html#OPT_hostnames). That will give >> you the list of hosts your job is set to run on. >>

[slurm-users] Re: Annoying canonical question about converting SLURM_JOB_NODELIST to a host list for mpirun

2024-08-12 Thread Jeffrey Layton via slurm-users
m/scontrol.html#OPT_hostnames). That will give > you the list of hosts your job is set to run on. > > -Paul Edmon- > On 8/12/2024 8:34 AM, Jeffrey Layton via slurm-users wrote: > > Thanks! I admit I'm not that experienced in Bash. I will give this a whirl > as a test. > >

[slurm-users] Re: Annoying canonical question about converting SLURM_JOB_NODELIST to a host list for mpirun

2024-08-12 Thread Jeffrey Layton via slurm-users
Thanks! I admit I'm not that experienced in Bash. I will give this a whirl as a test. In the meantime, let ask, what is the "canonical" way to create the host list? It would be nice to have this in the Slurm FAQ somewhere. Thanks! Jeff On Fri, Aug 9, 2024 at 1:32 PM Hermann Schwärzler via slu

[slurm-users] Annoying canonical question about converting SLURM_JOB_NODELIST to a host list for mpirun

2024-08-09 Thread Jeffrey Layton via slurm-users
Good afternoon, I know this question has been asked a million times, but what is the canonical way to convert the list of nodes for a job that is container in a Slurm variable, I use SLURM_JOB_NODELIST, to a host list appropriate for mpirun in OpenMPI (perhaps MPICH as well)? Before anyone says,

[slurm-users] Re: With slurm, how to allocate a whole node for a single multi-threaded process?

2024-08-02 Thread Jeffrey Layton via slurm-users
I think all of the replies point to --exclusive being your best solution (only solution?). You need to know exactly the maximum number of cores a particular application or applications will use. Then you allow other applications to use the unused cores. Otherwise, at some point when the applicatio

[slurm-users] Re: Location of Slurm source packages?

2024-05-15 Thread Jeffrey Layton via slurm-users
; Hi Jeff! > > On 5/15/24 10:35 am, Jeffrey Layton via slurm-users wrote: > > > I have an Ubuntu 22.04 server where I installed Slurm from the Ubuntu > > packages. I now want to install pyxis but it says I need the Slurm > > sources. In Ubuntu 22.04, is there a package

[slurm-users] Re: Location of Slurm source packages?

2024-05-15 Thread Jeffrey Layton via slurm-users
> Lloyd > > -- > Lloyd Brown > HPC Systems Administrator > Office of Research Computing > Brigham Young Universityhttp://rc.byu.edu > > On 5/15/24 08:35, Jeffrey Layton via slurm-users wrote: > > Good morning, > > I have an Ubuntu 22.04 server where I installed

[slurm-users] Location of Slurm source packages?

2024-05-15 Thread Jeffrey Layton via slurm-users
Good morning, I have an Ubuntu 22.04 server where I installed Slurm from the Ubuntu packages. I now want to install pyxis but it says I need the Slurm sources. In Ubuntu 22.04, is there a package that has the source code? How to download the sources I need from github? Thanks! Jeff -- slurm-us

[slurm-users] Re: Integrating Slurm with WekaIO

2024-04-19 Thread Jeffrey Layton via slurm-users
> about the config. > > Simple solution: put a copy of slurm.conf in /etc/slurm/ on the node(s). > > Brian Andrus > On 4/19/2024 9:56 AM, Jeffrey Layton via slurm-users wrote: > > Good afternoon, > > I'm working on a cluster of NVIDIA DGX A100's that is using B

[slurm-users] Integrating Slurm with WekaIO

2024-04-19 Thread Jeffrey Layton via slurm-users
Good afternoon, I'm working on a cluster of NVIDIA DGX A100's that is using BCM 10 (Base Command Manager which is based on Bright Cluster Manager). I ran into an error and only just learned that Slurm and Weka don't get along (presumably because Weka pins their client threads to cores). I read thr