You need to find the node on which the job started.
Then look at the slurmd log on that node. It may indicate the
reason for the failure.
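A minimal sketch of the first step, assuming job accounting is enabled so that `sacct` can report the node list for a job. The helper names `parse_sacct_nodes` and `nodes_for_job` are hypothetical, and the sample line in the usage note is made up for illustration. Where the slurmd log lives on the node depends on the SlurmdLogFile setting in your slurm.conf, so that part is left to the reader.

```python
import subprocess


def parse_sacct_nodes(output: str) -> dict[str, str]:
    """Map job ID -> node list from pipe-delimited `sacct -n -P` output."""
    nodes = {}
    for line in output.splitlines():
        line = line.strip()
        if not line:
            continue
        # Each line looks like "JobID|NodeList"; split on the first pipe only.
        job_id, _, node_list = line.partition("|")
        nodes[job_id] = node_list
    return nodes


def nodes_for_job(job_id: str) -> dict[str, str]:
    """Ask sacct which node(s) the allocation for job_id ran on."""
    # -n: no header, -P: pipe-delimited, -X: allocation only (no job steps)
    cmd = ["sacct", "-n", "-P", "-X", "-j", job_id, "--format=JobID,NodeList"]
    result = subprocess.run(cmd, capture_output=True, text=True, check=True)
    return parse_sacct_nodes(result.stdout)
```

For example, `parse_sacct_nodes("26|node[03-04]\n")` yields `{"26": "node[03-04]"}`; on a cluster you would call `nodes_for_job("26")` and then read the slurmd log on the reported node.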
On Tue, 7 Jan 2025 at 11:30, sportlecon sportlecon via slurm-users <
slurm-users@lists.schedmd.com> wrote:
> slurm 24.11 - squeue displays reaso
As cited on the issue, between Slurm 23.02 and 23.11 the defined type
`hostlist_t` was changed by the developers. Prior to 23.11 it was an opaque
pointer to a hostlist_t data structure; in 23.11 it is the data structure
itself. This means that all hostlist API functions changed, as well, to
d
Hi Jeffrey,
We run Slurm 24.05.5. I have now used the RockyLinux 8.10 versions of gcc
and cmake, following the instructions in README.md, but the build is
still failing.
/Ole
On 07-01-2025 20:14, Jeffrey Frey wrote:
Ole,
I added a brief "how to build" to the README.md, hope that helps. I a
Ole,
I added a brief "how to build" to the README.md, hope that helps. I also
commented on your issue on GitHub: your environment doesn't seem to have the
slurm.h header that's expected. What version of Slurm are you running?
-Jeff
> On Jan 7, 2025, at 13:26, Ole Holm Nielsen via slurm-u
Hi Jeffrey,
Thanks a lot, I'd like to try out snodelist. Not knowing much about
CMake, I couldn't build the tool :-( I've opened an issue on GitHub:
https://github.com/University-of-Delaware-IT-RCI/snodelist/issues/1
Can you help me out?
On 07-01-2025 16:27, Jeffrey Frey via slurm-users wrot
Davide, the 'nodeset' command (part of ClusterShell) can be used here:
nodeset -e -S '\n' node[03-04,12-22,27-32,36]
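If neither ClusterShell nor a Slurm-linked tool is at hand, the expansion can be approximated in pure Python. This is only a sketch under stated assumptions: it handles one bracket group per name with zero-padded numeric ranges, which covers the example in this thread but not everything Slurm's own hostlist code accepts. The function names `split_top_level` and `expand_hostlist` are hypothetical.

```python
import re


def split_top_level(hostlist: str) -> list[str]:
    """Split on commas that are not inside [...] brackets."""
    parts, depth, current = [], 0, []
    for ch in hostlist:
        if ch == "[":
            depth += 1
        elif ch == "]":
            depth -= 1
        if ch == "," and depth == 0:
            parts.append("".join(current))
            current = []
        else:
            current.append(ch)
    if current:
        parts.append("".join(current))
    return parts


def expand_hostlist(hostlist: str) -> list[str]:
    """Expand e.g. 'node[03-04,12]' into ['node03', 'node04', 'node12']."""
    hosts = []
    for part in split_top_level(hostlist):
        match = re.fullmatch(r"([^\[\]]*)\[([^\[\]]+)\]([^\[\]]*)", part)
        if match is None:
            hosts.append(part)  # plain hostname without a bracket group
            continue
        prefix, ranges, suffix = match.groups()
        for item in ranges.split(","):
            start, _, end = item.partition("-")
            end = end or start  # a single number is a range of one
            width = len(start)  # preserve zero padding such as "03"
            for n in range(int(start), int(end) + 1):
                hosts.append(f"{prefix}{n:0{width}d}{suffix}")
    return hosts


print("\n".join(expand_hostlist("gpu[01-02],node[03-04,12-22,27-32,36]")))
```

The tools mentioned in this thread remain the safer choice on a real cluster, since they track the actual hostlist syntax rather than this simplified subset.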
On Mon, 6 Jan 2025 at 19:58, Davide DelVento via slurm-users <
slurm-users@lists.schedmd.com> wrote:
> Hi all,
> I remember seeing on this list a slurm command to change a slurm-friendly
> list suc
We use a tool that's compiled against the Slurm library itself so that the
expansion/contraction of lists is always 100% in sync with Slurm itself:
https://github.com/jtfrey/snodelist
> On Jan 7, 2025, at 10:12, Davide DelVento via slurm-users
> wrote:
>
> Wonderful. Thanks Ole for the re
Wonderful. Thanks Ole for the reminder! I had bookmarked your wiki (of
course!) but forgot to check it out in this case. I'll add a more prominent
reminder to self in my notes to always check it!
Happy new year everybody once again
On Tue, Jan 7, 2025 at 1:58 AM Ole Holm Nielsen via slurm-users <
It seems that the SLURM_CONF variable DOES in fact affect where slurmdbd
looks for slurmdbd.conf.
I had dismissed that option because it was only mentioned in the slurmctld
section.
Thanks a ton!
Sven
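As a rough illustration of the above, the following sketch prepares a slurmdbd launch with SLURM_CONF set for that one process only. It rests on assumptions: the thread reports that the variable influences where slurmdbd.conf is looked for, but does not spell out how the path is derived, so the example path `/tmp/slurm-test/slurm.conf` and the helper name `slurmdbd_launch_spec` are hypothetical. The binary path and the `-D -vvv` flags are the ones quoted elsewhere in this listing.

```python
import os


def slurmdbd_launch_spec(conf_path: str,
                         slurmdbd: str = "/usr/local/slurm/sbin/slurmdbd"):
    """Return (command, environment) for a foreground slurmdbd test run."""
    env = dict(os.environ)          # copy, so the caller's environment is untouched
    env["SLURM_CONF"] = conf_path   # assumption: slurmdbd honours this variable
    cmd = [slurmdbd, "-D", "-vvv"]  # foreground, verbose, as in the quoted command
    return cmd, env


cmd, env = slurmdbd_launch_spec("/tmp/slurm-test/slurm.conf")  # hypothetical path
print(" ".join(cmd), "with SLURM_CONF =", env["SLURM_CONF"])
```

To actually start the daemon you could pass the pair to `subprocess.run(cmd, env=env)`; the verbose output should then show which configuration file slurmdbd tries to read.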
On 07.01.25 15:11, Groner, Rob wrote:
I'll note that the SLURM_CONF env var is listed on the sbatch docs
pag
I'll note that the SLURM_CONF env var is listed on the sbatch docs page, so it
is likely not an override for all Slurm commands.
From: Groner, Rob via slurm-users
Sent: Tuesday, January 7, 2025 9:04 AM
To: slurm-users@lists.schedmd.com ; Sven Schulze
Subject: [slu
On 1/7/25 14:54, Sven Schulze via slurm-users wrote:
Is there a way to change the location of the slurmdbd.conf temporarily
when starting slurmdbd?
For all other daemons I can specify "-f", but this doesn't seem to work
for slurmdbd. Is there a way to edit the build files to achieve this?
Th
The config file location is set during the ./configure step when building the
source code. I think the option is --sysconfdir or something similar. Run
./configure --help to find the correct syntax. After running configure, rebuild,
and Slurm will look in the new location.
There is also a SLURM_CONF env var tha
Hey all,
Is there a way to change the location of the slurmdbd.conf temporarily
when starting slurmdbd?
For all other daemons I can specify "-f", but this doesn't seem to work
for slurmdbd. Is there a way to edit the build files to achieve this?
Kind Regards,
Sven
slurm 24.11 - squeue displays reason "launch failed requeued held"
--
slurm-users mailing list -- slurm-users@lists.schedmd.com
To unsubscribe send an email to slurm-users-le...@lists.schedmd.com
Hi,
My 2 cents: I have collected various Slurm hostlist commands in this Wiki
page:
https://wiki.fysik.dtu.dk/Niflheim_system/Slurm_operations/#expanding-and-collapsing-host-lists
Best regards,
Ole
On 1/7/25 09:25, Steffen Grunewald via slurm-users wrote:
On Mon, 2025-01-06 at 12:55:12 -070
On Sat, 2024-12-28 at 22:59:45 -, Slurm users wrote:
> ls -ls /usr/local/slurm/etc/slurmdbd.conf
> 4 -rw--- 1 slurm slurm 497 Dec 28 16:34 /usr/local/slurm/etc/slurmdbd.conf
>
> sudo -u slurm /usr/local/slurm/sbin/slurmdbd -Dvvv
>
> slurmdbd: error: s_p_parse_file: unable to read
> "/
On Sat, 2025-01-04 at 08:11:21 -, Slurm users wrote:
> JOBID PARTITION     NAME  USER ST  TIME NODES NODELIST(REASON)
>    26       cpu myscript user1 PD  0:00     4 (Nodes required for job are DOWN, DRAINED or reserved for jobs in higher
>
On Mon, 2025-01-06 at 12:55:12 -0700, Slurm users wrote:
> Hi all,
> I remember seeing on this list a slurm command to change a slurm-friendly
> list such as
>
> gpu[01-02],node[03-04,12-22,27-32,36]
>
> into a bash friendly list such as
>
> gpu01
> gpu02
> node03
> node04
> node12
> etc
I alwa