A colleague found this, and it resolved the issue for me.
https://bugs.schedmd.com/show_bug.cgi?id=14134
The /etc/hosts on the compute nodes did not have this extra line, but
the file on the login/slurmctld node did have it.
I removed the line and now e.g. srun -x11 -N 1 xclock works.
Allan
Yes, 'salloc --x11' followed by 'ssh -X' to the allocated node works.
'hostname' command gives me the short hostname. However /etc/localhost
contains 'localhost' not the short hostname. I will experiement with
that.
These nodes are all running Ubuntu 20.04 LTS by the way.
Allan
Tina Friedrich
Many thanks! I added " AccountingStorageEnforce=limits" in slurm.conf. See
below that configuration:
# slurm.conf 3
ClusterName=localcluster
SlurmctldHost=gag
MpiDefault=none
#ProctrackType=proctrack/linuxproc
ProctrackType=proctrack/cgroup
ReturnToServ
Dear Sushil: please share the slurm.conf, if possible.
Thanks & Regards,
Sudeep Narayan Banerjee
System Analyst | Scientist B
Supercomputing Facility & Information System and Technology Facility
Academic Block 5, Room 110A
Indian Institute of Technology Gandhinagar [https://iitgn.ac.in/]
Palaj, Gu
did you enter the information slurm needs for the database in
slurmdbd,conf and slurm.conf ?
Mit freundlichen Grüßen / kind regards
--
Jörg Striewski
Information Systems and Machine Learning Lab (ISMLL)
Institute of Computer Science
University of Hildesheim Germany
post address: Universitätsp
Dear all,
I am pretty new to system administration and looking for some help
setup slumdb or maridb in a GPU cluster. We bought a machine but the vendor
simply installed slurm and did not install any database for accounting. I
tried installing MariaDB and then slurmdb as described in the manual bu
I remember having problems in that it worked via SSH - did you check
that (i.e. if you 'ssh -X' to a node it works?) - but not via SLURM.
That seemed to be authorization, and the way the SLURM inbuild method
generated the magic cookies - it couldn't cope with the node hostname
being it's FQDN,