Hi Ken

I am in the same boat as you are meaning that I am also new to SLURM.
This is what I've done from good recommendation.

Install Ubuntu 18.04 on your servers which just got released last week.
Apparently the ubuntu 16.04 package of SLURM is outdated.
Install slurm-llnl on headnode/master
Install slurmd on execute node

The packages will install all libraries and dependencies that you need in the 
right place.

I once tried to install with the tar ball file and had lots of issues/error 
messages.

I hope this will help you to get going.

Eric

._____________________________________________________________________________________________________

Eric F.  Alemany
System Administrator for Research

Division of Radiation & Cancer  Biology
Department of Radiation Oncology

Stanford University School of Medicine
Stanford, California 94305

Tel:1-650-498-7969<tel:1-650-498-7969>  No Texting
Fax:1-650-723-7382<tel:1-650-723-7382>

On May 5, 2018, at 11:46, Kenneth Russell 
<linux-...@comcast.net<mailto:linux-...@comcast.net>> wrote:

I am a new slurm user and am trying to set up a single node test system. I have 
spent endless hours trying to get slurm services to start. I am running Ubuntu 
Server V16.04 and slurm 17.11.5. My MB has an AMD 8 core processor. When I try 
to start slurmdbd or slurmctld services I get messages saying can't access 
shared libraries or pid files missing. At times, I noticed that the pid files 
in /var/run have been deleted. I have made copies of the pid files and copy 
them back to /var/run when they are missing.

I have found that if I reinstall slurm from the tarball, the services will 
start. To speed things up, I have created a bash script to reinstall slurm, 
starting with the tarball extraction step. This is a very inefficient 
work-around.

Can anyone help me solve the problem of why slurm runs only once and then fails 
on subsequent starts?

I can send copies of conf and log files if requested.

Thanks, in advance.

Ken Russell

Reply via email to