sure if this is related but I tried to increase
innodb_buffer_pool_size to 32G in mysql conf, without success.
Any help would be greatly appreciated.
--
Julien Rey
Plate-forme RPBS
Unité BFA - CMPLI
Université de Paris
tel: 01 57 27 83 95
7f2371d7d000)
I just had to recompile drmaa to make it work again.
Somehow the error message I was getting was misleading. The fix was very
simple actually.
Thanks for your help.
Best.
J.
Le 05/10/2023 à 10:15, Rémi Palancher a écrit :
Hello Julien,
Le mercredi 4 octobre 2023 à 19:04,
l7.x86_64
munge-0.5.11-3.el7.x86_64
munge-libs-0.5.11-3.el7.x86_64
Here is the commands I used to compile slurm so I think the munge plugin
was correctly built:
./configure --sysconfdir=/etc/slurm --enable-pam
make -j $(nproc)
make install
ldconfig
I don't know if this is a slurm or a drm
Actually, I was able to fix the problem by starting slurmctld with the
-c option and then clear the runaway jobs with sacctmgr.
Thanks for your help.
J.
Le 20/07/2022 à 17:06, Julien Rey a écrit :
Hello,
Unfortunately, the sacctmgr show runawayjobs is returning the
following error
rrent jobs that have been orphaned on the
local cluster and are now runaway:
sacctmgr show runawayjobs
Read the sacctmgr manual page.
I hope this helps.
/Ole
On 7/20/22 14:19, Julien Rey wrote:
I don't mind losing jobs information but I certainly don't want to
clear the slurm data
useful
notes in my Wiki page
https://wiki.fysik.dtu.dk/niflheim/Slurm_installation#upgrading-slurm
/Ole
On 19-07-2022 16:28, Julien Rey wrote:
I am currently facing an issue with an old install of slurm
(17.02.11). However, I cannot upgrade this version because I had
troubles with database mi
:01.694] debug2: No need to roll cluster cluster this
month 1656626400 <= 1656626400
[2022-07-19T16:00:01.711] debug2: Got 2 of 2 rolled up
[2022-07-19T16:00:01.711] debug2: Everything rolled up
--
Julien Rey
Plate-forme RPBS
Unité BFA - CMPLI
Université de Paris
tel: 01 57 27 83 95
tabase-purge-parameters).
If you are upgrading your Slurm version (or planning to do it), I also
recommend you to read the thread [slurm-users] "Extreme long db
upgrade 16.05.6 -> 17.11.3" from the last few days.
Best regards,
Ole
On 4/5/19 4:28 PM, Julien Rey wrote:
The failure
On 4/5/19 9:05 AM, Julien Rey wrote:
Hi Paul, thanks for your advice. Actually I already tried what you
suggested. No matter what value do I put after PurgeJobAfter I always
end up with the same error:
sacctmgr archive dump Directory=/home/joule/archives/
PurgeJobAfter=1days
sacctmgr: error
of our problems but I don't
recall what version they ended up in, likely newer than 15.08.0.
A solution that can work is to walk up the time so that instead of one
large purge you do several smaller purges. That at least worked for
us in the past.
-Paul Edmon-
On 4/4/19 9:38 AM, Julie
would be the
procedure for deleting the database records altogether and starting on a
fresh new one ?
Thanks in advance.
--
Julien REY
Plate-forme RPBS
Modélisation Computationnelle des Interactions Protéines-Ligand (CMPLI)
Université Paris Diderot - Paris VII
tel : 01 57 27 83 95
11 matches
Mail list logo