[slurm-users] sacctmgr show runawayjobs fails with slurmdbd crash

2023-12-21 Thread Julien Rey
sure if this is related but I tried to increase innodb_buffer_pool_size to 32G in mysql conf, without success. Any help would be greatly appreciated. -- Julien Rey Plate-forme RPBS Unité BFA - CMPLI Université de Paris tel: 01 57 27 83 95

Re: [slurm-users] auth_munge.so: Incompatible Slurm plugin version (21.08.8)

2023-10-06 Thread Julien Rey
7f2371d7d000) I just had to recompile drmaa to make it work again. Somehow the error message I was getting was misleading. The fix was very simple actually. Thanks for your help. Best. J. Le 05/10/2023 à 10:15, Rémi Palancher a écrit : Hello Julien, Le mercredi 4 octobre 2023 à 19:04,

[slurm-users] auth_munge.so: Incompatible Slurm plugin version (21.08.8)

2023-10-04 Thread Julien Rey
l7.x86_64 munge-0.5.11-3.el7.x86_64 munge-libs-0.5.11-3.el7.x86_64 Here is the commands I used to compile slurm so I think the munge plugin was correctly built: ./configure --sysconfdir=/etc/slurm --enable-pam make -j $(nproc) make install ldconfig I don't know if this is a slurm or a drm

Re: [slurm-users] slurmctld up and running but not really working

2022-07-20 Thread Julien Rey
Actually, I was able to fix the problem by starting slurmctld with the -c option and then clear the runaway jobs with sacctmgr. Thanks for your help. J. Le 20/07/2022 à 17:06, Julien Rey a écrit : Hello, Unfortunately, the sacctmgr show runawayjobs is returning the following error

Re: [slurm-users] slurmctld up and running but not really working

2022-07-20 Thread Julien Rey
rrent jobs that have been orphaned on the local cluster and are now runaway: sacctmgr show runawayjobs Read the sacctmgr manual page. I hope this helps. /Ole On 7/20/22 14:19, Julien Rey wrote: I don't mind losing jobs information but I certainly don't want to clear the slurm data

Re: [slurm-users] slurmctld up and running but not really working

2022-07-20 Thread Julien Rey
useful notes in my Wiki page https://wiki.fysik.dtu.dk/niflheim/Slurm_installation#upgrading-slurm /Ole On 19-07-2022 16:28, Julien Rey wrote: I am currently facing an issue with an old install of slurm (17.02.11). However, I cannot upgrade this version because I had troubles with database mi

[slurm-users] slurmctld up and running but not really working

2022-07-19 Thread Julien Rey
:01.694] debug2: No need to roll cluster cluster this month 1656626400 <= 1656626400 [2022-07-19T16:00:01.711] debug2: Got 2 of 2 rolled up [2022-07-19T16:00:01.711] debug2: Everything rolled up -- Julien Rey Plate-forme RPBS Unité BFA - CMPLI Université de Paris tel: 01 57 27 83 95

Re: [slurm-users] slurmdbd purge not working

2019-04-08 Thread Julien Rey
tabase-purge-parameters). If you are upgrading your Slurm version (or planning to do it), I also recommend you to read the thread [slurm-users] "Extreme long db upgrade 16.05.6 -> 17.11.3" from the last few days. Best regards, Ole On 4/5/19 4:28 PM, Julien Rey wrote: The failure

Re: [slurm-users] slurmdbd purge not working

2019-04-05 Thread Julien Rey
On 4/5/19 9:05 AM, Julien Rey wrote: Hi Paul, thanks for your advice. Actually I already tried what you suggested. No matter what value do I put after PurgeJobAfter I always end up with the same error: sacctmgr archive dump Directory=/home/joule/archives/ PurgeJobAfter=1days sacctmgr: error

Re: [slurm-users] slurmdbd purge not working

2019-04-05 Thread Julien Rey
of our problems but I don't recall what version they ended up in, likely newer than 15.08.0. A solution that can work is to walk up the time so that instead of one large purge you do several smaller purges. That at least worked for us in the past. -Paul Edmon- On 4/4/19 9:38 AM, Julie

[slurm-users] slurmdbd purge not working

2019-04-04 Thread Julien Rey
would be the procedure for deleting the database records altogether and starting on a fresh new one ? Thanks in advance. -- Julien REY Plate-forme RPBS Modélisation Computationnelle des Interactions Protéines-Ligand (CMPLI) Université Paris Diderot - Paris VII tel : 01 57 27 83 95