The cluster no longer exists, though. This is what I tried first:
[root@della5 bill]# sacctmgr show RunawayJobs cluster=tukey
sacctmgr: error: Slurmctld running on cluster tukey is not up, can't check running jobs
Bill
On 7/27/21 4:59 PM, Carlos Fenoy wrote:
Hi,
You can clean up those jobs with sacctmgr.
https://slurm.schedmd.com/sacctmgr.html
sacctmgr show RunawayJobs
This will list the runaway jobs and, if there are any, ask whether you
want to fix them.
Regards,
Carlos
On Tue, 27 Jul 2021 at 22:49, Bill Wichser <b...@princeton.edu> wrote:
[root@della5 bill]# sacctmgr -i delete user mable
Error with request: Job(s) active, cancel job(s) before remove
JobID = 602995 C = tukey A = politics U = mable
Yup, when a user has an active job they cannot be deleted from the
database. The thing is, this cluster tukey has been offline for
maybe 5 years now. Probably more.
I don't want to lose the old records in the database. Is there a
way to say, "Hey, that job you believe is still running on tukey, well,
it doesn't exist anymore, so please close this job"?
I haven't figured out a way to do this outside the database, so I suspect
that DB manipulation is the only answer.
Thanks,
Bill
--
Carles Fenoy
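
When many users are blocked this way, it can help to see which clusters the stale jobs belong to before deciding how to close them. The following is only a minimal sketch: parse_active_jobs is a hypothetical helper name, not part of Slurm, and it assumes the error text has the "JobID = ... C = ... A = ... U = ..." layout quoted in this thread. It reads saved error output and prints one "jobid cluster account user" line per blocking job.

```shell
# Hypothetical helper (not a Slurm command): pull the job id, cluster,
# account, and user out of the "Job(s) active" error lines, assuming the
# "JobID = <id> C = <cluster> A = <account> U = <user>" layout shown above.
parse_active_jobs() {
    awk '$1 == "JobID" && $2 == "=" {
        # Whitespace-split fields: 3 = job id, 6 = cluster,
        # 9 = account, 12 = user
        print $3, $6, $9, $12
    }'
}

# Example using the two error lines quoted in this thread.
printf '%s\n' \
    'Error with request: Job(s) active, cancel job(s) before remove' \
    'JobID = 602995 C = tukey A = politics U = mable' \
    | parse_active_jobs
# prints: 602995 tukey politics mable
```

The helper only reads text from standard input, so it can be run on a saved copy of the error output without touching the accounting database.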