The cluster doesn't exist though.  This was what I tried first.

[root@della5 bill]# sacctmgr show RunawayJobs cluster=tukey
sacctmgr: error: Slurmctld running on cluster tukey is not up, can't check running jobs

Bill

On 7/27/21 4:59 PM, Carlos Fenoy wrote:
Hi,

You can cleanup those jobs with sacctmgr.
https://slurm.schedmd.com/sacctmgr.html

sacctmgr show RunawayJobs

This will list the runaway jobs, and if any will ask if you want to fix them.

Regards,
Carlos

On Tue, 27 Jul 2021 at 22:49, Bill Wichser <b...@princeton.edu <mailto:b...@princeton.edu>> wrote:

    [root@della5 bill]# sacctmgr -i delete user mable
       Error with request: Job(s) active, cancel job(s) before remove
        JobID = 602995     C = tukey      A = politics   U = mable

    Yup, when a user has an active job they cannot be deleted from the
    database.  The thing is, this cluster tukey has been offline for
    maybe 5
    years now. Probably more.

    I don't want to lose the old records in the database.  Is there a
    way to
    say,

    Hey, that job you believe is still running on tukey, well it doesn't
    exist anymore so please close this job

    ?

    I haven't figured out a way to do this outside the database so suspect
    that only DB manipulation is the only answer.

    Thanks,
    Bill

--
--
Carles Fenoy

Reply via email to