So I was going to take a stab at trying to rectify this after taking care of 
post-holiday matters.

Paste of the $CLUSTER_job_table table where I think I see the issue, and now I 
just want to sanity check my steps to remediate.
https://rentry.co/qhw6mg <https://rentry.co/qhw6mg> (pastebin alternative 
because markdown is paywalled for pastebin).

There are a number of job steps with a timelimit of 4294967295, where as the 
others of the same job array are 525600.
Obviously I want to edit those time limits to sane limits (match them to the 
others).
I don’t see anything in the $CLUSTER_step_table that looks like it would need 
to be modified to match, though I could be wrong.

But then the part of getting slurm to pick it up is where I’m wanting to make 
sure I’m on the right page.
Should I manually update the mod_time timestamp and slurm will catch that at 
its next rollup?
Or will slurm catch the change in the time limit at update the mod_time when it 
sees it upon rollup?

I also don’t see any documentation stating how to manually trigger a rollup, 
either via slurmdbd.conf or command line flag.
Will it automagically perform a rollup at some predefined, non-configurable 
interval, or when restarting the daemon?

Apologies if this is all trivial information, just trying to measure twice and 
cut once.

Appreciate everyone’s help so far.

Thanks,
Reed

> On Dec 23, 2022, at 7:18 PM, Chris Samuel <ch...@csamuel.org> wrote:
> 
> On 20/12/22 6:01 pm, Brian Andrus wrote:
> 
>> You may want to dump the database, find what table/records need updated and 
>> try updating them. If anything went south, you could restore from the dump.
> 
> +lots to making sure you've got good backups first, and stop slurmdbd before 
> you start on the backups and don't restart it until you've made the changes, 
> including setting the rollup times to be before the jobs started to make sure 
> that the rollups include these changes!
> 
> When you start slurmdbd after making the changes it should see that it needs 
> to do rollups and kick those off.
> 
> All the best,
> Chris
> -- 
> Chris Samuel  :  http://www.csamuel.org/  :  Berkeley, CA, USA

Reply via email to