Re: [slurm-users] Compute node process monitoring tools updated

2021-01-19 Thread Ryan Novosielski
Thanks, that’s great! I do a lot of that by hand (including lots over this weekend), so it will be a nice timesaver. -- #BlackLivesMatter || \\UTGERS, |---*O*--- ||_// the State | Ryan Novosielski - novos...@rutgers.edu

Re: [slurm-users] Compute node process monitoring tools updated

2021-01-19 Thread Alan Orth
Thank you for that, Ole! I will give them a spin on our cluster and send any feedback to GitHub. Cheers, On Mon, Jan 18, 2021 at 4:12 PM Ole Holm Nielsen wrote: > FYI: My Slurm tools for displaying batch job user process information have > been updated. Besides the user process list from "ps",

[slurm-users] Compute node process monitoring tools updated

2021-01-18 Thread Ole Holm Nielsen
FYI: My Slurm tools for displaying batch job user process information have been updated. Besides the user process list from "ps", a summary of the number of processes and threads is now printed as well. We use this for monitoring the sanity of user jobs. For example, we often see jobs that r