On 25/10/18 2:29 pm, Christopher Samuel wrote:

Could explain why this isn't something we see consistently, and why we're both seeing it currently.

This seems to be a handy way to find any processes that are not properly constrained by Slurm cgroups on compute nodes (at least in our configuration):

ps --no-headers -eo pid,user,comm,cgroup | egrep -vw 'root|freezer:/slurm.*devices:/slurm.*cpuacct,cpu:/slurm.*memory:/slurm|cpuset:/slurm.*|dbus-daemon|munged|ntpd|gmond|polkitd'


--
 Chris Samuel  :  http://www.csamuel.org/  :  Melbourne, VIC

Reply via email to