Re: slurm for cluster job scheduling and coordination

2020-03-10 Thread John Sanda
> I've been working towards organizing an effort around using Kubernetes for
> cluster management. There is a lot of work to do but this could be
> something really important to tackle as a community if you (or anyone else)
> are interested in getting involved.

This is a big area of interest fo

Re: slurm for cluster job scheduling and coordination

2020-03-10 Thread Patrick McFadin
Carl, Slurm might be a nice way to keep things you already have built in some sort of control plane. Things already built meaning Terraform, Ansible, Salt, Chef, wrote:
> Between repairs, rolling restarts, scheduled maintenance bounces, backups,
> upgrades, etc there are lots of cluster-wide tas

slurm for cluster job scheduling and coordination

2020-03-09 Thread Carl Mueller
Between repairs, rolling restarts, scheduled maintenance bounces, backups, upgrades, etc., there are lots of cluster-wide tasks that would be nice to have scheduled and visible. Slurm appears to have some features that support this but might be heavyweight considering its primary application is superco