Hi Slurmoids,

A couple of days ago whilst teaching some Slurm commands I had a bolt of
inspiration. I'm not entirely sure of the process of suggested changes
to the code base, so I thought I'd elaborate on the concept before going
down that path.

The idea is for a new dependency, 'during' to add to the set of before,
beforeok, beforenotok, after, afterok, afternotok, and singleton. A job
with the 'during' dependency will only start if a specified jobID is
also running, and would go into a PD (pending) state otherwise and would
be submitted in the following manner:

`sbatch --dependency=during:jobid:`

Like other dependencies multiple job dependencies could be specified.

A purpose of such a dependency, apart from a sense of aesthetic
completeness, could be for jobs that are providing or receiving
information from other running jobs, such as collecting real-time
real-time metrics of other running jobs.

This will obviously need changes to the slurm_comptetion.sh source,
pmi2_api.c and .h, backfill.c, job_scheduler.c, associated binaries, and
certainly the manual pages from scontrol, srun, salloc, sbatch, along
with the mpi and pbs plugins, along with examples in the testsuite.

What do people think? Is this a worthwhile project to explore or have I
missed something which is why it hasn't *already* been done?

All the best,
 
-- 
Lev Lafayette, BA (Hons), GradCertTerAdEd (Murdoch), GradCertPM, MBA
(Tech Mngmnt) (Chifley)
HPC Support and Training Officer +61383444193 +61432255208
Department of Infrastructure Services, University of Melbourne

Reply via email to