Dear all,

I'm wondering whether it's possible to implement a scheduling policy in an SGE
cluster that optimises the execution of jobs running Docker containers, so
that jobs are placed on nodes where the Docker image specified by the user
has already been downloaded.

To clarify the usage scenario, consider this example:

* the Docker daemon is installed on each cluster node;
* the user can submit one or more jobs that execute a *docker run*
command requiring a Docker image X;
* when a job starts, Docker automatically pulls image X if it has not
yet been downloaded by a previously executed job.

Since a Docker image takes some time (and bandwidth) to download, I
would like the scheduler to try to place the job on a node where image X has
already been downloaded.

If no node with that image exists, a low priority is assigned to the job,
so that it still has a chance to be executed at some point on any node.

Ideally, when a node that has image X becomes available, the job priority
is updated so that it can be executed as soon as possible.
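
To make the intended policy more concrete, here is a rough sketch in Python.
It is only an illustration of the decision I have in mind, not something SGE
provides: the Node class, the place_job function and the two priority values
are all made up for this example, and how a node would report its cached
images to the scheduler is left open.

```python
from dataclasses import dataclass, field

# Hypothetical priority values, chosen only for illustration.
NORMAL_PRIORITY = 0
LOW_PRIORITY = -100


@dataclass
class Node:
    """Hypothetical view of a cluster node as seen by the scheduler."""
    name: str
    free_slots: int
    images: set = field(default_factory=set)  # Docker images already pulled


def place_job(image, nodes):
    """Return (node name or None, priority) for a job that needs `image`."""
    free = [n for n in nodes if n.free_slots > 0]

    # First choice: a free node that already holds the image.
    cached = [n for n in free if image in n.images]
    if cached:
        best = max(cached, key=lambda n: n.free_slots)
        return best.name, NORMAL_PRIORITY

    # No free node holds the image: the job may still run anywhere,
    # but only at low priority, and will have to pull the image.
    if free:
        best = max(free, key=lambda n: n.free_slots)
        return best.name, LOW_PRIORITY

    # Nothing is free: the job waits at low priority until re-evaluated.
    return None, LOW_PRIORITY


nodes = [
    Node("n1", free_slots=2, images={"ubuntu"}),
    Node("n2", free_slots=4),
    Node("n3", free_slots=1, images={"X"}),
]
with_image = place_job("X", nodes)     # n3 already holds image X
without_image = place_job("Y", nodes)  # no node holds image Y
```

In this sketch the job needing image X goes to n3 at normal priority, while
the job needing image Y falls back to the node with the most free slots at
low priority. Re-running place_job whenever a node frees up would give the
priority update described above.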


Is it possible to implement this scheduling strategy with SGE?


Thanks,
Paolo