On Tue, 14 Feb 2017, William Hay wrote:
...
Our prolog does a parallel ssh(passing through appropriate envvars) into every node assigned to the job and does the equivalent of a run-parts on a directory filled with scripts. Some of these scripts check if they are running on the head node.

(been meaning to reply to this bit for a while, sorry)

For comparison purposes, we achieved a similar result with an extensible starter_method combined with a client-side JSV.

The core starter_method is written in bash and is very basic, but almost everything can be overridden or supplemented (including the bit that actually starts the job script). It does this by reading an environment variable containing a list of shell fragments to source.
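The mechanism described above can be sketched roughly as follows. This is an illustrative reconstruction, not the production script: the variable name `STARTER_FRAGMENTS`, the colon-separated format, and the function names are all assumptions for the sake of the example.

```shell
#!/bin/bash
# Sketch of an extensible starter_method.  All names here are
# illustrative assumptions, not the real production code.
# STARTER_FRAGMENTS: hypothetical colon-separated list of shell
# fragments to source; any fragment may redefine the functions below,
# including the one that actually launches the job script.

start_job() {                       # default launcher; overridable
    exec "$@"
}

source_fragments() {
    local IFS=':' frag
    for frag in $STARTER_FRAGMENTS; do
        [ -r "$frag" ] && . "$frag"  # fragment may override start_job etc.
    done
}

# --- demonstration with a throwaway fragment ---
frag=$(mktemp)
cat > "$frag" <<'EOF'
start_job() { echo "custom launcher would run: $*"; }
EOF
STARTER_FRAGMENTS=$frag
source_fragments
start_job /bin/true
rm -f "$frag"
```

Because the fragment list travels in the job's environment, a submitted job (or a JSV acting on it) can change launch behaviour without touching anything installed on the execution hosts.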

This way the job itself controls what is executed at launch, on both the MASTER and the SLAVEs, which means we can develop safely on a production system simply by submitting a job that swaps in a new client-side JSV (-clear -jsv ...), or by setting an environment variable via a qsub flag.


This model has worked very well for years :)

The only thing that's broken it so far is this business of managing tmpdir space. I'm going to have to do something like your method in the epilog if I want to provide an option to copy the SLAVE tmpdirs to permanent storage at the end of the job. Annoyingly, I'd also have to stop relying upon the execd to manage tmpdir creation/deletion, as otherwise they're too ephemeral.
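For what it's worth, the epilog step I have in mind might look something like the sketch below. `$PE_HOSTFILE` and `$TMPDIR` are the standard SGE variables; `SAVE_DIR`, the `ssh`+`tar` transport, and the `RSH` override hook are assumptions for illustration only.

```shell
#!/bin/bash
# Hedged sketch of an epilog step that copies each SLAVE node's job
# tmpdir back to permanent storage.  PE_HOSTFILE and TMPDIR are the
# usual SGE variables; SAVE_DIR and the ssh+tar transport are assumed.
# RSH defaults to ssh but can be overridden (e.g. for local testing).

save_slave_tmpdirs() {
    local host rest
    while read -r host rest; do
        mkdir -p "$SAVE_DIR/$host"
        # Stream the remote tmpdir contents into per-host storage.
        ${RSH:-ssh} "$host" tar -C "$TMPDIR" -cf - . \
            | tar -C "$SAVE_DIR/$host" -xf -
    done < "$PE_HOSTFILE"
}
```

The awkward part, as noted above, is that this only works if the tmpdirs still exist when the epilog runs, which is why execd-managed creation/deletion gets in the way.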


...
With the magic option programs permissions are left alone and
jobs only access the gpu we intend for them.  Given that this
is an option to a kernel module I assume that it is responsible
for the reset of permissions.
...

Although the magic kernel-module option prevents it from happening, the strace output I looked at implies that it's actually the user process that resets the permissions. I want to be wrong.

Mark
_______________________________________________
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users