We're running SGE 6.2u5 and we've got a "short" queue, which assigns a higher priority but imposes run-time and CPU-time limits.
We also have a "short-on-interactive" queue to allow short jobs to run on a subset of the slots on our interactive nodes, with the goal of allowing short, high-priority jobs to use idle resources on the interactive machines. The "short-on-interactive" queue is subordinate to the "interactive" queue, so if the interactive server becomes busy, the batch jobs will be suspended. In general, this works fine. However, if a batch job is suspended for too long, it exceeds the h_rt limit and is killed by SGE. Is there any way to prevent a job from accumulating "run time" in SGE's accounting for the period that it is suspended? If this is not possible now, can this be considered as a future request for enhancement? Thanks, Mark _______________________________________________ users mailing list [email protected] https://gridengine.org/mailman/listinfo/users
