[gridengine users] Temporarily stop new job submissions?

2018-01-23 Thread Chester Langin
Is there a way to temporarily reject new job submissions? We are going to be doing scheduled maintenance on the cluster internal network and we want to prevent new jobs from being accepted starting a couple of days prior to this scheduled maintenance. After the maintenance, we want to return t

[gridengine users] Is there a way to let big jobs through?

2017-06-05 Thread Chester Langin
Hello, Large jobs seem to be stuck in "qw" indefinitely on our HPC cluster while a steady input of small jobs take up the slots. Is there an easy way to make the small jobs wait in "qw" longer so that the big jobs can proceed to "r" in a timely manner? Our nodes have 20 cores each. One use

[gridengine users] How to clear node (E)rror status?

2017-05-01 Thread Chester Langin
All, How do I clear a node (E)rror status? One of our users ran a job over the weekend that put 34 of our nodes in (E)rror status, effectively shutting down our cluster. The error status looks like this: $ qstat -f standard.q@compute-0-8.local BIP 0/2/20 2.00 linux-x64 E

Re: [gridengine users] Implementing new queues

2016-08-17 Thread Chester Langin
: Bill Bryce Sent: Tuesday, August 16, 2016 4:26:51 PM To: Chester Langin Cc: users@gridengine.org Subject: Re: [gridengine users] Implementing new queues You can create a sge request file that sets defaults for particular users or you can also have a Job Submission Verifier and it will enforce

[gridengine users] Implementing new queues

2016-08-16 Thread Chester Langin
Hello, We implemented a new 40-unit HPCC October, 2015, and have been using the SGE scheduler with the default configuration. The faculty have been submitting jobs to the scheduler without a specifying a queue, and the queue defaults to all.q. I would like to add more queues and force the fa