Is there a way to temporarily reject new job submissions? We are going to be
doing scheduled maintenance on the cluster internal network and we want to
prevent new jobs from being accepted starting a couple of days prior to this
scheduled maintenance. After the maintenance, we want to return t
Hello,
Large jobs seem to be stuck in "qw" indefinitely on our HPC cluster while a
steady input of small jobs take up the slots. Is there an easy way to make the
small jobs wait in "qw" longer so that the big jobs can proceed to "r" in a
timely manner?
Our nodes have 20 cores each. One use
All,
How do I clear a node (E)rror status? One of our users ran a job over the
weekend that put 34 of our nodes in (E)rror status, effectively shutting down
our cluster. The error status looks like this:
$ qstat -f
standard.q@compute-0-8.local BIP 0/2/20 2.00 linux-x64 E
: Bill Bryce
Sent: Tuesday, August 16, 2016 4:26:51 PM
To: Chester Langin
Cc: users@gridengine.org
Subject: Re: [gridengine users] Implementing new queues
You can create a sge request file that sets defaults for particular users or
you can also have a Job Submission Verifier and it will enforce
Hello,
We implemented a new 40-unit HPCC October, 2015, and have been using the SGE
scheduler with the default configuration. The faculty have been submitting
jobs to the scheduler without a specifying a queue, and the queue defaults to
all.q. I would like to add more queues and force the fa