Hi, > Am 19.04.2017 um 09:00 schrieb John_Tai <john_...@smics.com>: > > I am trying to submit a job to a specific host in the queue: > > # qrsh -verbose -q gui.q@ibm056 > Your job 5049542 ("QRLOGIN") has been submitted > waiting for interactive job to be scheduled ... > > > However it is in waiting state: > > # qstat -u johnt > job-ID prior name user state submit/start at queue > slots ja-task-ID > ----------------------------------------------------------------------------------------------------------------- > 5049542 0.55500 QRLOGIN johnt qw 04/19/2017 14:51:19 > 1
The queue is also defined as being "qtype INTERACTIVE"? > # qstat -j 5049542 |grep gui.q > hard_queue_list: gui.q@ibm056 > queue instance "gui.q@dsbm05" dropped because it > is full > > Here is the current status of the queue: > > # qstat -f |grep gui.q > gui.q@dsbm04 BIP 0/5/45 8.87 lx24-amd64 > gui.q@dsbm05 BIP 0/55/55 7.75 lx24-amd64 And only a load of 7.75? > gui.q@ibm056 BIP 0/11/30 3.15 lx24-amd64 Are there any consumable resource requests? I.e. is the memory perhaps fully used up by the already running jobs (being it h_vmem, virtual-free or any other consumable)? > gui.q@ibm057 BIP 0/11/30 1.34 lx24-amd64 > gui.q@ibm058 BIP 0/11/45 3.47 lx24-amd64 > > > The same goes for ibm057 and ibm058. It seems that dsbm05 being full blocks > all following servers in the queue list. In fact I can submit to dsbm04, > which precedes dsbm05. > > I recently upgraded from sge6.1 to sge6.2u6, though I can’t be sure that’s > the only thing that’s changed. How do I even begin to debug this? Did you upgrade all nodes? -- Reuti
signature.asc
Description: Message signed with OpenPGP using GPGMail
_______________________________________________ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users