Re: [slurm-users] Priority wait

Roe Zohar Mon, 13 Nov 2017 09:44:23 -0800

Hi guys,
Thanks for the replay.
I will try to add my slurm.conf tomorrow. Sadly, its a bit of a problem
since its on a cluster disconected from the net and with no easy way of
getting it out :(

I will try tomorrow with the hope that any body could catch some bad
parameter.

Thanks,
Roy

On Nov 13, 2017 7:38 PM, "Douglas Jacobsen" <dmjacob...@lbl.gov> wrote:

Assuming you are using backfill, I suspect this is caused by using default
schedulerparameters, specifically the bf maxjobs or other similar limits
that would prevent jobs from being reviewed.  Setting debugflags=backfill
will help greatly in debugging these issues.

There are analogous parameters for the lightweight, non backfill scheduler.

Checking slurm.conf man page under schedulerparameters should be very
informative.

On Nov 13, 2017 10:29, "A" <andrealp...@gmail.com> wrote:

> I'm guessing you should have sent them to cluster Decepticon, instead....
>
> In all seriousness though, provide the conf file. You might have
> accidentally set a maximum number of running jobs somewhere
>
>
> On Nov 13, 2017 7:28 AM, "Benjamin Redling" <benjamin.ra...@uni-jena.de>
> wrote:
>
>> Hi Roy,
>>
>> On 11/13/17 2:37 PM, Roe Zohar wrote:
>> [...]
>>
>>> I sent 3000 jobs with feature Optimus and part are running while part
>>> are pendind. Which is ok.
>>> But I have sent 1000 jobs to Megatron and they are all in pending
>>> stating they wait because of priority. Whay os that?
>>>
>>> B.t.w if I change their priority to a higher one, they start to run on
>>> Megatron.
>>>
>>
>> my guess: is if you can provide the slurm.conf of that cluster, the
>> probability anyone will sacrifice his spare-time for you will increase
>> significantly.
>>
>> Regards,
>> Benjamin
>> --
>> FSU Jena | JULIELab.de/Staff/Benjamin+Redling.html
>> ☎ +49 3641 9 44323
>>
>>

Re: [slurm-users] Priority wait

Reply via email to