Hi,

Am 05.01.2017 um 07:54 schrieb Manfred Selz:

> Hi,
>  
> in my SGE 6.2u5 environment, I am seeing a strange issue when submitting jobs 
> to a parallel environment while also providing a hard hostname resource 
> requirement.
> This is not a standard situation, but sometimes certain benchmarks need to be 
> run on one specific host only.
>  
> When submitting a jobs either with a parallel environment or with a hard 
> hostname resource specification, the jobs starts without delay.
> However, the combination of both sometimes keeps jobs waiting for an extended 
> period of time, and I have not been able to get a clear messages from the 
> “qstat -j  <jobID>” report.
>  
> Parallel environment settings is:
> $  qconf -sp local
> pe_name            local
> slots              1000
> user_lists         NONE
> xuser_lists        NONE
> start_proc_args    /bin/true
> stop_proc_args     /bin/true
> allocation_rule    $pe_slots
> control_slaves     FALSE
> job_is_first_task  TRUE
> urgency_slots      min
> accounting_summary TRUE
>  
> The specific host being targeted has 32 slots configured for the queue being 
> used, and all of them are unused at this time.
> Is anybody aware of specific issues with the combination of parallel 
> environments and a hard hostname resource request?
>  
> I have already tested this:
> ·         Removed the parallel environment request - works
> ·         Removed the hostname request - works
> ·         Removed all resource limits (“qconf -mrqs”) - no change
> ·         Increased the “slots” limit in the PE setting - no change
> ·         Changed the PE allocation_rule to “round_robin” - no change 

I only saw problems when requesting a queue and a host at the same time, i.e. 
"-q" & "-l h=" at the same time. The solution may work also in your case: 
request the host by a queue request:

-q "*@node123"


> After all, the final message in the “qstat -j <jobID>” report is always:
> cannot run in PE "local" because it only offers 0 slots

I assume the node is free and you have no backfilling issue where slots are 
reserved.

-- Reuti


> 
> I have seen many older reports for the “only offers 0 slots” message on older 
> pages, but none specifically for the combination with a hostname spec. (only).
>  
> Regards,
> Manfred
>  
>  
> 
> 
> 
> Dialog Semiconductor GmbH
> Neue Str. 95
> D-73230 Kirchheim
> Managing Directors: Dr. Jalal Bagherli, Carsten Dahl
> Chairman of the Supervisory Board: Rich Beyer
> Commercial register: Amtsgericht Stuttgart: HRB 231181
> UST-ID-Nr. DE 811121668
> 
> Legal Disclaimer: This e-mail communication (and any attachment/s) is 
> confidential and contains proprietary information, some or all of which may 
> be legally privileged. It is intended solely for the use of the individual or 
> entity to which it is addressed. Access to this email by anyone else is 
> unauthorized. If you are not the intended recipient, any disclosure, copying, 
> distribution or any action taken or omitted to be taken in reliance on it, is 
> prohibited and may be unlawful.
> 
> 
> Please consider the environment before printing this e-mail
>  
> _______________________________________________
> users mailing list
> users@gridengine.org
> https://gridengine.org/mailman/listinfo/users


_______________________________________________
users mailing list
users@gridengine.org
https://gridengine.org/mailman/listinfo/users

Reply via email to