Never mind. One of my users submitted a job with 139k subjobs.

A few other questions:

1) Is it possible to stop a job submission if I know it’s going to make the
qmaster croak?
2) Will switching the qmaster to the Berkeley DB setup alleviate this?
3) Can that be done while still retaining the existing data in /opt/sge?

Best regards,
Juan Jimenez
System Administrator, BIH HPC Cluster
MDC Berlin / IT-Dept.
Tel.: +49 30 9406 2800
 

On 27.06.17, 10:04, "SGE-discuss on behalf of 
juanesteban.jime...@mdc-berlin.de" <sge-discuss-boun...@liverpool.ac.uk on 
behalf of juanesteban.jime...@mdc-berlin.de> wrote:

    I’ve got a problem with my qmaster. It is running but is unresponsive to
    commands like qstat. The process status is mostly D for disk sleep, and
    when I run it in non-daemon debug mode it spends a LOT of time reading the
    Master_Job_List.
    
    Any clues?
    
    Best regards,
    Juan Jimenez
    System Administrator, BIH HPC Cluster
    MDC Berlin / IT-Dept.
    Tel.: +49 30 9406 2800
    
    _______________________________________________
    SGE-discuss mailing list
    SGE-discuss@liv.ac.uk
    https://arc.liv.ac.uk/mailman/listinfo/sge-discuss
    

