Re: [gridengine users] About cpu time.

2020-05-07 Thread Jerome IBt
Le 07/05/2020 à 01:14, Reuti a écrit : > Hi, > > It might be, that the application is ignoring the set OMP_NUM_THREADS (or > assumes a max value if unset) and using all cores in a machine. How many > cores are installed? > > -- Reuti > Hi Reuti. The jbs running on a 64 cores computer.. I wi

[gridengine users] About cpu time.

2020-05-06 Thread Jerome IBt
Dear all I'm facing a strange problem with some parralel programs. Ive run a job ina queue with 24 hours limit time. The job Qacct report this (4 cores): qnameall.q hostname compute-0-3.local groupestudiante ownerxairarg project NONE department defaultdepartment

Re: [gridengine users] Job in error states

2020-03-09 Thread Jerome
node? Or are several nodes affected? >> One guess could be that the file system is full. >> >> -- Reuti >> >> >>> Am 05.03.2020 um 18:46 schrieb Jerome : >>> Dear Reuti, Mac Thank's for your answers. There is no filesystem full, nor an NFS

[gridengine users] Job in error states

2020-03-07 Thread Jerome
Dear all I'm facing a strange error in SGE. One job is declared as in error, as i show in the following: == job_number: 1311910 exec_file: job_scripts/1311910 submission_time:Thu Mar 5 08:06

Re: [gridengine users] slots equals cores

2020-02-03 Thread Jerome
Le 31/01/2020 à 11:26, Reuti a écrit : > > Exactly. Doing it on the command line within a loop is not so laborious and > it's a fixed feature of a node which will never change during its lifetime. > > -- Reuti > > >> Thank's >> >> -- >> -- Jérôme >> Quand un arbre tombe, on l'entend ; quand

Re: [gridengine users] slots equals cores

2020-01-31 Thread Jerome IBt
Le 31/01/2020 à 10:19, Reuti a écrit : > Hi Jérôme, > > Personally I would prefer to keep the output of `qquota` short and use it > only for users's limits. I.e. defining the slot limit on an exechost basis > instead. This can also be done in a loop containing a command line like: > > $ qconf -

[gridengine users] slots equals cores

2020-01-31 Thread Jerome
Dear all I'm facing a new problem on my cluster with SGE. I don't show this before.. O maybe I never detect it. I have some nodes with 2 queue, one (named "all.q" ) to run jobs no more than 24h , and another queue (named "lenta.q" ) to run jobs than need more than 24 h. I determine qa resource quo

[gridengine users] issue compiling SoGE on Debian 10.1

2019-10-30 Thread Jerome
Dear all I've trying to compile deb package of SoGE, using the repo on Gitlab "https://gitlab.com/loveshack/sge.git";. I could generate some deb files, as sge_8.1.10-1_amd64.deb, sge-common_8.1.10-1_all.deb. But got this issue : $ dpkg-buildpackage -b ../.. dpkg-deb: building package 'sge-dbg'

Re: [gridengine users] Instalation issue on version 8.1.10-1

2018-10-18 Thread Jerome
Le 17/10/2018 à 14:15, Feng Zhang a écrit : > Maybe you can check the install script, like inst_sge, to see if there's > any typo.  > > Or some script files may have special characters, from like Widows, > Linux, etc.? > I've found where is the problem: --- a/source/dist/util/arch_variables ++

Re: [gridengine users] Instalation issue on version 8.1.10-1

2018-10-18 Thread Jerome
Le 17/10/2018 à 14:15, Feng Zhang a écrit : > Maybe you can check the install script, like inst_sge, to see if there's > any typo.  > > Or some script files may have special characters, from like Widows, > Linux, etc.? > > > Best, > > Feng > > Best, > > Feng > > Dear Feng, You're right, s

[gridengine users] Instalation issue on version 8.1.10-1

2018-10-17 Thread Jerome
Dear all I've trying to install a fresh version of SGE on a Debian server v 9.5 (on virtualbox), using the git repository of Dave Love: https://gitlab.com/loveshack/sge.git The issue occurs on the commit "0271520806868d6be018a1a5c019fc64d5faddf3" . I do the test installing the deb package file

Re: [gridengine users] Dave Love repository issue

2018-10-17 Thread Jerome
Le 17/10/2018 à 04:12, William Hay a écrit : > On Tue, Oct 16, 2018 at 06:53:11PM -0500, Jerome wrote: >> Dear William >> >> I'm watching this trac system, and it seem's to be reserved for >> developper only.. That's seems that to report a bug, one need

Re: [gridengine users] Dave Love repository issue

2018-10-16 Thread Jerome
Dear William Le 15/10/2018 à 04:54, William Hay a écrit : > On Fri, Oct 12, 2018 at 02:13:32PM -0400, Daniel Povey wrote: >>There is an issue tracker here >>https://arc.liv.ac.uk/trac >>but it's not clear whether Dave Love still has access to it (he moved to > The issue tracker has it'

[gridengine users] Dave Love repository issue

2018-10-12 Thread Jerome
Dear all. I follow the discussion about the future of SoGe. I've download the git repository from Dave Love in GitLab (https://gitlab.com/loveshack/sge) , and try to use it on a debian based system. The problem is taht i'v got a segmentation fault in the sge_master binarie. Where can i report thi

Re: [gridengine users] SGE accounting file getting too big...

2018-05-18 Thread Jerome
Le 18/05/2018 à 10:23, Noel Benitez a écrit : > Hi guys, >   > The "accounting" file on our sge master has a filesize of 20Gb. >   > Is there a recommended way of purging this file short of using "cat > /dev/null > accounting"  ? >   Dear Noel What i do in this case is to save the acounting file

Re: [gridengine users] Son of GridEngine succession?

2018-05-11 Thread Jerome
Le 11/05/2018 à 18:02, Christopher Heiny a écrit : > On Fri, 2018-05-11 at 18:49 -0400, Daniel Povey wrote: >> >> I want to start a discussion about how to replace Son of GridEngine. >> As far as I can tell, Dave Love has had no online activity for a >> year, >> is not responding to emails, and my

Re: [gridengine users] install_qmaster issue

2018-04-19 Thread Jerome
Hi Arnau Le 19/04/2018 à 02:50, Arnau a écrit : > Hi, > > why don't you first install all dependecies and the sge? > I don't know what's wrong with your apt, but in mycase apt install -f > does not want to remove sge. That the principle of Debian, all deb should install with all dependencies. Th

Re: [gridengine users] install_qmaster issue

2018-04-16 Thread Jerome
Le 13/04/2018 à 03:09, Arnau a écrit : > Hi, > > $ cat /etc/debian_version > 9.4 > > $ dpkg -l|grep  ssl > ii  libflac8:amd64                   1.3.2-1                        > amd64        Free Lossless Audio Codec - runtime C library > ii  libssl-doc                       1.1.0f-3+deb9u2       

Re: [gridengine users] install_qmaster issue

2018-04-12 Thread Jerome
Le 12/04/2018 à 03:11, Arnau a écrit : > Hi Jerome, > > the deb packages are in this link: > https://drive.google.com/open?id=1tjEh1ygOxAPigoDWEy8IXi0o_nkH--NM > > > HTH, > Arnau > > Dear Arnau Thank's foe get me this deb file. The problem is that they

Re: [gridengine users] install_qmaster issue

2018-04-06 Thread Jerome
Le 21/03/2018 à 06:39, Arnau a écrit : > Hi, > > I've tried to reproduce your issue in AWS. I did not build the packages > but installed from github as you did. > I have to say that everything work perfectly: > > sh scripts/bootstrap.sh > ./aimk -no-java -no-jni -only-core > setenv SGE_ROOT /opt/

Re: [gridengine users] install_qmaster issue

2018-03-23 Thread Jerome
Le 21/03/2018 à 06:39, Arnau a écrit : > Hi, > > I've tried to reproduce your issue in AWS. I did not build the packages > but installed from github as you did. > I have to say that everything work perfectly: > > sh scripts/bootstrap.sh > ./aimk -no-java -no-jni -only-core > setenv SGE_ROOT /opt/

Re: [gridengine users] install_qmaster issue

2018-03-20 Thread Jerome
Le 16/03/2018 à 03:41, Arnau a écrit : > Hi, > > is the name resolution working as expected? is xx.yy.zz =  > invitado.uuab.ibt.unam.mx  ? is the > resolution of  invitado.uuab.ibt.unam.mx >  to 10.0.6.50 ? are you using fqdn or

Re: [gridengine users] install_qmaster issue

2018-03-15 Thread Jerome
> > Best, > > 2018-03-14 22:15 GMT+01:00 Jerome <mailto:jer...@ibt.unam.mx>>: > > Dear all > > I've trying to installing SoGE on a fres Debian 9.4 serveur. I've to > compile for my own the deb pacakge due to change in the libssl lib

Re: [gridengine users] install_qmaster issue

2018-03-15 Thread Jerome
Le 15/03/2018 à 08:21, Arnau a écrit : > Hi,  > > when do you get this error? > is the qmaster really running? can you check if there is anything > running in port 6444?  > if it crashes try to enable debug > http://gridscheduler.sourceforge.net/howto/troubleshooting.html . > Dear Arnau The err

[gridengine users] install_qmaster issue

2018-03-14 Thread Jerome
Dear all I've trying to installing SoGE on a fres Debian 9.4 serveur. I've to compile for my own the deb pacakge due to change in the libssl library between 1.0 and 1.1. Y download the ultimate from gitlab version 9b99ef86. All compile well Now, during the process nstalation, i got an issue that

Re: [gridengine users] strange cpu time and wallclock

2017-12-07 Thread Jerome
Dear Mike So you confir my doubt. I'v to see why this NCBI program is using more than one thread!!! Regards Le 07/12/2017 à 11:07, Mike Serkov a écrit : Jerome, It just means that your job used more than one CPU ( probably multithreaded task, or something is running In parallel

[gridengine users] strange cpu time and wallclock

2017-12-07 Thread Jerome
Dear all. I've a bit confusiong about some accounting data get back from a single job: qnameall.q hostname compute-0-4.local ../.. slots1 failed 0 exit_status 0 ru_wallclock 1761 ru_utime 2066.817 ru_stime 104.779 ru_maxrss56872 ru_ixrss 0 ru_ismrss

Re: [gridengine users] Make qmaster buffer larger

2017-03-10 Thread Jerome Poitout
Hi, Le 09/03/2017 à 23:21, Reuti a écrit : Hi, Am 09.03.2017 um 17:20 schrieb Jerome Poitout: > Hello, > OGS/GE 2011.11p1 > I have an issue while submitting numerous jobs in a short time (over 300

[gridengine users] Make qmaster buffer larger

2017-03-09 Thread Jerome Poitout
Hello, OGS/GE 2011.11p1 I have an issue while submitting numerous jobs in a short time (over 300 - not so much for me...) with -sync y option. It seems that qmaster cannot handle all the requests and i get huge load on the head server (>400) and memory gets almost full (32GB). These jobs are run

Re: [gridengine users] Strange issue with one node

2016-10-24 Thread Jerome
ank's a lot Reuti to let me check where i didn't ! Regards Le 24/10/2016 à 12:34, Reuti a écrit : Hi, Am 24.10.2016 um 19:15 schrieb Jerome : Dear all I've install for a course a Rocks Cluster of 2 nodes, with SGE. Each node are a 4 cores nodes. I do a shutdown of a node, a

Re: [gridengine users] Strange issue with one node

2016-10-24 Thread Jerome
INFINITY h_rss INFINITY s_vmemINFINITY h_vmemINFINITY Thank's Regards Le 24/10/2016 à 12:20, Skylar Thompson a écrit : On Mon, Oct 24, 2016 at 12:15:41PM -0500, Jerome wrote: Dear all I've install for a course a Rocks Cluster

[gridengine users] Strange issue with one node

2016-10-24 Thread Jerome
Dear all I've install for a course a Rocks Cluster of 2 nodes, with SGE. Each node are a 4 cores nodes. I do a shutdown of a node, and so i have ready uniquely 4 cores: $ qstat -f queuename qtype resv/used/tot. load_avg arch states -

Re: [gridengine users] Batch job on interactive queue

2016-07-01 Thread Jerome
Dear Rueti Le 01/07/2016 à 10:07, Reuti a écrit : Hi, That's a do at end. But i can't understand why this issue occurs. Nothing seems to indicate that a job want to run inmediatly when using this PE. The two settings qtype and pe_list are linked with an OR. If a PE is set in a queue and re

Re: [gridengine users] Batch job on interactive queue

2016-07-01 Thread Jerome
Dear Reuti Le 29/06/2016 à 16:34, Reuti a écrit : Hi, Am 30.06.2016 um 03:39 schrieb Jerome : Dear Reuti Thank's for your answer. I use SGE for a while, and don't realize what you explain here with -now yes It specifies whether the job must start immediately, or can st

Re: [gridengine users] Batch job on interactive queue

2016-06-29 Thread Jerome
PE. Regards Le 29/06/2016 16:34, Reuti a écrit : Hi, Am 29.06.2016 um 22:56 schrieb Jerome: Dear all Here we runa Rocks cluster 6.2, whith SGE GE2011. I've configure on our cluster a special queue "express" to run uniquely interactive job. This queue is limited in time for

[gridengine users] Batch job on interactive queue

2016-06-29 Thread Jerome
Dear all Here we runa Rocks cluster 6.2, whith SGE GE2011. I've configure on our cluster a special queue "express" to run uniquely interactive job. This queue is limited in time for 2 hours. When i run qrsh, i go on this queue, so all right. But i have some strange behavior: i send a batch job

Re: [gridengine users] Unable to initialize environment because of error: range_list containes no elements

2016-05-09 Thread Jerome Poitout
stions/4883056/sge-qsub-fails-to-submit-jobs-in-sync-mode Sean From: users-boun...@gridengine.org [users-boun...@gridengine.org] on behalf of Jerome Poitout [network.ad...@dolphin.fr] Sent: Friday, May 06, 2016 12:59 AM To: Gridengine Users Group Subject: [gridengine users

[gridengine users] Unable to initialize environment because of error: range_list containes no elements

2016-05-06 Thread Jerome Poitout
Dear all, I get this error message when submitting lots of jobs using ICLys program. I believe this program submits jobs using -sync option since it is designed to increase predictability of our jobs. Unable to initialize environment because of error: range_list containes no elements I'm using ver

[gridengine users] Accounting

2016-01-28 Thread Jerome Poitout
time / running time? Best regards, Jerome ___ users mailing list users@gridengine.org https://gridengine.org/mailman/listinfo/users

Re: [gridengine users] share ressources configuration.

2016-01-07 Thread Jerome Poitout
-BEGIN PGP SIGNED MESSAGE- Hash: SHA512 Hi all I'm very glad to inform you that the policy set under your advice is a success. Thank you very much for your help ! Jérôme Le 10/12/2015 17:42, Jerome Poitout a écrit : >

Re: [gridengine users] share ressources configuration.

2015-12-15 Thread Jerome Poitout
It seems that the policy applies well... Thanks a lot for this help. Jérôme Le 11/12/2015 11:03, Jerome Poitout a écrit : > > Le 10/12/2015 18:38, Reuti a écrit : >>> Am 10.12.2015 um 15:04 schrieb Jerome Poitout : >>> >>> Running jobs. If team1 is consum

Re: [gridengine users] share ressources configuration.

2015-12-11 Thread Jerome Poitout
Le 10/12/2015 18:38, Reuti a écrit : >> Am 10.12.2015 um 15:04 schrieb Jerome Poitout : >> >> Running jobs. If team1 is consumming 80% of CPU Time and team2 20%, >> then team2 jobs should have higher priority. > Higher priority to which goal - to get also 50%? &g

Re: [gridengine users] share ressources configuration.

2015-12-10 Thread Jerome Poitout
Running jobs. If team1 is consumming 80% of CPU Time and team2 20%, then team2 jobs should have higher priority. Jérôme Le 10/12/2015 12:18, Reuti a écrit : > Do you want to honor the running jobs only (functional policy) or also the > past usage over a moving timeframe of the last 30 days or a

[gridengine users] share ressources configuration.

2015-12-09 Thread Jerome Poitout
Hi all, I have a small compute farm on which users are fighting to get some cpu time. I have almost 650 CPU (grid view) and one user can take all the CPUs available for his jobs. I've tried to set share policies, but whatever I set, I can't handle to get another result than FIFO when users are subm

Re: [gridengine users] Problem when using priorities

2015-02-24 Thread Jerome Poitout
Le 24/02/2015 13:46, Reuti a écrit : > Am 24.02.2015 um 10:23 schrieb Jerome Poitout : >> I can't get it work. > You mean to switch off the repriorization? What is the output of: No that's OK for that point. I switched off the repriorization using qmon - I read this part of

Re: [gridengine users] Problem when using priorities

2015-02-24 Thread Jerome Poitout
I can't get it work. I don't understand how to implement it in the gridengine configuration, and how to see wether it works or not. Jérôme Le 12/02/2015 12:23, Reuti a écrit : > Am 12.02.2015 um 12:05 schrieb Jerome Poitout : >> I have a question regarding jobs priority. &g

[gridengine users] Problem when using priorities

2015-02-12 Thread Jerome Poitout
Hello all, I have a question regarding jobs priority. When I turn on the reprioritize feature, I loose almost 25%-30% in my CPU efficiency. User+Nice Avg is about 68% on all my nodes when reprioritize is activated (User 16 / Nice 52 ). When disabled, User Avg is 91% and my simulations are 30% faste

Re: [gridengine users] Node refuse to run job

2012-02-09 Thread Jerome
ctory? Could you be more precise please? Thank you On 09/02/2012 11:59, "Hung-Sheng Tsao (Lao Tsao 老曹) Ph.D." wrote: check the CELL/spool/ directory of the qmaster and nodes On 2/9/2012 12:51 PM, Jerome wrote: Dera all I have the SGE version GE 6.2u2_1 on a Rocks cluster. Since few days

[gridengine users] Node refuse to run job

2012-02-09 Thread Jerome
Dera all I have the SGE version GE 6.2u2_1 on a Rocks cluster. Since few days, a node refuse to run a job. using "qstat -j jid", i notice this line a the end of the output: cannot run on host "compute-2-15.local" until clean up of an previous run has finished I revise on the node 2-15, but