Re: [SGE-discuss] exit status lost when using parallel environment

2015-06-22 Thread William Hay
On Wed, 17 Jun 2015 08:50:10 + Alexis Huxley wrote: > So now to my question: > > Is it a bug that the exit status is lost when running in a PE or have > I misunderstood something? > I don't know but since returning the exit status of the job's master process could be useful you could open

Re: [SGE-discuss] 'load_formula slots' and get_load_value()

2015-07-14 Thread William Hay
On Tue, 14 Jul 2015 09:10:58 + Alexis Huxley wrote: > then the output is: > > Tue 2015-07-14 10:55:32 CEST: sge_qmaster[42156]: > get_load_value: attrname=slots Tue 2015-07-14 10:55:32 CEST: > sge_qmaster[42156]: get_load_value: get_attribute_by_name(..., > "slots", ...) failed Tue 2

Re: [SGE-discuss] 'load_formula slots' and get_load_value()

2015-07-14 Thread William Hay
On Tue, 14 Jul 2015 11:07:01 + Alexis Huxley wrote: > > How is slots defined on the hosts of your cluster? > > Well, I had thought that the 'qconf -sq all.q' output above meant > that it was defined where it was supposed to be defined, but you Well that is where it normally is defined but

Re: [SGE-discuss] SGE 8.1.8 CGROUP question

2015-12-07 Thread William Hay
On Mon, Dec 07, 2015 at 03:20:08PM +, Ondrej Valousek wrote: > SGE can use CGROUPS for this bookkeeping you mentioned, too. It is just a > different filesystem, so it is quite simple to figure out which process > belongs to which cgroup. > To me, when using cgroups, additional GID should not

Re: [SGE-discuss] external jobs taking all licenses

2016-02-09 Thread William Hay
On Tue, Feb 09, 2016 at 04:01:03PM +0100, Daniel Fink (PDF) wrote: >Hello list, > > > >I have a grid setup where some jobs use a license (complex) that is very >restricted. Most of the time there are only 5 licenses available. > >I use the oelsen script to keep the complexes

Re: [SGE-discuss] ENABLE_ADDGRP_KILL parameter

2016-02-10 Thread William Hay
On Wed, Feb 10, 2016 at 09:57:54AM +, Ondrej Valousek wrote: >Hi List, > > > >We are have set the ENABLE_ADDGRP_KILL=false parameter of the exec daemon >(also using CGROUPS, btw). > >It seems that execd is still killing processes left behind on the >execution host, i

Re: [SGE-discuss] ENABLE_ADDGRP_KILL parameter

2016-02-10 Thread William Hay
On Wed, Feb 10, 2016 at 10:32:23AM +, Ondrej Valousek wrote: > Thanks William, > > Disabling Cgroups did the job, indeed. I did not know stepherd is using > cgroups for tracking purposes as well. > BTW - if it does, why it still adds the supplementary group IDs to the jobs > if Cgroups are e

Re: [SGE-discuss] [SGE-bugs] [SGE] #1569: qsub delay

2016-03-22 Thread William Hay
On Tue, Mar 22, 2016 at 12:36:07PM +, SGE wrote: > #1569: qsub delay > --+-- > Reporter: Narsimha | Owner: Narsimha > Type: defect| Status: new > Priority: normal| Milestone: > Component: sge |Version: 8.1.8 > S

Re: [SGE-discuss] [SGE] #1569: qsub delay

2016-03-23 Thread William Hay
On Wed, Mar 23, 2016 at 03:39:59AM +, SGE wrote: > #1569: qsub delay > --+- > Reporter: Narsimha | Owner: Narsimha > Type: defect| Status: closed > Priority: normal| Milestone: > Component: sge | Version:

Re: [SGE-discuss] Resource Reservation

2016-04-05 Thread William Hay
On Tue, Apr 05, 2016 at 09:49:09AM +0530, Narsimha Reddy wrote: >Dear Sir, >Thank you for reply. > >But my requirement is to stop the backfilling and enable a pure FIFO model >in grid engine. In the above case it is should block the job 194 and allow >193 to get 1 more core to

Re: [SGE-discuss] Resource Reservation

2016-04-06 Thread William Hay
On Wed, Apr 06, 2016 at 09:49:42AM +0530, Narsimha Reddy wrote: >Dear Sir, >Thank you for the reply. >How can a user request a reservation and how to set the wait time for jobs You request a reservation by adding -R y to the qsub command line. You could add this to ${SGE_ROOT}/${SGE_

Re: [SGE-discuss] "Decoding gridengine" workshop

2016-08-24 Thread William Hay
On Wed, Aug 24, 2016 at 10:20:06AM +0100, Mark Dixon wrote: > Hi there, > > Is there any interest for a meeting in the UK looking at the internals of > gridengine? Potential topics might be: > > * Building from source > * How the code is organised > * How to debug or develop gridengine > > The p

Re: [SGE-discuss] Can no longer view cluster config on non-admin hosts?

2016-08-25 Thread William Hay
On Thu, Aug 25, 2016 at 11:34:48AM +0100, Mark Dixon wrote: > Hi there, > > Playing around with CentOS 7 + SoGE 8.1.9, just noticed that attempts to > view the cluster config from a non-admin host fails: > > $ qconf -sconf > denied: host "" is not an admin host > > True for all the '-s*' swi

Re: [SGE-discuss] SGE + kerberos

2016-09-26 Thread William Hay
On Fri, Sep 23, 2016 at 07:23:00PM +, Thomas Beaudry wrote: >Hi, > >I am running into problems when cluster users submit a job to execution >hosts that don't have the users kerberos ticket to access NFS shares. I >tried copying the users's ticket to the excution host but that

Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-13 Thread William Hay
On Thu, Oct 13, 2016 at 11:07:28AM +0530, Himanshu Joshi wrote: > >The error again is > >Error: Unable to access jarfile ./util/gui-installer/installer.jar IIRC you were having issues with building with Java earlier. I suspect the above may be a result of that. Possibly you just need

Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-13 Thread William Hay
On Thu, Oct 13, 2016 at 03:41:27PM +0530, Himanshu Joshi wrote: >Thanks William, > >As per your suggestion I had changed the hostname to MBIALJPJ >hostnamectl status command says > > Static hostname: mbialjpj > Pretty hostname: MBIALJPJ > Icon name: computer-de

Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-14 Thread William Hay
On Thu, Oct 13, 2016 at 08:23:13PM +0530, Himanshu Joshi wrote: >Lets SGE-discuss answer the question, >As you have rightly pointed out I would like to mention that at the time >of first installation, I specified "default" as the cell name. But >default was never used as qmaster ho

Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-18 Thread William Hay
On Tue, Oct 18, 2016 at 09:48:21AM +0530, Himanshu Joshi wrote: >Thanks Love, >Apologies for mixing up the two different pipelines. >Let me start from scratch and follow the procedure recommended by you. >I am hereby requesting you to share your recommended procedure/pipeline >f

Re: [SGE-discuss] Error at the time of Distribution staging

2016-10-20 Thread William Hay
On Thu, Oct 20, 2016 at 11:44:42AM +0530, Himanshu Joshi wrote: >Thanks William and Love, >Now I had downloaded gridengine-8.1.9-1.el6.src >and performed rpm -Uvh gridengine-8.1.9-1.el6.src in mu /opt/sge folder as >a super user > >warning: gridengine-8.1.9-1.el6.src.rpm: Heade

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-10-20 Thread William Hay
On Thu, Oct 20, 2016 at 07:47:38PM +0530, Himanshu Joshi wrote: >-- Forwarded message -- >From: William Hay >Date: Thu, Oct 20, 2016 at 6:41 PM >Subject: Re: [SGE-discuss] Error at the time of Distribution staging >To: Himanshu Joshi &g

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-10-31 Thread William Hay
On Fri, Oct 21, 2016 at 09:53:21AM +0530, Himanshu Joshi wrote: >I am sorry for bothering you so much William, No problem. > >I think rpm file is not getting installed on my system, forget about >building it. i tried with other rpm file >(adobe-release-x86_64-1.0-1.noarch.rpm) requ

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-04 Thread William Hay
On Fri, Nov 04, 2016 at 07:47:41PM +0530, Himanshu Joshi wrote: >On Mon, Oct 31, 2016 at 3:21 PM, William Hay wrote: > > On Fri, Oct 21, 2016 at 09:53:21AM +0530, Himanshu Joshi wrote: > >I am sorry for bothering you so much William, > No problem. >

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-07 Thread William Hay
On Sat, Nov 05, 2016 at 10:55:38AM +0530, Himanshu Joshi wrote: >Redhat enterprise Linux 7.2 with X86-64 architecture >Please find the requested information with other relevant info >hostnamectl status > Static hostname: mbialjpj > Pretty hostname: MBIALJPJ > Ico

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-07 Thread William Hay
On Mon, Nov 07, 2016 at 05:41:54PM +0530, Himanshu Joshi wrote: >On Mon, Nov 7, 2016 at 3:09 PM, William Hay wrote: > > On Sat, Nov 05, 2016 at 10:55:38AM +0530, Himanshu Joshi wrote: > >Redhat enterprise Linux 7.2 with X86-64 architecture > >Plea

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-08 Thread William Hay
On Tue, Nov 08, 2016 at 11:30:35AM +0530, Himanshu Joshi wrote: > I'd try running the command > > /usr/lib/lsb/install_initd /etc/init.d/sgemaster.mbialjpj55||echo $? > > To see if it produces any output. > >Yes the output for this command is >1 Annoyingly silent error. W

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-09 Thread William Hay
On Wed, Nov 09, 2016 at 11:25:42AM +0530, Himanshu Joshi wrote: >On Tue, Nov 8, 2016 at 9:38 PM, William Hay wrote: > > On Tue, Nov 08, 2016 at 11:30:35AM +0530, Himanshu Joshi wrote: > > I'd try running the command > > > > /usr/l

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-09 Thread William Hay
On Wed, Nov 09, 2016 at 04:59:18PM +0530, Himanshu Joshi wrote: >On Wed, Nov 9, 2016 at 2:18 PM, William Hay wrote: > > On Wed, Nov 09, 2016 at 11:25:42AM +0530, Himanshu Joshi wrote: > >On Tue, Nov 8, 2016 at 9:38 PM, William Hay > wrote: > &

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-11 Thread William Hay
On Thu, Nov 10, 2016 at 02:26:35PM +0530, Himanshu Joshi wrote: > I suspect you probably want to use inst_sge to configure the node as an > execd as well. > >Is there any documentation available for doing that because I do not have >any idea how to do it http://arc.liv.ac.uk/SGE/

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-14 Thread William Hay
On Mon, Nov 14, 2016 at 06:03:43PM +0530, Himanshu Joshi wrote: >Thanks William >On Fri, Nov 11, 2016 at 10:31 PM, William Hay wrote: > > On Thu, Nov 10, 2016 at 02:26:35PM +0530, Himanshu Joshi wrote: > > I suspect you probably want to use inst_sge to

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-15 Thread William Hay
On Tue, Nov 15, 2016 at 10:44:01AM +0530, Himanshu Joshi wrote: >On Mon, Nov 14, 2016 at 8:41 PM, William Hay wrote: > > On Mon, Nov 14, 2016 at 06:03:43PM +0530, Himanshu Joshi wrote: > >Thanks William > >On Fri, Nov 11, 2016 at 10:31 PM, Will

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-16 Thread William Hay
On Tue, Nov 15, 2016 at 04:43:08PM +0530, Himanshu Joshi wrote: > As root: > source /opt/sge/default/common/settings.sh > qconf -ae > >Thanks,Please find the outputs and advise >[root@mbialjpj ~]# source /opt/sge/default/common/settings.sh >SGE_ROOT=/opt/sge: Command not

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-18 Thread William Hay
On Wed, Nov 16, 2016 at 07:32:00PM +0530, Himanshu Joshi wrote: >On Wed, Nov 16, 2016 at 6:59 PM, William Hay wrote: > > On Tue, Nov 15, 2016 at 04:43:08PM +0530, Himanshu Joshi wrote: > > As root: > > source /opt/sge/default/common/settings.s

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-18 Thread William Hay
On Fri, Nov 18, 2016 at 08:27:36PM +0530, Himanshu Joshi wrote: >During my install_execd, I was not able to install default queue with some >error message I do not remember I think that might be one of the problem >The hostname is the same which I use to ssh this machine i.e. mbialjpj >

Re: [SGE-discuss] Systemd more friendly sgemaster

2016-11-28 Thread William Hay
On Mon, Nov 28, 2016 at 07:50:11AM +, Ondrej Valousek wrote: >Hello, > >I am just asking if it would be possible to modify sge_qmaster to do not >fork (based on say some env variable). The bug you reference points out that the variable SGE_ND does this already. The problem as see

Re: [SGE-discuss] Systemd more friendly sgemaster

2016-11-28 Thread William Hay
On Mon, Nov 28, 2016 at 10:59:21AM +, Mike Grant wrote: > Writing as the original reporter of that bug, here's a bit more info.. > > On 28/11/16 10:34, William Hay wrote: > > What version of grid engine are you using? > > The bug and the comments below refer to t

Re: [SGE-discuss] Systemd more friendly sgemaster

2016-11-28 Thread William Hay
On Mon, Nov 28, 2016 at 10:46:35AM +, Ondrej Valousek wrote: > I am using GE version 8.1.8. > I did not try SGE_ND yet as I was not aware of it > I will give it a try - will see how it goes. > Thanks for the hint. I'm not a systemd expert by any means but something like the following might

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-28 Thread William Hay
On Mon, Nov 28, 2016 at 06:16:00PM +0530, Himanshu Joshi wrote: > >Now installation of sge is done > >ps aux | grep "sge" command says > >root 7407 0.0 0.2 213524 38396 ?Sl 16:37 0:01 >/opt/sge/bin/lx-amd64/sge_qmaster >root 9962 0.0 0.0 112648 960

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-29 Thread William Hay
On Tue, Nov 29, 2016 at 03:52:05PM +0530, Himanshu Joshi wrote: >On Mon, Nov 28, 2016 at 9:26 PM, William Hay wrote: > > On Mon, Nov 28, 2016 at 06:16:00PM +0530, Himanshu Joshi wrote: > > > >Now installation of sge is done > > > &g

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-29 Thread William Hay
On Tue, Nov 29, 2016 at 05:43:47PM +0530, Himanshu Joshi wrote: >On Tue, Nov 29, 2016 at 5:30 PM, William Hay wrote: > > On Tue, Nov 29, 2016 at 03:52:05PM +0530, Himanshu Joshi wrote: > >On Mon, Nov 28, 2016 at 9:26 PM, William Hay > wrote: > &

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-30 Thread William Hay
On Tue, Nov 29, 2016 at 10:35:35PM +0530, Himanshu Joshi wrote: >On Tue, Nov 29, 2016 at 8:57 PM, William Hay wrote: > > On Tue, Nov 29, 2016 at 05:43:47PM +0530, Himanshu Joshi wrote: > >On Tue, Nov 29, 2016 at 5:30 PM, William Hay > wrote: > &

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-30 Thread William Hay
On Wed, Nov 30, 2016 at 04:50:02PM +0530, Himanshu Joshi wrote: >On Wed, Nov 30, 2016 at 4:04 PM, William Hay wrote: > > On Tue, Nov 29, 2016 at 10:35:35PM +0530, Himanshu Joshi wrote: > >On Tue, Nov 29, 2016 at 8:57 PM, William Hay > wrote: > &

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-11-30 Thread William Hay
On Wed, Nov 30, 2016 at 07:14:48PM +0530, Himanshu Joshi wrote: >On Wed, Nov 30, 2016 at 7:03 PM, William Hay wrote: > > On Wed, Nov 30, 2016 at 04:50:02PM +0530, Himanshu Joshi wrote: > >On Wed, Nov 30, 2016 at 4:04 PM, William Hay > wrote: > &

Re: [SGE-discuss] Fwd: Error at the time of Distribution staging

2016-12-01 Thread William Hay
On Thu, Dec 01, 2016 at 10:54:07AM +0530, Himanshu Joshi wrote: > If so then that sounds like something else > is using the port the sge_execd is trying to use. Also 1024 isn't the > default > port for sge_execd. Did you deliberately set it to something unusual > when >

Re: [SGE-discuss] USE_CGROUPS

2016-12-22 Thread William Hay
On Tue, Dec 20, 2016 at 02:09:22PM +, Ondrej Valousek wrote: >Hi List, > > > >I just enabled USE_CGROUPS execd parameters and I observe that > >- Relevant job cgroup is created in /dev/cpuset/sge > >- Task PIDs can not be found in /dev/cpuset/sge//task