Re: [slurm-users] Persistent Interactive Jobs

2022-06-10 Thread Hadrian Djohari
output of a > > spawned terminal they could just ssh too. > > > > Regards, > > > > -- > > > > Willy Markuske > > > > > > > > HPC Systems Engineer > > > > > > > > Research Data Services > > > > P: (619) 519-4435 > > > > -- > Diego Zuccato > DIFA - Dip. di Fisica e Astronomia > Servizi Informatici > Alma Mater Studiorum - Università di Bologna > V.le Berti-Pichat 6/2 - 40127 Bologna - Italy > tel.: +39 051 20 95786 > > -- Hadrian Djohari Manager of Research Computing Services, [U]Tech Case Western Reserve University (W): 216-368-0395 (M): 216-798-7490

Re: [slurm-users] Unable to start slurmd service

2021-11-16 Thread Hadrian Djohari
CfgTRES=cpu=16,mem=40195M,billing=16 > >AllocTRES= > >CapWatts=n/a > >CurrentWatts=0 LowestJoules=0 ConsumedJoules=0 > >ExtSensorsJoules=n/s ExtSensorsWatts=0 ExtSensorsTemp=n/s > >Reason=Node unexpectedly rebooted [slurm@2021-11-16T14:41:04] > > &

Re: [slurm-users] derived counters

2021-04-13 Thread Hadrian Djohari
ay be difficult to answer your question from the Slurm database. > The sacct > > command displays accounting data for all jobs and job steps, but not > directly > > for partitions. > > > > There are other Slurm monitoring tools which perhaps can supply the data > you

Re: [slurm-users] Multifactor priority configuration

2020-01-22 Thread Hadrian Djohari
tories > University of York > Heslington > York > YO10 5DD > +44 (0)1904 32 4753 > > e-mail disclaimer: http://www.york.ac.uk/docs/disclaimer/email.htm > -- Hadrian Djohari Manager of Research Computing Services, [U]Tech Case Western Reserve University (W): 216-368-0395 (M): 216-798-7490

Re: [slurm-users] Add prolog and epilog to sbatch's job

2018-08-06 Thread Hadrian Djohari
th/to/scontrol show job "$SLURM_JOB_ID")" > echo "print scontrol show job $SLURM_JOB_ID" > echo "print $SLURM_JOB_INFO" > echo "==" > > > -- Hadrian Djohari Manager of Research Computing Services, [U]Tech Case Western Reserve University (W): 216-368-0395 (M): 216-798-7490

Re: [slurm-users] Unable to contact slurm controller

2018-07-31 Thread Hadrian Djohari
t > slurmctld.service entered failed state. > Jul 31 20:02:24 rocks7.jupiterclusterscu.com systemd[1]: > slurmctld.service failed. > > > Regards, > Mahmood > > > > On Tue, Jul 31, 2018 at 9:32 PM, Alex Chekholko > wrote: > >> Seems like your slurmctld is not

Re: [slurm-users] srun --x11 connection rejected because of wrong authentication

2018-06-11 Thread Hadrian Djohari
of slurm and have the X11 stuff work properly. > > Best, > Chris > > — > Christopher Coffey > High-Performance Computing > Northern Arizona University > 928-523-1167 > > > On 6/7/18, 6:49 PM, "slurm-users on behalf of Hadrian Djohari" < > sl

Re: [slurm-users] srun --x11 connection rejected because of wrong authentication

2018-06-07 Thread Hadrian Djohari
tely writeable: > > [root@cn100 ~]# touch /home/cbc/.Xauthority > [root@cn100 ~]# > > Anyone have any ideas? Thanks! > > Best, > Chris > > — > Christopher Coffey > High-Performance Computing > Northern Arizona University > 928-523-1167 > > > -- Hadrian Djohari Manager of Research Computing Services, [U]Tech Case Western Reserve University (W): 216-368-0395 (M): 216-798-7490

Re: [slurm-users] Distribute jobs in similar nodes in the same partition

2018-05-11 Thread Hadrian Djohari
You can use node feature in defining the node types in slurm.conf. Then when requesting for the job, use -C toy just use those node type. On Fri, May 11, 2018, 5:38 AM Antonio Lara wrote: > Hello everyone, > > Hopefully someone can help me with this, I cannot find in the manual if > this is e

Re: [slurm-users] SC17 - Tools for managing users and allocations

2017-11-16 Thread Hadrian Djohari
405)%20325-6371> > ++ > “Big whorls have little whorls, > That feed on their velocity; > And little whorls have lesser whorls, > And so on to viscosity.” > Lewis Fry Richardson (1881-1953) > -- Hadrian Djohari HPCC Manager, [U]Tech Case Western Reserve University (W): 216-368-0395 (M): 216-798-7490