I found the problem. It was between the chair and keyboard:

$ salloc -p general -q qos -t 00:30:00

When I type the qos right, it works:

$ salloc -p general -q debug -t 00:30:00 -A unix
salloc: Granted job allocation 529343

$ scontrol  show job 529343 | grep QOS
   Priority=13736 Nice=0 Account=unix QOS=debug

I will go stand in the corner now...


Prentice Bisbal
Lead Software Engineer
Princeton Plasma Physics Laboratory
http://www.pppl.gov

On 7/15/19 3:18 PM, Prentice Bisbal wrote:

That explanation makes perfect sense, but after adding debug to my list of QOSes in my associations, I still get the same error:

$ sacctmgr show user pbisbal withassoc -p
User|Def Acct|Admin|Cluster|Account|Partition|Share|MaxJobs|MaxNodes|MaxCPUs|MaxSubmit|MaxWall|MaxCPUMins|QOS|Def QOS|
pbisbal|unix|Administrator|pppl|unix||1|||||||dawson,debug,default,ellis,fenx,fielder,general,gque,greene,interactive,interruptible,jassby,kruskal,mccune,mque,normal,pswift,sque|general|
pbisbal|unix|Administrator|pppl|transp||1|||||||dawson,debug,ellis,general,interactive,kruskal,mccune,mque,normal|general|
pbisbal|unix|Administrator|test_pppl|transp||1|||||||dawson,debug,ellis,general,interactive,kruskal,mccune,mque,normal|general|
pbisbal|unix|Administrator|test_pppl|unix||1|||||||dawson,debug,default,ellis,fenx,fielder,general,gque,greene,interactive,interruptible,jassby,kruskal,mccune,mque,normal,pswift,sque|general|

And then:

$ salloc -p general -q qos -t 00:30:00
salloc: error: Job submit/allocate failed: Invalid qos specification

Prentice
On 7/15/19 2:55 PM, David Rhey wrote:
I ran into this recently. You need to make sure your user account has access to that QoS through sacctmgr. Right now I'd say if you did sacctmgr show user <user> withassoc that the QoS you're attempting to use is NOT listed as part of the association.

On Mon, Jul 15, 2019 at 2:53 PM Prentice Bisbal <pbis...@pppl.gov <mailto:pbis...@pppl.gov>> wrote:

    Slurm users,

    I have created a partition named general should allow the QOSes
    'general' and 'debug':

    PartitionName=general Default=YES AllowQOS=general,debug Nodes=.....

    However, when I try to request that QOS, I get an error:

    $ salloc -p general -q debug  -t 00:30:00
    salloc: error: Job submit/allocate failed: Invalid qos specification

    I'm sure I'm overlooking  something obvious. Any idea what that
    may be?
    I'm using slurm 18.08.8 on the slurm controller, and the clients are
    still at 18.08.7 until tomorrow morning.

    --
    Prentice




--
David Rhey
---------------
Advanced Research Computing - Technology Services
University of Michigan

Reply via email to