We installed slurm 23.11.5 and we are receiving "JobId=n has invalid
account" for every sbatch job.
We are not using the slurm accounting/user database; we are using uniform
UIDs and GIDs across the cluster.

The jobs run and complete; can these invalid account errors be ignored or
silenced?

Job Submission Environment:
id joteumer
uid=938401109(joteumer) gid=938400513(SPG) groups=938400513(SPG),27(sudo)

Slurm Worker Node:
id joteumer
uid=938401109(joteumer) gid=938400513(SPG) groups=938400513(SPG),27(sudo)

slurmctld log:
[2024-04-18T09:46:40.000] sched: JobId=18 has invalid account

scontrol show job 18
JobId=18 JobName=simplejob.sh
   UserId=joteumer(938401109) GroupId=SPG(938400513) MCS_label=N/A
   Priority=1 Nice=0 *Account=(null) *QOS=(null)

Submit another sbatch job and update the job to include an Account
scontrol update jobid=19 Account=joteumer

[2024-04-18T09:56:05.126] _slurm_rpc_submit_batch_job: JobId=19 InitPrio=1
usec=485
[2024-04-18T09:56:06.000] sched: JobId=19 has invalid account
[2024-04-18T09:56:17.000] debug:  set_job_failed_assoc_qos_ptr: Filling in
assoc for JobId=19 Assoc=0
[2024-04-18T09:56:17.000] sched: JobId=19 has invalid account
[2024-04-18T09:56:17.588] debug:  set_job_failed_assoc_qos_ptr: Filling in
assoc for JobId=19 Assoc=0
[2024-04-18T09:56:27.505] _slurm_rpc_update_job: complete JobId=19 uid=0
usec=110
[2024-04-18T09:56:28.000] sched: JobId=19 has invalid account

scontrol show job 19
JobId=19 JobName=simplejob.sh
   UserId=joteumer(938401109) GroupId=SPG(938400513) MCS_label=N/A
   Priority=1 Nice=0 Account=(null) QOS=(null)


             JOBID PARTITION
          NAME     USER    STATE       TIME TIME_LIMI  NODES
NODELIST(REASON)
                19       SPG                                  simplejob
joteumer  PENDING       0:00  18:00:00      1 (InvalidAccount)
-- 
slurm-users mailing list -- slurm-users@lists.schedmd.com
To unsubscribe send an email to slurm-users-le...@lists.schedmd.com

Reply via email to