I looked at some of our busy 96-core nodes where users are currently
running the STAR-CCM+ CFD software.
One job runs on four 96-core nodes. I'm amazed that each STAR-CCM+ process
has almost 1000 open files, for example:
$ lsof -p 440938 | wc -l
950
and that on this node the user has al
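
A note on the count above: the output of lsof -p includes a header line as well as entries for the working directory, the program text and memory-mapped libraries, so the line count overstates the number of open file descriptors. A minimal sketch for counting only the descriptors, assuming Linux with /proc mounted and permission to read the process's fd directory (count_fds is a hypothetical helper name, not part of lsof or Slurm):

```shell
# count_fds is a hypothetical helper, shown for illustration only.
# Assumes Linux with /proc mounted and read access to /proc/<pid>/fd.
count_fds() {
    # Each entry in /proc/<pid>/fd corresponds to one open file descriptor.
    ls "/proc/$1/fd" 2>/dev/null | wc -l
}

# Example: count the descriptors held by the current shell.
count_fds "$$"
```

The result can be compared with the per-process limit reported by ulimit -n in the job's environment.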
We installed Slurm 23.11.5 and we are receiving "JobId=n has invalid
account" for every sbatch job.
We are not using the Slurm accounting/user database; we are using uniform
UIDs and GIDs across the cluster.
The jobs run and complete; can these invalid account errors be ignored or
silenced?
Job S