We are pleased to announce the availability of Slurm version 20.11.6.
This includes a number of minor-to-moderate severity fixes, as well as improvements to the recently added job_container/tmpfs plugin.
Slurm can be downloaded from https://www.schedmd.com/downloads.php . - Tim -- Tim Wickberg Chief Technology Officer, SchedMD LLC Commercial Slurm Development and Support
* Changes in Slurm 20.11.6 ========================== -- Fix sacct assert with the --qos option. -- Use pkg-config --atleast-version instead of --modversion for systemd. -- common/fd - fix getsockopt() call in fd_get_socket_error(). -- Properly handle the return from fd_get_socket_error() in _conn_readable(). -- cons_res - Fix issue where running jobs were not taken into consideration when creating a reservation. -- Avoid a deadlock between job_list for_each and assoc QOS_LOCK. -- Fix TRESRunMins usage for partition qos on restart/reconfig. -- Fix printing of number of tasks on a completed job that didn't request tasks. -- Fix updating GrpTRESRunMins when decrementing job time is bigger than it. -- Make it so we handle multithreaded allocations correctly when doing --exclusive or --core-spec allocations. -- Fix incorrect round-up division in _pick_step_cores -- Use appropriate math to adjust cpu counts when --ntasks-per-core=1. -- cons_tres - Fix consideration of power downed nodes. -- cons_tres - Fix DefCpuPerGPU, increase cpus-per-task to match with gpus-per-task * cpus-per-gpu. -- Fix under-cpu memory auto-adjustment when MaxMemPerCPU is set. -- Make it possible to override CR_CORE_DEFAULT_DIST_BLOCK. -- Perl API - fix retrieving/storing of slurm_step_id_t in job_step_info_t. -- Recover state of burst buffers when slurmctld is restarted to avoid skipping burst buffer stages. -- Fix race condition in burst buffer plugin which caused a burst buffer in stage-in to not get state saved if slurmctld stopped. -- auth/jwt - print an error if jwt_file= has not been set in slurmdbd. -- Fix RESV_DEL_HOLD not being a valid state when using squeue --states. -- Add missing squeue selectable states in valid states error message. -- Fix scheduling last array task multiple times on error, causing segfault. -- Fix issue where a step could be allocated more memory than the job when dealing with --mem-per-cpu and --threads-per-core. -- Fix removing qos from assoc with -= can lead to assoc with no qos -- auth/jwt - fix segfault on invalid credential in slurmdbd due to missing validate_slurm_user() function in context. -- Fix single Port= not being applied to range of nodes in slurm.conf -- Fix Jobs not requesting a tres are not starting because of that tres limit. -- acct_gather_energy/rapl - fix AveWatts calculation. -- job_container/tmpfs - Fix issues with cleanup and slurmd restarting on running jobs.