Accepting request 942081 from home:mslacken:branches:network:cluster

- update to 21.08.5 with following changes:
  * Fix issue where typeless GRES node updates were not immediately reflected.
  * Fix setting the default scrontab job working directory so that it's the home
    of the different user (*u <user>) and not that of root or SlurmUser editor.
  * Fix stepd not respecting SlurmdSyslogDebug.
  * Fix concurrency issue with squeue.
  * Fix job start time not being reset after launch when job is packed onto
    already booting node.
  * Fix updating SLURM_NODE_ALIASES for jobs packed onto powering up nodes.
  * Cray - Fix issues with starting hetjobs.
  * auth/jwks - Print fatal() message when jwks is configured but file could
    not be opened.
  * If sacctmgr has an association with an unknown qos as the default qos
    print 'UNKN*###' instead of leaving a blank name.
  * Correctly determine task count when giving --cpus-per-gpu, --gpus and
    *-ntasks-per-node without task count.
  * slurmctld - Fix places where the global last_job_update was not being set
    to the time of update when a job's reason and description were updated.
  * slurmctld - Fix case where a job submitted with more than one partition
    would not have its reason updated while waiting to start.
  * Fix memory leak in node feature rebooting.
  * Fix time limit permanetly set to 1 minute by backfill for job array tasks
    higher than the first with QOS NoReserve flag and PreemptMode configured.
  * Fix sacct -N to show jobs that started in the current second
  * Fix issue on running steps where both SLURM_NTASKS_PER_TRES and
    SLURM_NTASKS_PER_GPU are set.
  * Handle oversubscription request correctly when also requesting
    *-ntasks-per-tres.
  * Correctly detect when a step requests bad gres inside an allocation.
  * slurmstepd - Correct possible deadlock when UnkillableStepTimeout triggers.

OBS-URL: https://build.opensuse.org/request/show/942081
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=195
This commit is contained in:
Christian Goll 2021-12-23 10:26:41 +00:00 committed by Git OBS Bridge
parent 350be975f5
commit d442993ff4
4 changed files with 51 additions and 4 deletions

View File

@ -1,3 +0,0 @@
version https://git-lfs.github.com/spec/v1
oid sha256:cf5256dfe08c565ddcca387a4d902a1643e125d427c8c2e0c07a32c950bc57aa
size 6743524

3
slurm-21.08.5.tar.bz2 Normal file
View File

@ -0,0 +1,3 @@
version https://git-lfs.github.com/spec/v1
oid sha256:fce78185c5c69b3a9143286df641725503be7aa4c1d5cec9161ec72905ed4f8a
size 6741051

View File

@ -1,3 +1,49 @@
-------------------------------------------------------------------
Wed Dec 22 09:24:28 UTC 2021 - Christian Goll <cgoll@suse.com>
- update to 21.08.5 with following changes:
* Fix issue where typeless GRES node updates were not immediately reflected.
* Fix setting the default scrontab job working directory so that it's the home
of the different user (*u <user>) and not that of root or SlurmUser editor.
* Fix stepd not respecting SlurmdSyslogDebug.
* Fix concurrency issue with squeue.
* Fix job start time not being reset after launch when job is packed onto
already booting node.
* Fix updating SLURM_NODE_ALIASES for jobs packed onto powering up nodes.
* Cray - Fix issues with starting hetjobs.
* auth/jwks - Print fatal() message when jwks is configured but file could
not be opened.
* If sacctmgr has an association with an unknown qos as the default qos
print 'UNKN*###' instead of leaving a blank name.
* Correctly determine task count when giving --cpus-per-gpu, --gpus and
*-ntasks-per-node without task count.
* slurmctld - Fix places where the global last_job_update was not being set
to the time of update when a job's reason and description were updated.
* slurmctld - Fix case where a job submitted with more than one partition
would not have its reason updated while waiting to start.
* Fix memory leak in node feature rebooting.
* Fix time limit permanetly set to 1 minute by backfill for job array tasks
higher than the first with QOS NoReserve flag and PreemptMode configured.
* Fix sacct -N to show jobs that started in the current second
* Fix issue on running steps where both SLURM_NTASKS_PER_TRES and
SLURM_NTASKS_PER_GPU are set.
* Handle oversubscription request correctly when also requesting
*-ntasks-per-tres.
* Correctly detect when a step requests bad gres inside an allocation.
* slurmstepd - Correct possible deadlock when UnkillableStepTimeout triggers.
* srun - use maximum number of open files while handling job I/O.
* Fix writing to Xauthority files on root_squash NFS exports, which was
preventing X11 forwarding from completing setup.
* Fix regression in 21.08.0rc1 that broke --gres=none.
* Fix srun --cpus-per-task and --threads-per-core not implicitly setting
*-exact. It was meant to work this way in 21.08.
* Fix regression in 21.08.0 that broke dynamic future nodes.
* Fix dynamic future nodes remembering active state on restart.
* Fix powered down nodes getting stuck in COMPLETING+POWERED_DOWN when job is
cancelled before nodes are powering up.
-------------------------------------------------------------------
Wed Nov 17 08:33:13 UTC 2021 - Christian Goll <cgoll@suse.com>

View File

@ -18,7 +18,7 @@
# Check file META in sources: update so_version to (API_CURRENT - API_AGE)
%define so_version 37
%define ver 21.08.4
%define ver 21.08.5
%define _ver _21_08
%define dl_ver %{ver}
# so-version is 0 and seems to be stable
@ -1269,6 +1269,7 @@ exit 0
%{_mandir}/man5/slurm.*
%{_mandir}/man5/cgroup.*
%{_mandir}/man5/gres.*
%{_mandir}/man5/helpers.*
%{_mandir}/man5/nonstop.conf.5.*
%{_mandir}/man5/oci.conf.5.gz
%{_mandir}/man5/topology.*