Accepting request 942081 from home:mslacken:branches:network:cluster
- update to 21.08.5 with following changes:
  * Fix issue where typeless GRES node updates were not immediately reflected.
  * Fix setting the default scrontab job working directory so that it's the home
    of the different user (-u <user>) and not that of root or SlurmUser editor.
  * Fix stepd not respecting SlurmdSyslogDebug.
  * Fix concurrency issue with squeue.
  * Fix job start time not being reset after launch when job is packed onto
    already booting node.
  * Fix updating SLURM_NODE_ALIASES for jobs packed onto powering up nodes.
  * Cray - Fix issues with starting hetjobs.
  * auth/jwks - Print fatal() message when jwks is configured but file could
    not be opened.
  * If sacctmgr has an association with an unknown qos as the default qos
    print 'UNKN*###' instead of leaving a blank name.
  * Correctly determine task count when giving --cpus-per-gpu, --gpus and
    --ntasks-per-node without task count.
  * slurmctld - Fix places where the global last_job_update was not being set
    to the time of update when a job's reason and description were updated.
  * slurmctld - Fix case where a job submitted with more than one partition
    would not have its reason updated while waiting to start.
  * Fix memory leak in node feature rebooting.
  * Fix time limit permanently set to 1 minute by backfill for job array tasks
    higher than the first with QOS NoReserve flag and PreemptMode configured.
  * Fix sacct -N to show jobs that started in the current second.
  * Fix issue on running steps where both SLURM_NTASKS_PER_TRES and
    SLURM_NTASKS_PER_GPU are set.
  * Handle oversubscription request correctly when also requesting
    --ntasks-per-tres.
  * Correctly detect when a step requests bad gres inside an allocation.
  * slurmstepd - Correct possible deadlock when UnkillableStepTimeout triggers.

OBS-URL: https://build.opensuse.org/request/show/942081
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=195
parent 350be975f5
commit d442993ff4
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:cf5256dfe08c565ddcca387a4d902a1643e125d427c8c2e0c07a32c950bc57aa
-size 6743524
3
slurm-21.08.5.tar.bz2
Normal file
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fce78185c5c69b3a9143286df641725503be7aa4c1d5cec9161ec72905ed4f8a
+size 6741051
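The tarball is stored as a Git LFS pointer, whose `oid sha256:` field is the SHA-256 of the real file and whose `size` field is its length in bytes. A minimal sketch of checking a downloaded tarball against such a pointer follows; the helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical and not part of any Git LFS tooling.

```python
import hashlib
import os


def parse_lfs_pointer(text):
    """Parse Git LFS pointer text into (sha256_hex, size_in_bytes)."""
    fields = {}
    for line in text.splitlines():
        # Each pointer line has the form "<key> <value>".
        key, _, value = line.partition(" ")
        if key:
            fields[key] = value.strip()
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    return digest, int(fields["size"])


def matches_pointer(path, pointer_text, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and digest in the pointer."""
    oid, size = parse_lfs_pointer(pointer_text)
    if os.path.getsize(path) != size:
        return False
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Hash in chunks so large tarballs are not read into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest() == oid


# Pointer content copied from the hunk above.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:fce78185c5c69b3a9143286df641725503be7aa4c1d5cec9161ec72905ed4f8a
size 6741051
"""
```

With the tarball in the current directory, `matches_pointer("slurm-21.08.5.tar.bz2", POINTER)` would report whether the local file is the one this commit refers to.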
@@ -1,3 +1,49 @@
+-------------------------------------------------------------------
+Wed Dec 22 09:24:28 UTC 2021 - Christian Goll <cgoll@suse.com>
+
+- update to 21.08.5 with following changes:
+  * Fix issue where typeless GRES node updates were not immediately reflected.
+  * Fix setting the default scrontab job working directory so that it's the home
+    of the different user (-u <user>) and not that of root or SlurmUser editor.
+  * Fix stepd not respecting SlurmdSyslogDebug.
+  * Fix concurrency issue with squeue.
+  * Fix job start time not being reset after launch when job is packed onto
+    already booting node.
+  * Fix updating SLURM_NODE_ALIASES for jobs packed onto powering up nodes.
+  * Cray - Fix issues with starting hetjobs.
+  * auth/jwks - Print fatal() message when jwks is configured but file could
+    not be opened.
+  * If sacctmgr has an association with an unknown qos as the default qos
+    print 'UNKN*###' instead of leaving a blank name.
+  * Correctly determine task count when giving --cpus-per-gpu, --gpus and
+    --ntasks-per-node without task count.
+  * slurmctld - Fix places where the global last_job_update was not being set
+    to the time of update when a job's reason and description were updated.
+  * slurmctld - Fix case where a job submitted with more than one partition
+    would not have its reason updated while waiting to start.
+  * Fix memory leak in node feature rebooting.
+  * Fix time limit permanently set to 1 minute by backfill for job array tasks
+    higher than the first with QOS NoReserve flag and PreemptMode configured.
+  * Fix sacct -N to show jobs that started in the current second.
+  * Fix issue on running steps where both SLURM_NTASKS_PER_TRES and
+    SLURM_NTASKS_PER_GPU are set.
+  * Handle oversubscription request correctly when also requesting
+    --ntasks-per-tres.
+  * Correctly detect when a step requests bad gres inside an allocation.
+  * slurmstepd - Correct possible deadlock when UnkillableStepTimeout triggers.
+  * srun - use maximum number of open files while handling job I/O.
+  * Fix writing to Xauthority files on root_squash NFS exports, which was
+    preventing X11 forwarding from completing setup.
+  * Fix regression in 21.08.0rc1 that broke --gres=none.
+  * Fix srun --cpus-per-task and --threads-per-core not implicitly setting
+    --exact. It was meant to work this way in 21.08.
+  * Fix regression in 21.08.0 that broke dynamic future nodes.
+  * Fix dynamic future nodes remembering active state on restart.
+  * Fix powered down nodes getting stuck in COMPLETING+POWERED_DOWN when job is
+    cancelled before nodes are powering up.
+
+
+
 -------------------------------------------------------------------
 Wed Nov 17 08:33:13 UTC 2021 - Christian Goll <cgoll@suse.com>
@@ -18,7 +18,7 @@

 # Check file META in sources: update so_version to (API_CURRENT - API_AGE)
 %define so_version 37
-%define ver 21.08.4
+%define ver 21.08.5
 %define _ver _21_08
 %define dl_ver %{ver}
 # so-version is 0 and seems to be stable
@@ -1269,6 +1269,7 @@ exit 0
%{_mandir}/man5/slurm.*
%{_mandir}/man5/cgroup.*
%{_mandir}/man5/gres.*
%{_mandir}/man5/helpers.*
%{_mandir}/man5/nonstop.conf.5.*
%{_mandir}/man5/oci.conf.5.gz
%{_mandir}/man5/topology.*
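The spec comment says `so_version` should be updated to (API_CURRENT - API_AGE) from the META file in the sources. A minimal sketch of that arithmetic follows. It assumes META stores these as `KEY: value` lines; the sample text is hypothetical, chosen only to agree with the `%define so_version 37` in the spec, and is not copied from the real file.

```python
import re


def so_version_from_meta(meta_text):
    """Compute API_CURRENT - API_AGE from META-style 'KEY: value' lines."""
    values = {}
    for line in meta_text.splitlines():
        # Only the two fields named in the spec comment are needed.
        match = re.match(r"\s*(API_CURRENT|API_AGE)\s*:\s*(\d+)\s*$", line)
        if match:
            values[match.group(1)] = int(match.group(2))
    return values["API_CURRENT"] - values["API_AGE"]


# Hypothetical META excerpt (assumed layout, illustrative values).
SAMPLE_META = """\
  Name:         slurm
  Version:      21.08.5
  API_CURRENT:  37
  API_AGE:      0
"""

print(so_version_from_meta(SAMPLE_META))  # 37 - 0 = 37
```

A missing field raises `KeyError`, which is deliberate here: a silent default could put a wrong soname into the package.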