Accepting request 923425 from home:mslacken:branches:network:cluster
- update to 21.08.2
  - major change:
    * removed support for the TaskAffinity=yes option in cgroup.conf. Please
      consider using "TaskPlugins=cgroup,affinity" in slurm.conf as an option.
  - minor changes and bugfixes:
    * slurmctld - fix how the max number of cores on a node in a partition is
      calculated when the partition contains multi-socket nodes. This in turn
      corrects certain jobs' node count estimations displayed client-side.
    * job_submit/cray_aries - fix "craynetwork" GRES specification after changes
      introduced in 21.08.0rc1 that made TRES always have a type prefix.
    * Ignore nonsensical check in the slurmd for [Pro|Epi]logSlurmctld.
    * Fix writing to stderr/syslog when systemd runs slurmctld in the foreground.
    * Fix issue with updating job started with node range.
    * Fix issue with nodes not clearing state in the database when the slurmctld
      is started with clean-start.
    * Fix hetjob components > 1 timing out due to InactiveLimit.
    * Fix sprio printing -nan for normalized association priority if
      PriorityWeightAssoc was not defined.
    * Disallow FirstJobId=0.
    * Preserve job start info in the database for a requeued job that hadn't
      registered the first time in the database yet.
    * Only send one message on prolog failure from the slurmd.
    * Remove support for TaskAffinity=yes in cgroup.conf.
    * accounting_storage/mysql - fix issue where querying jobs via sacct
      --whole-hetjob=yes or slurmrestd (which automatically includes this flag)
      could in some cases return more records than expected.
    * Fix issue for preemption of job array task that makes afterok dependency
      fail. Additionally, send emails when requeueing happens due to preemption.
    * Fix sending requeue mail type.
    * Properly resize a job's GRES bitmaps and counts when resizing the job.

OBS-URL: https://build.opensuse.org/request/show/923425
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=190
parent 64b9f7f60a
commit 7a20fda376
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:5b648864036477f5c188c750b9bdaefaa429b04c36912d4db2a86ced2a8e0c04
-size 6736996

3 slurm-21.08.2.tar.bz2 Normal file
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:6cfea3ae89021dd5986109ef0bda5ad1418f88b61a446631bea576fd6c3399f3
+size 6739808
@@ -1,3 +1,47 @@
+-------------------------------------------------------------------
+Wed Oct 6 07:12:52 UTC 2021 - Christian Goll <cgoll@suse.com>
+
+- update to 21.08.2
+  - major change:
+    * removed support for the TaskAffinity=yes option in cgroup.conf. Please
+      consider using "TaskPlugins=cgroup,affinity" in slurm.conf as an option.
+  - minor changes and bugfixes:
+    * slurmctld - fix how the max number of cores on a node in a partition is
+      calculated when the partition contains multi-socket nodes. This in turn
+      corrects certain jobs' node count estimations displayed client-side.
+    * job_submit/cray_aries - fix "craynetwork" GRES specification after changes
+      introduced in 21.08.0rc1 that made TRES always have a type prefix.
+    * Ignore nonsensical check in the slurmd for [Pro|Epi]logSlurmctld.
+    * Fix writing to stderr/syslog when systemd runs slurmctld in the foreground.
+    * Fix issue with updating job started with node range.
+    * Fix issue with nodes not clearing state in the database when the slurmctld
+      is started with clean-start.
+    * Fix hetjob components > 1 timing out due to InactiveLimit.
+    * Fix sprio printing -nan for normalized association priority if
+      PriorityWeightAssoc was not defined.
+    * Disallow FirstJobId=0.
+    * Preserve job start info in the database for a requeued job that hadn't
+      registered the first time in the database yet.
+    * Only send one message on prolog failure from the slurmd.
+    * Remove support for TaskAffinity=yes in cgroup.conf.
+    * accounting_storage/mysql - fix issue where querying jobs via sacct
+      --whole-hetjob=yes or slurmrestd (which automatically includes this flag)
+      could in some cases return more records than expected.
+    * Fix issue for preemption of job array task that makes afterok dependency
+      fail. Additionally, send emails when requeueing happens due to preemption.
+    * Fix sending requeue mail type.
+    * Properly resize a job's GRES bitmaps and counts when resizing the job.
+    * Fix node being able to transition to CLOUD state from non-cloud state.
+    * Fix regression introduced in 21.08.0rc1 which broke a step's ability to
+      inherit GRES from the job when the step didn't request GRES but the job did.
+    * Fix errors in logic when picking nodes based on bracketed anded constraints.
+      This also enforces the requirement to have a count when using such
+      constraints.
+    * Handle job resize better in the database.
+    * Exclude currently running, resized jobs from the runaway jobs list.
+    * Make it possible to shrink a job more than once.
+
+
 -------------------------------------------------------------------
 Tue Sep 28 15:53:38 UTC 2021 - Christian Goll <cgoll@suse.com>
 
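
The major change in this changelog entry means an existing cgroup.conf that still sets TaskAffinity=yes needs attention before moving to 21.08.2. A minimal pre-upgrade check might look like the sketch below. It is not part of the package: the function name, the sample file content, and the assumption that keys and values are matched case-insensitively are illustrative only.

```python
import re

# Matches an active (uncommented) "TaskAffinity=yes" setting.
# Case-insensitive matching is an assumption of this sketch.
_TASK_AFFINITY_YES = re.compile(r"^\s*TaskAffinity\s*=\s*yes\b", re.IGNORECASE)


def find_task_affinity_lines(cgroup_conf_text):
    """Return 1-based line numbers where TaskAffinity=yes is still set."""
    hits = []
    for number, line in enumerate(cgroup_conf_text.splitlines(), start=1):
        # Drop everything after '#' so commented-out settings are not reported.
        active = line.split("#", 1)[0]
        if _TASK_AFFINITY_YES.match(active):
            hits.append(number)
    return hits


# Hypothetical cgroup.conf content used only to exercise the check.
sample = """\
CgroupAutomount=yes
ConstrainCores=yes
TaskAffinity=yes
# TaskAffinity=yes (old, commented out)
"""
print(find_task_affinity_lines(sample))  # prints [3]
```

Reading the real file (for example with `open(path).read()`) and deciding what to do with the reported lines is left to the administrator; the sketch only locates the removed option.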
@@ -18,7 +18,7 @@
 
 # Check file META in sources: update so_version to (API_CURRENT - API_AGE)
 %define so_version 37
-%define ver 21.08.1
+%define ver 21.08.2
 %define _ver _21_08
 %define dl_ver %{ver}
 # so-version is 0 and seems to be stable
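
The spec comment above asks the packager to recompute so_version as API_CURRENT - API_AGE from the META file on each update. A rough sketch of that arithmetic follows. The helper name is hypothetical, and the assumed "KEY: value" line layout of META is a guess at the file format rather than something confirmed by this commit.

```python
import re


def so_version_from_meta(meta_text):
    """Compute API_CURRENT - API_AGE from the text of a META file.

    Assumes lines of the form "API_CURRENT: <int>" and "API_AGE: <int>".
    """
    fields = {}
    for line in meta_text.splitlines():
        match = re.match(r"^\s*(API_CURRENT|API_AGE)\s*:\s*(\d+)\s*$", line)
        if match:
            fields[match.group(1)] = int(match.group(2))
    missing = {"API_CURRENT", "API_AGE"} - set(fields)
    if missing:
        raise ValueError("META is missing: " + ", ".join(sorted(missing)))
    return fields["API_CURRENT"] - fields["API_AGE"]


# Illustrative excerpt; the values are chosen so the result matches the
# so_version 37 in the spec and are not copied from the real META file.
sample_meta = """\
  API_CURRENT: 37
  API_AGE: 0
"""
print(so_version_from_meta(sample_meta))  # prints 37
```

Here the result equals the `%define so_version 37` line, which is consistent with so_version staying unchanged in this update while only `ver` moves from 21.08.1 to 21.08.2.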