Accepting request 1220076 from network:cluster
- Update to version 24.05.4 & fix for CVE-2024-48936. * Fix generic int sort functions. * Fix user look up using possible unrealized uid in the dbd. * `slurmrestd` - Fix regressions that allowed `slurmrestd` to be run as SlurmUser when `SlurmUser` was not root. * mpi/pmix fix race conditions with het jobs at step start/end which could make srun to hang. * Fix not showing some `SelectTypeParameters` in `scontrol show config`. * Avoid assert when dumping removed certain fields in JSON/YAML. * Improve how shards are scheduled with affinity in mind. * Fix `MaxJobsAccruePU` not being respected when `MaxJobsAccruePA` is set in the same QOS. * Prevent backfill from planning jobs that use overlapping resources for the same time slot if the job's time limit is less than `bf_resolution`. * Fix memory leak when requesting typed gres and `--[cpus|mem]-per-gpu`. * Prevent backfill from breaking out due to "system state changed" every 30 seconds if reservations use `REPLACE` or `REPLACE_DOWN` flags. * `slurmrestd` - Make sure that scheduler_unset parameter defaults to true even when the following flags are also set: `show_duplicates`, `skip_steps`, `disable_truncate_usage_time`, `run_away_jobs`, `whole_hetjob`, `disable_whole_hetjob`, `disable_wait_for_result`, `usage_time_as_submit_time`, `show_batch_script`, and or `show_job_environment`. Additionaly, always make sure show_duplicates and `disable_truncate_usage_time` default to true when the following flags are also set: `scheduler_unset`, `scheduled_on_submit`, (forwarded request 1220075 from eeich) OBS-URL: https://build.opensuse.org/request/show/1220076 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=108
This commit is contained in:
commit
17d576bce0
@ -1,3 +0,0 @@
|
|||||||
version https://git-lfs.github.com/spec/v1
|
|
||||||
oid sha256:b0b40513e9b6ae867ddb95d60b950bcb980c15b735b5d0dea37a9a00cc64ae24
|
|
||||||
size 7189600
|
|
3
slurm-24.05.4.tar.bz2
Normal file
3
slurm-24.05.4.tar.bz2
Normal file
@ -0,0 +1,3 @@
|
|||||||
|
version https://git-lfs.github.com/spec/v1
|
||||||
|
oid sha256:240a2105c8801bc0d222fa2bbcf46f71392ef94cce9253357e5f43f029adaf9b
|
||||||
|
size 7183430
|
@ -1,3 +1,91 @@
|
|||||||
|
-------------------------------------------------------------------
|
||||||
|
Fri Nov 1 12:50:27 UTC 2024 - Egbert Eich <eich@suse.com>
|
||||||
|
|
||||||
|
- Update to version 24.05.4 & fix for CVE-2024-48936.
|
||||||
|
* Fix generic int sort functions.
|
||||||
|
* Fix user look up using possible unrealized uid in the dbd.
|
||||||
|
* `slurmrestd` - Fix regressions that allowed `slurmrestd` to
|
||||||
|
be run as SlurmUser when `SlurmUser` was not root.
|
||||||
|
* mpi/pmix fix race conditions with het jobs at step start/end
|
||||||
|
which could make srun to hang.
|
||||||
|
* Fix not showing some `SelectTypeParameters` in `scontrol show
|
||||||
|
config`.
|
||||||
|
* Avoid assert when dumping removed certain fields in JSON/YAML.
|
||||||
|
* Improve how shards are scheduled with affinity in mind.
|
||||||
|
* Fix `MaxJobsAccruePU` not being respected when `MaxJobsAccruePA`
|
||||||
|
is set in the same QOS.
|
||||||
|
* Prevent backfill from planning jobs that use overlapping
|
||||||
|
resources for the same time slot if the job's time limit is
|
||||||
|
less than `bf_resolution`.
|
||||||
|
* Fix memory leak when requesting typed gres and
|
||||||
|
`--[cpus|mem]-per-gpu`.
|
||||||
|
* Prevent backfill from breaking out due to "system state
|
||||||
|
changed" every 30 seconds if reservations use `REPLACE` or
|
||||||
|
`REPLACE_DOWN` flags.
|
||||||
|
* `slurmrestd` - Make sure that scheduler_unset parameter defaults
|
||||||
|
to true even when the following flags are also set:
|
||||||
|
`show_duplicates`, `skip_steps`, `disable_truncate_usage_time`,
|
||||||
|
`run_away_jobs`, `whole_hetjob`, `disable_whole_hetjob`,
|
||||||
|
`disable_wait_for_result`, `usage_time_as_submit_time`,
|
||||||
|
`show_batch_script`, and or `show_job_environment`. Additionaly,
|
||||||
|
always make sure show_duplicates and
|
||||||
|
`disable_truncate_usage_time` default to true when the following
|
||||||
|
flags are also set: `scheduler_unset`, `scheduled_on_submit`,
|
||||||
|
`scheduled_by_main`, `scheduled_by_backfill`, and or `job_started`.
|
||||||
|
This effects the following endpoints:
|
||||||
|
`GET /slurmdb/v0.0.40/jobs`
|
||||||
|
`GET /slurmdb/v0.0.41/jobs`
|
||||||
|
* Ignore `--json` and `--yaml` options for `scontrol` show config
|
||||||
|
to prevent mixing output types.
|
||||||
|
* Fix not considering nodes in reservations with Maintenance or
|
||||||
|
Overlap flags when creating new reservations with `nodecnt` or
|
||||||
|
when they replace down nodes.
|
||||||
|
* Fix suspending/resuming steps running under a 23.02 `slurmstepd`
|
||||||
|
process.
|
||||||
|
* Fix options like `sprio --me` and `squeue --me` for users with
|
||||||
|
a uid greater than 2147483647.
|
||||||
|
* `fatal()` if `BlockSizes=0`. This value is invalid and would
|
||||||
|
otherwise cause the `slurmctld` to crash.
|
||||||
|
* `sacctmgr` - Fix issue where clearing out a preemption list using
|
||||||
|
`preempt=''` would cause the given qos to no longer be preempt-able
|
||||||
|
until set again.
|
||||||
|
* Fix `stepmgr` creating job steps concurrently.
|
||||||
|
* `data_parser/v0.0.40` - Avoid dumping "Infinity" for `NO_VAL` tagged
|
||||||
|
"number" fields.
|
||||||
|
* `data_parser/v0.0.41` - Avoid dumping "Infinity" for `NO_VAL` tagged
|
||||||
|
"number" fields.
|
||||||
|
* `slurmctld` - Fix a potential leak while updating a reservation.
|
||||||
|
* `slurmctld` - Fix state save with reservation flags when a update
|
||||||
|
fails.
|
||||||
|
* Fix reservation update issues with parameters Accounts and Users, when
|
||||||
|
using +/- signs.
|
||||||
|
* `slurmrestd` - Don't dump warning on empty wckeys in:
|
||||||
|
`GET /slurmdb/v0.0.40/config`
|
||||||
|
`GET /slurmdb/v0.0.41/config`
|
||||||
|
* Fix slurmd possibly leaving zombie processes on start up in configless
|
||||||
|
when the initial attempt to fetch the config fails.
|
||||||
|
* Fix crash when trying to drain a non-existing node (possibly deleted
|
||||||
|
before).
|
||||||
|
* `slurmctld` - fix segfault when calculating limit decay for jobs with
|
||||||
|
an invalid association.
|
||||||
|
* Fix IPMI energy gathering with multiple sensors.
|
||||||
|
* `data_parser/v0.0.39` - Remove xassert requiring errors and warnings
|
||||||
|
to have a source string.
|
||||||
|
* `slurmrestd` - Prevent potential segfault when there is an error
|
||||||
|
parsing an array field which could lead to a double xfree. This
|
||||||
|
applies to several endpoints in `data_parser` v0.0.39, v0.0.40 and
|
||||||
|
v0.0.41.
|
||||||
|
* `scancel` - Fix a regression from 23.11.6 where using both the
|
||||||
|
`--ctld` and `--sibling` options would cancel the federated job on
|
||||||
|
all clusters instead of only the cluster(s) specified by `--sibling`.
|
||||||
|
* `accounting_storage/mysql` - Fix bug when removing an association
|
||||||
|
specified with an empty partition.
|
||||||
|
* Fix setting multiple partition state restore on a job correctly.
|
||||||
|
* Fix difference in behavior when swapping partition order in job
|
||||||
|
submission.
|
||||||
|
* Fix security issue in stepmgr that could permit an attacker to
|
||||||
|
execute processes under other users' jobs. CVE-2024-48936.
|
||||||
|
|
||||||
-------------------------------------------------------------------
|
-------------------------------------------------------------------
|
||||||
Wed Oct 23 08:54:29 UTC 2024 - Egbert Eich <eich@suse.com>
|
Wed Oct 23 08:54:29 UTC 2024 - Egbert Eich <eich@suse.com>
|
||||||
|
|
||||||
|
@ -19,7 +19,7 @@
|
|||||||
# Check file META in sources: update so_version to (API_CURRENT - API_AGE)
|
# Check file META in sources: update so_version to (API_CURRENT - API_AGE)
|
||||||
%define so_version 41
|
%define so_version 41
|
||||||
# Make sure to update `upgrades` as well!
|
# Make sure to update `upgrades` as well!
|
||||||
%define ver 24.05.3
|
%define ver 24.05.4
|
||||||
%define _ver _24_05
|
%define _ver _24_05
|
||||||
%define dl_ver %{ver}
|
%define dl_ver %{ver}
|
||||||
# so-version is 0 and seems to be stable
|
# so-version is 0 and seems to be stable
|
||||||
|
Loading…
Reference in New Issue
Block a user