Accepting request 1109308 from network:cluster

- Fixes since 23.02.03:
  Highlights:
  * Fix main scheduler loop not starting after a failover to backup controller.
  * Avoid slurmctld segfault when specifying `AccountingStorageExternalHost`
    (bsc#1214983).
  Other:
  * Fix sbatch return code when `--wait` is requested on a job array.
  * Fix collected `GPUUtilization` values for `acct_gather_profile` plugins.
  * Fix `slurmrestd` handling of job hold/release operations.
  * Make spank `S_JOB_ARGV` item value hold the requested command `argv`
    instead of the `srun --bcast` value when `--bcast` requested (only in local
    context).
  * Fix step running indefinitely when slurmctld takes more than
    `MessageTimeout` to respond. Now, slurmctld will cancel the step when
    detected, preventing following steps from getting stuck waiting for
    resources to be released.
  * Fix regression to make `job_desc.min_cpus` accurate again in job_submit when
    requesting a job with `--ntasks-per-node`.
  * Fix handling of `ArrayTaskThrottle` in backfill.
  * Fix regression in 23.02.2 when checking gres state on `slurmctld` startup or
    reconfigure. Gres changes in the configuration were not updated on slurmctld
    startup. On startup or reconfigure, these messages were present in the log:
    `"error: Attempt to change gres/gpu Count`".
  * Fix potential double count of gres when dealing with limits.
  * Fix slurmstepd segfault when ContainerPath is not set in `oci.conf`
  * Fixed an issue where jobs requesting licenses were incorrectly rejected.
  * `scrontab` - Fix cutting off the final character of quoted variables.
  * `smail` - Fix issues where e-mails at job completion were not being sent.
  * `scontrol/slurmctld` - fix comma parsing when updating a reservation's
    nodes.

OBS-URL: https://build.opensuse.org/request/show/1109308
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=92
This commit is contained in:
Ana Guerrero 2023-09-07 19:12:41 +00:00 committed by Git OBS Bridge
commit 6b47182efe

View File

@ -1,30 +1,174 @@
------------------------------------------------------------------- -------------------------------------------------------------------
Mon Aug 21 09:43:08 UTC 2023 - Christian Goll <cgoll@suse.com> Mon Aug 21 09:43:08 UTC 2023 - Christian Goll <cgoll@suse.com>
- updated to 23.02.04 which includes following changes: - Fixes since 23.02.03:
* fixing the main scheduler loop not starting on the backup controller after Highlights:
a failover event, a segfault when attempting to use * Fix main scheduler loop not starting after a failover to backup controller.
* AccountingStorageExternalHost, and an issue where steps could continue * Avoid slurmctld segfault when specifying `AccountingStorageExternalHost`
running indefinitely if the slurmctld takes too long to respond (bsc#1214983) (bsc#1214983).
* include a fix for a potential slurmctld crashes when the backup slurmctld Other:
takes over. * Fix sbatch return code when `--wait` is requested on a job array.
* This also fixes some issues when using older versions of the command line * Fix collected `GPUUtilization` values for `acct_gather_profile` plugins.
tools with a 23.02 controller. * Fix `slurmrestd` handling of job hold/release operations.
* srun/sbatch/salloc - In order to support user namespaces, process user and * Make spank `S_JOB_ARGV` item value hold the requested command `argv`
group ids are no longer used unless explicitly requested as an argument and instead of the `srun --bcast` value when `--bcast` requested (only in local
are left as nobody(99) by default. Any cli_filters or SPANK plugins need to context).
ignore any uid or gid that equal SLURM_AUTH_NOBODY (99). User and group ids * Fix step running indefinitely when slurmctld takes more than
are now resolved by the active auth plugin. To determine the actual job uid `MessageTimeout` to respond. Now, slurmctld will cancel the step when
or gid you should use the RESPONSE_RESOURCE_ALLOCATION RPC. detected, preventing following steps from getting stuck waiting for
- removed Fix-test-3.13.patch as fixed upstream resources to be released.
- removed Fix-test-38.11.patch as test changed upstream * Fix regression to make `job_desc.min_cpus` accurate again in job_submit when
requesting a job with `--ntasks-per-node`.
* Fix handling of `ArrayTaskThrottle` in backfill.
* Fix regression in 23.02.2 when checking gres state on `slurmctld` startup or
reconfigure. Gres changes in the configuration were not updated on slurmctld
startup. On startup or reconfigure, these messages were present in the log:
`"error: Attempt to change gres/gpu Count`".
* Fix potential double count of gres when dealing with limits.
* Fix slurmstepd segfault when ContainerPath is not set in `oci.conf`
* Fixed an issue where jobs requesting licenses were incorrectly rejected.
* `scrontab` - Fix cutting off the final character of quoted variables.
* `smail` - Fix issues where e-mails at job completion were not being sent.
* `scontrol/slurmctld` - fix comma parsing when updating a reservation's
nodes.
* Fix `--gpu-bind=single binding` tasks to wrong gpus, leading to some gpus
having more tasks than they should and other gpus being unused.
* Fix regression in 23.02 that causes slurmstepd to crash when srun requests
more than `TreeWidth` nodes in a step and uses the pmi2 or pmix plugin.
* `job_container/tmpfs` - Fix `%h` and `%n` substitution in `BasePath` where
`%h` was substituted as the NodeName instead of the hostname, and %n was
substituted as an empty string.
* Fix regression where `--cpu-bind=verbose` would override `TaskPluginParam`.
* `scancel` - Fix `--clusters/-M` for federations. Only filtered jobs (e.g.
`-A`, `-u`, `-p`, etc.) from the specified clusters will be canceled,
rather than all jobs in the federation. Specific jobids will still be
routed to the origin cluster for cancellation.
- Fixes since 23.02.02
Highlight:
* `slurmctld` - Fix backup slurmctld crash when it takes control multiple
times.
Other:
* Fix regression in 23.02.2 that ignored the partition `DefCpuPerGPU` setting
on the first pass of scheduling a job requesting `--gpus --ntasks`.
* `srun` - fix issue creating regular and interactive steps because
*_PACK_GROUP* environment variables were incorrectly set on non-HetSteps.
* Fix dynamic nodes getting stuck in allocated states when reconfiguring.
* Fix regression in 23.02.2 that set the `SLURM_NTASKS` environment variable
in sbatch jobs from `--ntasks-per-node` when `--ntasks` was not requested.
* Fix regression in 23.02 that caused sbatch jobs to set the wrong number
of tasks when requesting `--ntasks-per-node` without `--ntasks`, and also
requesting one of the following options: `--sockets-per-node`,
--cores-per-socket, --threads-per-core (or `--hint=nomultithread`), or
`-B,--extra-node-info`.
* Fix double counting suspended job counts on nodes when reconfiguring, which
prevented nodes with suspended jobs from being powered down or rebooted
once the jobs completed.
* Fix backfill not scheduling jobs submitted with `--prefer` and
`--constraint` properly.
* mpi/pmix - fix regression introduced in 23.02.2 which caused PMIx shmem
backed files permissions to be incorrect.
* api/submit - fix memory leaks when submission of batch regular jobs or batch
HetJobs fails (response data is a return code).
* Fix regression in 23.02 leading to error() messages being sent at `INFO`
instead of `ERR` in syslog.
* Fix `TresUsageIn[Tot|Ave]` calculation for `gres/gpumem` and `gres/gpuutil`.
* Fix issue in the gpu plugins where gpu frequencies would only be set if both
gpu memory and gpu frequencies were set, while one or the other suffices.
* Fix reservations group ACL's not working with the root group.
* Fix updating a job with a ReqNodeList greater than the job's node count.
* Fix inadvertent permission denied error for `--task-prolog` and
`--task-epilog` with filesystems mounted with `root_squash`.
* Fix missing detailed cpu and gres information in json/yaml output from
`scontrol`, `squeue` and `sinfo`.
* Fix regression in 23.02 that causes a failure to allocate job steps that
request `--cpus-per-gpu` and gpus with types.
* Fix potentially waiting indefinitely for a defunct process to finish,
which affects various scripts including `Prolog` and `Epilog`. This could
have various symptoms, such as jobs getting stuck in a completing state.
* Fix losing list of reservations on job when updating job with list of
reservations and restarting the controller.
* Fix nodes resuming after down and drain state update requests from
clients older than 23.02.
* Fix advanced reservation creation/update when an association that should
have access to it is composed with partition(s).
* Fix job layout calculations with `--ntasks-per-gpu`, especially when
`--nodes` has not been explicitly provided.
* Fix X11 forwarding for jobs submitted from the slurmctld host.
* When a job requests `--no-kill` and one or more nodes fail during the job,
fix subsequent job steps unable to use some of the remaining resources
allocated to the job.
* Fix shared gres allocation when using `--tres-per-task` with tasks that span
multiple sockets.
- Other changes
(since 23.02.3):
* `scontrol` - Permit changes to StdErr and StdIn for pending jobs.
* `scontrol` - Reset std{err,in,out} when set to empty string.
* `slurmrestd` - mark environment as a required field for job submission
descriptions.
* `slurmrestd` - avoid dumping null in OpenAPI schema required fields.
* `data_parser/v0.0.39` - avoid rejecting valid memory_per_node formatted as
dictionary provided with a job description.
* `data_parser/v0.0.39` - avoid rejecting valid memory_per_cpu formatted as
dictionary provided with a job description.
* `slurmrestd` - Return HTTP error code 404 when job query fails.
* `slurmrestd` - Add return schema to error response to job and license query.
* Change the log message warning for rate limited users from debug to verbose.
* `cgroup/v2` - Avoid capturing log output for ebpf when constraining devices,
as this can lead to inadvertent failure if the log buffer is too small.
* Added error message when attempting to use sattach on batch or extern steps.
* Reject job ArrayTaskThrottle update requests from unprivileged users.
* `data_parser/v0.0.39` - populate description fields of property objects in
generated OpenAPI specifications where defined.
* `slurmstepd` - Avoid segfault caused by ContainerPath not being terminated
by '/' in oci.conf.
* `data_parser/v0.0.39` - Change `v0.0.39_job_info` response to tag `exit_code`
field as being complex instead of only an unsigned integer.
(since 23.02.2):
* `openapi/dbv0.0.39/users` - If a default account update failed, resulting
in a no-op, the query returned success without any warning. Now a warning
is sent back to the client that the default account wasn't modified.
* Avoid job write lock when nodes are dynamically added/removed.
* burst_buffer/lua - allow jobs to get scheduled sooner after
`slurm_bb_data_in` completes.
* `openapi/v0.0.39` - fix memory leak in `_job_post_het_submit()`.
* Avoid possible `slurmctld` segfault caused by race condition with already
completed `slurmdbd_conn` connections.
* `Slurmdbd.conf` checks included conf files for 0600 permissions
* `slurmrestd` - fix regression "oversubscribe" fields were removed from job
descriptions and submissions from v0.0.39 end points.
* `accounting_storage/mysql` - Query for indiviual QOS correctly when you have
more than 10.
* Add warning message about ignoring `--tres-per-tasks=license` when used
on a step.
* `sshare` - Fix command to work when using priority/basic.
* Avoid loading `cli_filter` plugins outside of `salloc`/`sbatch`/`scron`/
`srun`. This fixes a number of missing symbol problems that can manifest
for executables linked against libslurm (and not `libslurmfull`).
* Allow cloud_reg_addrs to update dynamically registered node's addrs on
subsequent registrations.
* Revert a change in 22.05.5 that prevented tasks from sharing a core if
`--cpus-per-task` > threads per core, but caused incorrect accounting and
cpu.
binding. Instead, `--ntasks-per-core=1` may be requested to prevent tasks
from sharing a core.
* Correctly send `assoc_mgr` lock to mcs plugin.
* Avoid unnecessary gres/gpumem and gres/gpuutil TRES position lookups.
* `sacct` - when printing PLANNED time, use end time instead of start time for
jobs cancelled before they started.
* Hold the job with "(Reservation ... invalid)" state reason if the
reservation is not usable by the job.
* `auth/jwt` - Fix memory leak.
* `sbatch` - Added new `--export=NIL` option.
- Removed:
* Fix-test-3.13.patch
* Fix-test-38.11.patch as both tests changed upstream
------------------------------------------------------------------- -------------------------------------------------------------------
Tue May 9 09:28:23 UTC 2023 - Christian Goll <cgoll@suse.com> Tue May 9 09:28:23 UTC 2023 - Christian Goll <cgoll@suse.com>
- updated to 23.02.02 which includes a number of fixes to Slurm stability - updated to 23.02.02 which includes a number of fixes to Slurm stability
* Includes a fix for a regression in 23.02 that caused openmpi mpirun to fail * Includes a fix for a regression in 23.02 that caused openmpi mpirun to fail
to launch tasks. to launch tasks.
* It also includes two functional changes: Don't update the cron job tasks if * It also includes two functional changes: Don't update the cron job tasks if
the whole crontab file is left untouched after opening it with scrontab -e the whole crontab file is left untouched after opening it with scrontab -e
* Sort dynamic nodes and include them in topology after scontrol reconfigure * Sort dynamic nodes and include them in topology after scontrol reconfigure