From b61c5b25fa7eadf00aee186f6538681826ecdd555590dba8b79d88929552b936 Mon Sep 17 00:00:00 2001
From: Christian Goll
Date: Fri, 2 Jul 2021 15:32:26 +0000
Subject: [PATCH] Accepting request 903744 from home:mslacken:slurm_update

- Updated to 20.11.8:
  * slurmctld - fix erroneous "StepId=CORRUPT" messages in error logs.
  * Correct the error given when auth plugin fails to pack a credential.
  * Fix unused-variable compiler warning on FreeBSD in fd_resolve_path().
  * acct_gather_filesystem/lustre - only emit collection error once per step.
  * Add GRES environment variables (e.g., CUDA_VISIBLE_DEVICES) into the
    interactive step, the same as is done for the batch step.
  * Fix various potential deadlocks when altering objects in the database
    dealing with every cluster in the database.
  * slurmrestd:
    - handle slurmdbd connection failures without segfaulting.
    - fix segfault for searches in slurmdb/v0.0.36/jobs.
    - remove (non-functioning) users query parameter for
      slurmdb/v0.0.36/jobs from openapi.json
    - fix segfault in slurmrestd db/jobs with numeric queries
    - add argv handling for job/submit endpoint.
    - add description for slurmdb/job endpoint.
  * slurmrestd/dbv0.0.36:
    - Fix values dumped in job state/current and
      job step state.
    - Correct description for previous state property.
  * srun:
    - fix broken node step allocation in a heterogeneous allocation.
    - leave SLURM_DIST_UNKNOWN as default for --interactive.
  * Fail step creation if -n is not multiple of --ntasks-per-gpu.
  * job_container/tmpfs - Fix slowdown on teardown.
  * Fix problem with SlurmctldProlog where requeued jobs would never launch.
  * job_container/tmpfs - Fix issue when restarting slurmd where the namespace
    mount points could disappear.
  * sacct:

OBS-URL: https://build.opensuse.org/request/show/903744
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=183
---
 slurm-20.11.7.tar.bz2 |  3 ---
 slurm-20.11.8.tar.bz2 |  3 +++
 slurm.changes         | 55 +++++++++++++++++++++++++++++++++++++++++++
 slurm.spec            |  2 +-
 4 files changed, 59 insertions(+), 4 deletions(-)
 delete mode 100644 slurm-20.11.7.tar.bz2
 create mode 100644 slurm-20.11.8.tar.bz2

diff --git a/slurm-20.11.7.tar.bz2 b/slurm-20.11.7.tar.bz2
deleted file mode 100644
index b9c6845..0000000
--- a/slurm-20.11.7.tar.bz2
+++ /dev/null
@@ -1,3 +0,0 @@
-version https://git-lfs.github.com/spec/v1
-oid sha256:7ad9c41ab103c30c67672dcc3f18da209b578922614b7f0e2d7a9e1e1e8cb2b9
-size 6628714
diff --git a/slurm-20.11.8.tar.bz2 b/slurm-20.11.8.tar.bz2
new file mode 100644
index 0000000..8f357b7
--- /dev/null
+++ b/slurm-20.11.8.tar.bz2
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:a083cee4803c060f3c2943db74ae161d70c19e2bf67970029512f1fc476ddadc
+size 6630041
diff --git a/slurm.changes b/slurm.changes
index d684d74..27a42ed 100644
--- a/slurm.changes
+++ b/slurm.changes
@@ -1,3 +1,58 @@
+-------------------------------------------------------------------
+Fri Jul 2 08:01:32 UTC 2021 - Christian Goll
+
+- Updated to 20.11.8:
+  * slurmctld - fix erroneous "StepId=CORRUPT" messages in error logs.
+  * Correct the error given when auth plugin fails to pack a credential.
+  * Fix unused-variable compiler warning on FreeBSD in fd_resolve_path().
+  * acct_gather_filesystem/lustre - only emit collection error once per step.
+  * Add GRES environment variables (e.g., CUDA_VISIBLE_DEVICES) into the
+    interactive step, the same as is done for the batch step.
+  * Fix various potential deadlocks when altering objects in the database
+    dealing with every cluster in the database.
+  * slurmrestd:
+    - handle slurmdbd connection failures without segfaulting.
+    - fix segfault for searches in slurmdb/v0.0.36/jobs.
+    - remove (non-functioning) users query parameter for
+      slurmdb/v0.0.36/jobs from openapi.json
+    - fix segfault in slurmrestd db/jobs with numeric queries
+    - add argv handling for job/submit endpoint.
+    - add description for slurmdb/job endpoint.
+  * slurmrestd/dbv0.0.36:
+    - Fix values dumped in job state/current and
+      job step state.
+    - Correct description for previous state property.
+  * srun:
+    - fix broken node step allocation in a heterogeneous allocation.
+    - leave SLURM_DIST_UNKNOWN as default for --interactive.
+  * Fail step creation if -n is not multiple of --ntasks-per-gpu.
+  * job_container/tmpfs - Fix slowdown on teardown.
+  * Fix problem with SlurmctldProlog where requeued jobs would never launch.
+  * job_container/tmpfs - Fix issue when restarting slurmd where the namespace
+    mount points could disappear.
+  * sacct:
+    - avoid truncating JobId at 34 characters.
+    - fix segfault when printing StepId (or when using --long).
+  * scancel - fix segfault when --wckey filtering option is used.
+  * select/cons_tres - Fix memory leak.
+  * Prevent file descriptor leak in job_container/tmpfs on slurmd restart.
+  * perlapi/libslurmdb - expose tres_req_str to job hash.
+  * scrontab - close and reopen temporary crontab file to deal with editors
+    that do not change the original file, but instead write out then rename
+    a new file.
+  * sstat - fix linking so that it will work when --without-shared-libslurm
+    was used to build Slurm.
+  * Clear allocated cpus for running steps in a job before handling requested
+    nodes on new step.
+  * Don't reject a step if not enough nodes are available. Instead, defer the
+    step until enough nodes are available to satisfy the request.
+  * Don't reject a step if it requests at least one specific node that is
+    already allocated to another step. Instead, defer the step until the
+    requested node(s) become available.
+  * Better handling of --mem=0.
+  * Ignore DefCpuPerGpu when --cpus-per-task given.
+
+
 -------------------------------------------------------------------
 Fri May 14 10:07:04 UTC 2021 - Christian Goll
 
diff --git a/slurm.spec b/slurm.spec
index 3e00490..810b5b0 100644
--- a/slurm.spec
+++ b/slurm.spec
@@ -18,7 +18,7 @@
 
 # Check file META in sources: update so_version to (API_CURRENT - API_AGE)
 %define so_version 36
-%define ver 20.11.7
+%define ver 20.11.8
 %define _ver _20_11
 %define dl_ver %{ver}
 # so-version is 0 and seems to be stable