SHA256
1
0
forked from pool/slurm
Commit Graph

302 Commits

Author SHA256 Message Date
521f372d87 Accepting request 1072084 from home:mslacken:branches:network:cluster
- slurm-plugins need to require pmix-pluginlib (bsc#1209260)

OBS-URL: https://build.opensuse.org/request/show/1072084
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=243
2023-03-15 10:57:09 +00:00
Dominique Leuenberger
c224ea00c3 Accepting request 1070214 from network:cluster
- Fixing dependencies for slurm--plugin-ext-sensors-rrd again. (forwarded request 1070212 from eeich)

OBS-URL: https://build.opensuse.org/request/show/1070214
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=85
2023-03-09 16:45:23 +00:00
e85b508441 Accepting request 1070212 from home:eeich:branches:network:cluster
- Fixing dependencies for slurm--plugin-ext-sensors-rrd again.

OBS-URL: https://build.opensuse.org/request/show/1070212
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=241
2023-03-08 15:43:28 +00:00
86940cb8c4 Accepting request 1070094 from home:eeich:branches:network:cluster
- Fix conflicts for plugin-ext-sensors-rrd

OBS-URL: https://build.opensuse.org/request/show/1070094
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=240
2023-03-08 07:58:58 +00:00
0f04c66747 Accepting request 1070043 from home:eeich:branches:network:cluster
- Fixup previous submission.

OBS-URL: https://build.opensuse.org/request/show/1070043
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=239
2023-03-07 22:14:15 +00:00
da464bfaae Accepting request 1070038 from home:eeich:branches:network:cluster
- Stop pulling firewall rules from github. There is no benefit to
  host these separately.
- Remove pre-sle12 pieces.

- Add missing Provides:, Conflicts: and Obsoletes: to slurm-cray,
  slurm-hdf5 and slurm-testsuite to avoid package conflicts.
- Unify Obsoletes:.
- Consolidate spec files between different Slurm releases in
  Leap/SLE maintenance.

OBS-URL: https://build.opensuse.org/request/show/1070038
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=238
2023-03-07 21:33:03 +00:00
Dominique Leuenberger
50b2b76a05 Accepting request 1068523 from network:cluster
- Add missing Provides: and Obsoletes: to slurm-cray, slurm-hdf5
  and slurm-testsuite to avoid package conflicts.
- Add dependency for the general plugin package to the
  AcctGatherProfile HDF5 plugin.
- Adjust node RealMemory in slurm.conf of test suite for 8G test
  nodes. (forwarded request 1068522 from eeich)

OBS-URL: https://build.opensuse.org/request/show/1068523
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=84
2023-03-02 22:03:34 +00:00
6997bacde0 Accepting request 1068522 from home:eeich:branches:network:cluster
- Add missing Provides: and Obsoletes: to slurm-cray, slurm-hdf5
  and slurm-testsuite to avoid package conflicts.
- Add dependency for the general plugin package to the
  AcctGatherProfile HDF5 plugin.
- Adjust node RealMemory in slurm.conf of test suite for 8G test
  nodes.

OBS-URL: https://build.opensuse.org/request/show/1068522
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=236
2023-03-01 17:58:54 +00:00
Dominique Leuenberger
8a8f7dcb78 Accepting request 1068320 from network:cluster
- updated to 23.02.0
  * Highlights
    + slurmctld - Add new RPC rate limiting feature. This is enabled through
      SlurmctldParameters=rl_enable, otherwise disabled by default.
    + Make scontrol reconfigure and sending a SIGHUP to the slurmctld behave
      the same. If you were using SIGHUP as a 'lighter' scontrol reconfigure
      to rotate logs please update your scripts to use SIGUSR2 instead.
    + Change cloud nodes to show by default. PrivateData=cloud is no longer
      needed.
    + sreport - Count planned (FKA reserved) time for jobs running in
      IGNORE_JOBS reservations. Previously was lumped into IDLE time.
    + job_container/tmpfs - Support running with an arbitrary list of private
      mount points (/tmp and /dev/shm are the default, but not required).
    + job_container/tmpfs - Set more environment variables in InitScript.
    + Make all cgroup directories created by Slurm owned by root. This was the
      behavior in cgroup/v2 but not in cgroup/v1 where by default the step
      directories ownership were set to the user and group of the job.
    + accounting_storage/mysql - change purge/archive to calculate record ages
      based on end time, rather than start or submission times.
    + job_submit/lua - add support for log_user() from slurm_job_modify().
    + Run the following scripts in slurmscriptd instead of slurmctld:
      ResumeProgram, ResumeFailProgram, SuspendProgram, ResvProlog, ResvEpilog,
      and RebootProgram (only with SlurmctldParameters=reboot_from_controller).
    + Only permit changing log levels with 'srun --slurmd-debug' by root
      or SlurmUser.
    + slurmctld will fatal() when reconfiguring the job_submit plugin fails.
    + Add PowerDownOnIdle partition option to power down nodes after nodes
      become idle.
    + Add "[jobid.stepid]" prefix from slurmstepd and "slurmscriptd" prefix
      from slurmcriptd to Syslog logging. Previously was only happening when

OBS-URL: https://build.opensuse.org/request/show/1068320
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=83
2023-03-01 15:14:17 +00:00
e60f39a466 - updated to 23.02.0
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=234
2023-02-28 20:50:48 +00:00
8899aac00b - testsuite: on later SUSE versions claim ownership of directory
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=233
2023-02-28 20:34:03 +00:00
18aa012ab9 Accepting request 1068316 from home:eeich:branches:network:cluster
+ Fixed GpuFreqDef option. When set in slurm.conf, it will be used if
      --gpu-freq was not explicitly set by the job step.
    + topology/tree - Add new TopologyParam=SwitchAsNodeRank option to reorder
      nodes based on switch layout. This can be useful if the naming convention
      for the nodes does not natually map to the network topology.
    + Removed the default setting for GpuFreqDef. If unset, no attempt to change
      the GPU frequency will be made if --gpu-freq is not set for the step.

OBS-URL: https://build.opensuse.org/request/show/1068316
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=232
2023-02-28 20:30:32 +00:00
ef6d6521aa Accepting request 1067475 from home:eeich:branches:network:cluster
- updated to 23.02.0-0rc1
  * Highlights
    + slurmctld - Add new RPC rate limiting feature. This is enabled through
      SlurmctldParameters=rl_enable, otherwise disabled by default.
    + Make scontrol reconfigure and sending a SIGHUP to the slurmctld behave
      the same. If you were using SIGHUP as a 'lighter' scontrol reconfigure
      to rotate logs please update your scripts to use SIGUSR2 instead.
    + Change cloud nodes to show by default. PrivateData=cloud is no longer
      needed.
    + sreport - Count planned (FKA reserved) time for jobs running in
      IGNORE_JOBS reservations. Previously was lumped into IDLE time.
    + job_container/tmpfs - Support running with an arbitrary list of private
      mount points (/tmp and /dev/shm are the default, but not required).
    + job_container/tmpfs - Set more environment variables in InitScript.
    + Make all cgroup directories created by Slurm owned by root. This was the
      behavior in cgroup/v2 but not in cgroup/v1 where by default the step
      directories ownership were set to the user and group of the job.
    + accounting_storage/mysql - change purge/archive to calculate record ages
      based on end time, rather than start or submission times.
    + job_submit/lua - add support for log_user() from slurm_job_modify().
    + Run the following scripts in slurmscriptd instead of slurmctld:
      ResumeProgram, ResumeFailProgram, SuspendProgram, ResvProlog, ResvEpilog,
      and RebootProgram (only with SlurmctldParameters=reboot_from_controller).
    + Only permit changing log levels with 'srun --slurmd-debug' by root
      or SlurmUser.
    + slurmctld will fatal() when reconfiguring the job_submit plugin fails.
    + Add PowerDownOnIdle partition option to power down nodes after nodes
      become idle.
    + Add "[jobid.stepid]" prefix from slurmstepd and "slurmscriptd" prefix
      from slurmcriptd to Syslog logging. Previously was only happening when

OBS-URL: https://build.opensuse.org/request/show/1067475
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=231
2023-02-23 19:32:51 +00:00
Dominique Leuenberger
d1ebf00ba6 Accepting request 1063957 from network:cluster
- testsuite: on laster SUSE versions claim ownership of directory
  /etc/security/limits.d. (forwarded request 1063954 from eeich)

OBS-URL: https://build.opensuse.org/request/show/1063957
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=82
2023-02-09 15:23:26 +00:00
4693e39860 Accepting request 1063954 from home:eeich:branches:network:cluster
- testsuite: on laster SUSE versions claim ownership of directory
  /etc/security/limits.d.

OBS-URL: https://build.opensuse.org/request/show/1063954
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=229
2023-02-09 08:22:55 +00:00
Dominique Leuenberger
a4484c7dc2 Accepting request 1042071 from network:cluster
OBS-URL: https://build.opensuse.org/request/show/1042071
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=81
2022-12-11 16:16:58 +00:00
6f080824a4 Accepting request 1039957 from home:eeich:branches:network:cluster
- Move the ext_sensors/rrd plugin to a separate package: this
  plugin requires librrd which in turn requires huge parts of
  the client side X Window System stack.
  There is probably no use in cluttering up a system for a
  plugin that probably only used by a few.

OBS-URL: https://build.opensuse.org/request/show/1039957
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=227
2022-12-11 07:58:12 +00:00
Dominique Leuenberger
30dd030610 Accepting request 1031255 from network:cluster
- Test Suite fixes:
  * Update README_Testsuite.md.
  * Clean up left over files when de-installing test suite.
  * Adjustment to test suite package: for SLE mark the openmpi4
    devel package and slurm-hdf5 optional.
  * Add -ffat-lto-objects to the build flags when LTO is set to
    make sure the object files we ship with the test suite still
    work correctly.
  * Improve setup-testsuite.sh: copy ssh fingerprints from all nodes.

- set environment variable SUSE_ZNOW to 0 in %build to avoid module load
  failures due to unresolved symbols as module take advantage of lazy
  bindings (bsc#1200030).

OBS-URL: https://build.opensuse.org/request/show/1031255
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=80
2022-10-26 10:32:00 +00:00
212048404b * Improve setup-testsuite.sh: copy ssh fingerprints from all nodes.
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=225
2022-10-26 06:23:36 +00:00
776ce8f23b - Test Suite fixes:
* Update README_Testsuite.md.
  * Clean up left over files when de-installing test suite.
  * Adjustment to test suite package: for SLE mark the openmpi4
    devel package and slurm-hdf5 optional.
  * Add -ffat-lto-objects to the build flags when LTO is set to
    make sure the object files we ship with the test suite still
    work correctly.

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=224
2022-10-25 11:33:49 +00:00
642a47efa7 - Adjustment to test suite package: only recommend openmpi4
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=223
2022-10-24 08:54:35 +00:00
52046053d5 Accepting request 1030610 from home:eeich:branches:network:cluster
- Update README_Testsuite.md.
- Make hdf5 package optional for test suite.
- Clean up left over files when de-installing test suite.

- set environment variable SUSE_ZNOW to 0 in %build to avoid module load
  failures due to unresolved symbols as module take advantage of lazy
  bindings (bsc#1200030).

OBS-URL: https://build.opensuse.org/request/show/1030610
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=222
2022-10-24 05:31:40 +00:00
Dominique Leuenberger
220eec76a4 Accepting request 1030432 from network:cluster
- updated to 22.05.5
- NOTE: Slurm validates that libraries are of the same version. Unfortunately,
  due to an oversight, we failed to notice that the slurmstepd loads the
  hash_k12 library only after a job has completed. This means that if the
  hash_k12 library is upgraded before a job finishes, the slurmstepd will load
  the new library when the job finishes, and will fail due to a mismatch of
  versions.  This results in nodes with slurmstepd processes stuck
  indefinitely. These processes require manual intervention to clean up. There
  is no clean way to resolve these hung slurmstepd processes.
  The only recommended way to upgrade between minor versions of 22.05 with
  RPM’s or upgrades that replace current binaries and libraries is to drain the
  nodes of running jobs first.
- Fixes a number of moderate severity issues, noteable are:
  * Load hash plugin at slurmstepd launch time to prevent issues loading the
    plugin at step completion if the Slurm installation is upgraded.
  * Update nvml plugin to match the unique id format for MIG devices in new
    Nvidia drivers.
  * Fix multi-node step launch failure when nodes in the controller aren't in
    natural order. This can happen with inconsistent node naming (such as
    node15 and node052) or with dynamic nodes which can register in any order.
  * job_container/tmpfs - cleanup containers even when the .ns file isn't
    mounted anymore.
  * Wait up to PrologEpilogTimeout before shutting down slurmd to allow prolog
    and epilog scripts to complete or timeout. Previously, slurmd waited 120
    seconds before timing out and killing prolog and epilog scripts. (forwarded request 1010642 from mslacken)

OBS-URL: https://build.opensuse.org/request/show/1030432
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=79
2022-10-22 12:13:18 +00:00
c2551ab47f Accepting request 1010642 from home:mslacken:branches:network:cluster
- updated to 22.05.5
- NOTE: Slurm validates that libraries are of the same version. Unfortunately,
  due to an oversight, we failed to notice that the slurmstepd loads the
  hash_k12 library only after a job has completed. This means that if the
  hash_k12 library is upgraded before a job finishes, the slurmstepd will load
  the new library when the job finishes, and will fail due to a mismatch of
  versions.  This results in nodes with slurmstepd processes stuck
  indefinitely. These processes require manual intervention to clean up. There
  is no clean way to resolve these hung slurmstepd processes.
  The only recommended way to upgrade between minor versions of 22.05 with
  RPM’s or upgrades that replace current binaries and libraries is to drain the
  nodes of running jobs first.
- Fixes a number of moderate severity issues, noteable are:
  * Load hash plugin at slurmstepd launch time to prevent issues loading the
    plugin at step completion if the Slurm installation is upgraded.
  * Update nvml plugin to match the unique id format for MIG devices in new
    Nvidia drivers.
  * Fix multi-node step launch failure when nodes in the controller aren't in
    natural order. This can happen with inconsistent node naming (such as
    node15 and node052) or with dynamic nodes which can register in any order.
  * job_container/tmpfs - cleanup containers even when the .ns file isn't
    mounted anymore.
  * Wait up to PrologEpilogTimeout before shutting down slurmd to allow prolog
    and epilog scripts to complete or timeout. Previously, slurmd waited 120
    seconds before timing out and killing prolog and epilog scripts.

OBS-URL: https://build.opensuse.org/request/show/1010642
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=220
2022-10-21 15:00:25 +00:00
Dominique Leuenberger
edd405b2c8 Accepting request 1006180 from network:cluster
- Do not deduplicate files of testsuite Slurm configuration.
  This directory is supposed to be mounted over /etc/slurm
  therefore it must not contain softlinks to the files in
  this directory.
- Improve .a and .o file collection for test suite: find these
  files even if there are multiple ones in a single line. (forwarded request 1005746 from eeich)

OBS-URL: https://build.opensuse.org/request/show/1006180
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=78
2022-09-26 16:48:44 +00:00
09aecc2015 Accepting request 1005746 from home:eeich:branches:network:cluster
- Do not deduplicate files of testsuite Slurm configuration.
  This directory is supposed to be mounted over /etc/slurm
  therefore it must not contain softlinks to the files in
  this directory.
- Improve .a and .o file collection for test suite: find these
  files even if there are multiple ones in a single line.

OBS-URL: https://build.opensuse.org/request/show/1005746
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=218
2022-09-26 15:01:51 +00:00
Dominique Leuenberger
ae04ec8787 Accepting request 1005247 from network:cluster
- Fix build for older product version. (forwarded request 1005246 from eeich)

OBS-URL: https://build.opensuse.org/request/show/1005247
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=77
2022-09-22 12:49:55 +00:00
3f68233e21 Accepting request 1005246 from home:eeich:branches:network:cluster
- Fix build for older product version.

OBS-URL: https://build.opensuse.org/request/show/1005246
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=216
2022-09-21 15:33:09 +00:00
Dominique Leuenberger
d3bcbab808 Accepting request 992362 from network:cluster
- Fix a potential security vulnerability in the test package
  (bsc#1201674, CVE-2022-31251).

- Patch NOFILE Limit in the slurmd.service copy for the testsuite. (forwarded request 992353 from eeich)

OBS-URL: https://build.opensuse.org/request/show/992362
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=76
2022-08-02 20:09:54 +00:00
b60ac5f569 Accepting request 992353 from home:eeich:branches:network:cluster
- Fix a potential security vulnerability in the test package
  (bsc#1201674, CVE-2022-31251).

- Patch NOFILE Limit in the slurmd.service copy for the testsuite.

OBS-URL: https://build.opensuse.org/request/show/992353
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=214
2022-08-02 15:34:01 +00:00
fd509c0258 Accepting request 990637 from home:bmwiedemann:branches:network:cluster
make slurmtest.tar reproducible

OBS-URL: https://build.opensuse.org/request/show/990637
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=213
2022-08-02 13:14:07 +00:00
Richard Brown
7a8e082057 Accepting request 990643 from network:cluster
Automatic submission by obs-autosubmit

OBS-URL: https://build.opensuse.org/request/show/990643
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=75
2022-07-22 17:21:25 +00:00
e067a36989 - Fix a typo which prevented the nproc limit for slurmd to be
up-ed for the test suite.

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=211
2022-07-15 07:15:34 +00:00
69890cab1e Accepting request 989256 from home:eeich:branches:network:cluster
- Improve check for mpicc in testsuite package: if binary isn't
  found, don't crash.

OBS-URL: https://build.opensuse.org/request/show/989256
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=210
2022-07-15 07:13:32 +00:00
167150eca6 - Fix a typo
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=209
2022-07-15 07:12:53 +00:00
Dominique Leuenberger
e57307d81e Accepting request 988733 from network:cluster
- Package the Slurm testsuite for QA purposes.
  * Fixes for test suite:
    Keep-logs-of-skipped-test-when-running-test-cases-sequentially.patch
    Fix-test-21.41.patch
    Fix-test-38.11.patch
    Fix-test-32.8.patch
    Fix-test-3.13.patch
    Fix-test7.2-to-find-libpmix-under-lib64-as-well.patch
  * Add documentation:
    README_Testsuite.md
- Allow log in as user 'slurm'. This allows admins to run certain
  priviledged commands more easily without becoming root. (forwarded request 988732 from eeich)

OBS-URL: https://build.opensuse.org/request/show/988733
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=74
2022-07-13 11:45:23 +00:00
7d13a7ba97 Accepting request 988732 from home:eeich:branches:network:cluster
- Package the Slurm testsuite for QA purposes.
  * Fixes for test suite:
    Keep-logs-of-skipped-test-when-running-test-cases-sequentially.patch
    Fix-test-21.41.patch
    Fix-test-38.11.patch
    Fix-test-32.8.patch
    Fix-test-3.13.patch
    Fix-test7.2-to-find-libpmix-under-lib64-as-well.patch
  * Add documentation:
    README_Testsuite.md
- Allow log in as user 'slurm'. This allows admins to run certain
  priviledged commands more easily without becoming root.

OBS-URL: https://build.opensuse.org/request/show/988732
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=207
2022-07-12 20:03:18 +00:00
52adf61c22 Accepting request 983910 from home:mslacken:branches:network:cluster
- update to 22.05.2 with following fixes:
  * Fix regression which allowed the oversubscription of licenses.
  * Fix a segfault in slurmctld when requesting gres in job arrays.

OBS-URL: https://build.opensuse.org/request/show/983910
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=206
2022-06-20 11:58:11 +00:00
2951a00ce2 - Package the Slrum testsuite for QA purposes.
NOTE: This package is not meant to be used for testing by the
  user but rather for testing by the maintainers to ensure the
  package is working properly.
  DO NOT report test suite failures unless you are able to confirm
  that the failure is really a bug.

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=205
2022-06-08 13:21:55 +00:00
Dominique Leuenberger
13c4d39104 Accepting request 980097 from network:cluster
OBS-URL: https://build.opensuse.org/request/show/980097
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=73
2022-05-31 14:04:51 +00:00
faa19fe22b Accepting request 980093 from home:mslacken:branches:network:cluster
- update to 22.05.0 with following changes:
- Support for dynamic node addition and removal
- Support for native Linux cgroup v2 operation
- Newly added plugins to support HPE Slingshot 11 networks
  (switch/hpe_slingshot), and Intel Xe GPUs (gpu/oneapi)
- Added new acct_gather_interconnect/sysfs plugin to collect statistics
  from arbitrary network interfaces.
- Expanded and synced set of environment variables available in the
  Prolog/Epilog/PrologSlurmctld/EpilogSlurmctld scripts.
- New "--prefer" option to job submissions to allow for a "soft
  constraint" request to influence node selection.
- Optional support for license planning in the backfill scheduler with
  "bf_licenses" option in SchedulerParameters.
- removed file slurm-2.4.4-init.patch as sysvinit is now realy deprecated
- removed file load-pmix-major-version.patch as fixed upstream

OBS-URL: https://build.opensuse.org/request/show/980093
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=203
2022-05-31 13:38:54 +00:00
Dominique Leuenberger
737b47d2be Accepting request 976280 from network:cluster
- Add a comment about the CommunicationParameters=block_null_hash
  option warning users who migrate - just in case.

- Update to 21.08.8 which fixes CVE-2022-29500 (bsc#1199278),
  CVE-2022-29501 (bsc#1199279), and CVE-2022-29502 (bsc#1199281).

OBS-URL: https://build.opensuse.org/request/show/976280
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=72
2022-05-12 20:59:35 +00:00
a07f819c2f - Update to 21.08.8 which fixes CVE-2022-29500 (bsc#1199278),
CVE-2022-29501 (bsc#1199279), and CVE-2022-29502 (bsc#1199281).

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=201
2022-05-11 10:26:59 +00:00
5f6ca5dea6 Accepting request 976056 from home:eeich:branches:network:cluster
- Add a comment about the CommunicationParameters=block_null_hash
  option warning users who migrate - just in case.

OBS-URL: https://build.opensuse.org/request/show/976056
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=200
2022-05-11 10:25:15 +00:00
Dominique Leuenberger
62db1261ed Accepting request 975440 from network:cluster
OBS-URL: https://build.opensuse.org/request/show/975440
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=71
2022-05-06 17:00:14 +00:00
950ae37e78 Accepting request 975374 from home:mslacken:branches:network:cluster
- Update to 21.08.8 which fixes CVE-2022-29500, CVE-2022-29501
  and CVE-2022-29502
- Added 'CommunicationParameters=block_null_hash' to slurm.conf, please
  add this parameter to existing configurations.

OBS-URL: https://build.opensuse.org/request/show/975374
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=198
2022-05-06 15:13:12 +00:00
Dominique Leuenberger
2450bd4dcd Accepting request 974456 from network:cluster
OBS-URL: https://build.opensuse.org/request/show/974456
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=70
2022-05-03 19:19:04 +00:00
30c749c9e0 Accepting request 974433 from home:mslacken:branches:network:cluster
- Update to 21.08.7 with following changes:
  * openapi/v0.0.37 - correct calculation for bf_queue_len_mean in /diag.
  * Avoid shrinking a reservation when overlapping with downed nodes.
  * Only check TRES limits against current usage for TRES requested by the job.
  * Do not allocate shared gres (MPS) in whole-node allocations
  * Constrain slurmstepd to job/step cgroup like in previous versions of Slurm.
  * Fix warnings on 32-bit compilers related to printf() formats.
  * Fix reconfigure issues after disabling/reenabling the GANG PreemptMode.
  * Fix race condition where a cgroup was being deleted while another step
    was creating it.
  * Set the slurmd port correctly if multi-slurmd
  * Fix FAIL mail not being sent if a job was cancelled due to preemption.
  * slurmrestd - move debug logs for HTTP handling to be gated by debugflag
    NETWORK to avoid unnecessary logging of communication contents.
  * Fix issue with bad memory access when shrinking running steps.
  * Fix various issues with internal job accounting with GRES when jobs are
    shrunk.
  * Fix ipmi polling on slurmd reconfig or restart.
  * Fix srun crash when reserved ports are being used and het step fails
    to launch.
  * openapi/dbv0.0.37 - fix DELETE execution path on /user/{user_name}.
  * slurmctld - Properly requeue all components of a het job if PrologSlurmctld
    fails.
  * rlimits - remove final calls to limit nofiles to 4096 but to instead use
    the max possible nofiles in slurmd and slurmdbd.
  * Allow the DBD agent to load large messages (up to MAX_BUF_SIZE) from state.
  * Fix potential deadlock during slurmctld restart when there is a completing
    job.
  * slurmstepd - reduce user requested soft rlimits when they are above max
    hard rlimits to avoid rlimit request being completely ignored and

OBS-URL: https://build.opensuse.org/request/show/974433
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=196
2022-05-02 17:06:13 +00:00
Dominique Leuenberger
ec8df38732 Accepting request 942222 from network:cluster
OBS-URL: https://build.opensuse.org/request/show/942222
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=69
2021-12-23 16:53:52 +00:00
d442993ff4 Accepting request 942081 from home:mslacken:branches:network:cluster
- update to 21.08.5 with following changes:
  * Fix issue where typeless GRES node updates were not immediately reflected.
  * Fix setting the default scrontab job working directory so that it's the home
    of the different user (*u <user>) and not that of root or SlurmUser editor.
  * Fix stepd not respecting SlurmdSyslogDebug.
  * Fix concurrency issue with squeue.
  * Fix job start time not being reset after launch when job is packed onto
    already booting node.
  * Fix updating SLURM_NODE_ALIASES for jobs packed onto powering up nodes.
  * Cray - Fix issues with starting hetjobs.
  * auth/jwks - Print fatal() message when jwks is configured but file could
    not be opened.
  * If sacctmgr has an association with an unknown qos as the default qos
    print 'UNKN*###' instead of leaving a blank name.
  * Correctly determine task count when giving --cpus-per-gpu, --gpus and
    *-ntasks-per-node without task count.
  * slurmctld - Fix places where the global last_job_update was not being set
    to the time of update when a job's reason and description were updated.
  * slurmctld - Fix case where a job submitted with more than one partition
    would not have its reason updated while waiting to start.
  * Fix memory leak in node feature rebooting.
  * Fix time limit permanetly set to 1 minute by backfill for job array tasks
    higher than the first with QOS NoReserve flag and PreemptMode configured.
  * Fix sacct -N to show jobs that started in the current second
  * Fix issue on running steps where both SLURM_NTASKS_PER_TRES and
    SLURM_NTASKS_PER_GPU are set.
  * Handle oversubscription request correctly when also requesting
    *-ntasks-per-tres.
  * Correctly detect when a step requests bad gres inside an allocation.
  * slurmstepd - Correct possible deadlock when UnkillableStepTimeout triggers.

OBS-URL: https://build.opensuse.org/request/show/942081
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=195
2021-12-23 10:26:41 +00:00