SHA256
1
0
forked from pool/slurm
Commit Graph

86 Commits

Author SHA256 Message Date
54640668e5 Accepting request 773459 from home:mslacken:branches:network:cluster
- Updated to version 20.02.0-0pre1, highlights are
  Highlights:
 * Exclusive behavior of a node includes all GRES on a node as well
   as the cpus.
 * Use python3 instead of python for internal build/test scripts.
   The slurm.spec file has been updated to depend on python3 as well.
 * Added new NodeSet configuration option to help simplify partition
   configuration sections for heterogeneous / condo*style clusters.
 * Added slurm.conf option MaxDBDMsgs to control how many messages will be
   stored in the slurmctld before throwing them away when the slurmdbd is down.
 * The checkpoint plugin interface and all associated API calls have been
   removed.
 * slurm_init_job_desc_msg() initializes mail_type as uint16_t. This allows
   mail_type to be set to NONE with scontrol.
 * Add new slurm_spank_log() function to print messages back to the user from
   within a SPANK plugin without prepending "error: " from slurm_error().
 * Enforce having partition name and nodelist=ALL when creating reservations
   with flags=PART_NODES.
 * SPANK - removed never-implemented slurm_spank_slurmd_init() interface. This
   hook has always been accessible through slurm_spank_init() in the
   S_CTX_SLURMD context instead.
 * sbcast - add new BcastAddr option to NodeName lines to allow sbcast traffic
   to flow over an alternate network path.
 * Added auth/jwt plugin, and 'scontrol token' subcommand.  PMIx - improve
 * performance of proc map generation.  Deprecate kill_invalid_depend in
 * SchedulerParameters and move it to a new
   option called DependencyParameters.
 * Enable job dependencies for any job on any cluster in the same federation.
 * Allow clusters to be added automatically to db at startup of ctld.  Add
 * AccountingStorageExternalHost slurm.conf parameter.  The

OBS-URL: https://build.opensuse.org/request/show/773459
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=130
2020-02-11 14:31:26 +00:00
d94a66a178 - standard slurm.conf uses now also SlurmctldHost on all build
targets (bsc#1162377)

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=128
2020-02-05 15:38:55 +00:00
17b070147f - Fix a missed systemd_requires -> systemd_ordering conversion.
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=126
2020-01-27 08:54:27 +00:00
73e298f12f Accepting request 767005 from home:eeich:branches:network:cluster
- Remove special OHPC compatibility macro: these settings should
  be applied univerally.
- Add a Recommends for mariadb to slurm-slurmdbd: it is recommened
  to run the database on the same machine as the daemon.

OBS-URL: https://build.opensuse.org/request/show/767005
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=124
2020-01-25 06:14:47 +00:00
345d1bbb94 Accepting request 766872 from home:dimstar:Factory
- BuildRequire pkgconfig(systemd) instead of systemd: allow OBS to
  shortcut through the -mini flavors.
- Use systemd_ordering instead of systemd_requires: systemd is
  never a strict requirement; but in case the system is scheduled
  for installation together with systemd, we want systemd to be
  installed prior to slurm.

- start slurmdbd after mariadb (bsc#1161716)

OBS-URL: https://build.opensuse.org/request/show/766872
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=123
2020-01-24 17:12:50 +00:00
995841bad4 Accepting request 766677 from home:mslacken:branches:network:cluster
- start slurmdbd after mariabd (bsc#1161716)

OBS-URL: https://build.opensuse.org/request/show/766677
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=122
2020-01-23 17:49:33 +00:00
c39f0bf6fb - Fix base_ver for SLE 15 SP2.
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=120
2020-01-13 15:42:28 +00:00
0581b91660 Accepting request 762650 from home:eeich:branches:network:cluster
- Update to version 19.05.5 (jsc#SLE-8491)
  * Check %docdir/NEWS for details.
  * Includes security fixes CVE-2019-19727, CVE-2019-19728,
    CVE-2019-12838.
  * Disable i586 builds as this is no longer supported.
  * Create libnss_slurm package to support user and group resolution
    thru slurmstepd.
  * slurm-2.4.4-rpath.patch -> Remove-rpath-from-build.patch
    Obsoleted:
    - pam_slurm_adopt-avoid-running-outside-of-the-sshd-PA.patch
    - pam_slurm_adopt-send_user_msg-don-t-copy-undefined-d.patch
    - pam_slurm_adopt-use-uid-to-determine-whether-root-is.patch

OBS-URL: https://build.opensuse.org/request/show/762650
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=118
2020-01-10 10:38:48 +00:00
69c13014d9 Accepting request 760450 from home:eeich:branches:network:cluster
- Deprecate "ControlMachine" only for SLURM version upgrades and
  products newer than 1501. This ensures that the original setting
  is retained for the SLURM version shipped origianlly with SLE-15-SP1
  or Leap 15.1.

- Update to v18.08.9 for fixing CVE-2019-19728 (bsc#1159692).
  * Wrap END_TIMER{,2,3} macro definition in "do {} while (0)" block.
  * Make sview work with glib2 v2.62.
  * Make Slurm compile on linux after sys/sysctl.h was deprecated.
  * Install slurmdbd.conf.example with 0600 permissions to encourage secure
    use. CVE-2019-19727.
  * srun - do not continue with job launch if --uid fails. CVE-2019-19728.

- added pmix support jsc#SLE-10800 

- Use --with-shared-libslurm to build slurm binaries using libslurm.
- Make libslurm depend on slurm-config.

- Fix ownership of /var/spool/slurm on new installations
  and upgrade (boo#1158696).

- Fix permissions of slurmdbd.conf (bsc#1155784, CVE-2019-19727).
- Fix %posttrans macro _res_update to cope with added newline
  (bsc#1153259).

- Add package slurm-webdoc which sets up a web server to provide
  the documentation for the version shipped.

- Move srun from 'slurm' to 'slurm-node': srun is required on the
  nodes as well so sbatch will work. 'slurm-node' is a requirement

OBS-URL: https://build.opensuse.org/request/show/760450
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=116
2020-01-08 19:27:10 +00:00
163930db89 - Set %base_ver for SLE-15-SP2 to 18.08 (for now).
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=114
2019-10-02 08:27:50 +00:00
e3e7bce7dc Accepting request 731004 from home:eeich:branches:network:cluster
- Edit sample configuration to deprecate "ControlMachine",
  "ControlAddr", "BackupController" and "BackupAddr" in favor
  "SlurmctldHost".

OBS-URL: https://build.opensuse.org/request/show/731004
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=112
2019-09-14 21:47:11 +00:00
9c7abff085 - Updated to 18.08.8 for fixing (CVE-2019-12838, bsc#1140709, jsc#SLE-7341,
jsc#SLE-7342)

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=110
2019-08-18 20:13:20 +00:00
c0e29e647e - Updated to 18.08.8 for fixing (CVE-2019-12838, bsc#1140709, jre#SLE-7341,
jre#SLE-7342)

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=109
2019-08-18 18:46:31 +00:00
f2775f6e1e - Fix logic of slurm-munge recommends: slurm-munge requires munge
already, so if we have munge installed we recommend slurm-munge
  as the authentication when installing slurm or slurm-node.

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=108
2019-08-17 14:25:47 +00:00
89f111874a Accepting request 715613 from home:mslacken:branches:network:cluster
removed explanation of changelog entry

OBS-URL: https://build.opensuse.org/request/show/715613
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=106
2019-07-16 08:32:48 +00:00
5a7922ceef Accepting request 715604 from home:mslacken:branches:network:cluster
- Fixed changelog entry from Jul 11 in order to use the right

OBS-URL: https://build.opensuse.org/request/show/715604
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=105
2019-07-16 08:18:32 +00:00
9d923e48e1 Accepting request 715597 from home:mslacken:branches:network:cluster
- Fixed changelog entry if Jul 11 in order to use the right 
  version slurm 18.08.8

- Updated to 18.08.8 for fixing CVE-2019-12838 and (bsc#1140709)

OBS-URL: https://build.opensuse.org/request/show/715597
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=104
2019-07-16 07:57:42 +00:00
f88a1f8e69 Accepting request 715348 from home:eeich:branches:network:cluster
- Fix build for SLE-11-SP4 and older.

OBS-URL: https://build.opensuse.org/request/show/715348
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=102
2019-07-14 21:25:41 +00:00
257676d4f2 Accepting request 714908 from home:mslacken:branches:network:cluster
- added cray depend libraries to seperate package, as they are now
  built, since json is enabled

- Updated to 18.0.7 for fixing CVE-2019-12838 and (bsc#1140709)
  * Update "xauth list" to use the same 10000ms timeout as the other xauth
    commands.
  * Fix issue in gres code to handle a gres cnt of 0.
  * Don't purge jobs if backfill is running.
  * Verify job is pending add/removing accrual time.
  * Don't abort when the job doesn't have an association that was removed
    before the job was able to make it to the database.
  * Set state_reason if select_nodes() fails job for QOS or Account.
  * Avoid seg_fault on referencing association without a valid_qos bitmap.
  * If Association/QOS is removed on a pending job set that job as ineligible.
  * When changing a jobs account/qos always make sure you remove the old limits.
  * Don't reset a FAIL_QOS or FAIL_ACCOUNT job reason until the qos or
    account changed.
  * Restore "sreport -T ALL" functionality.
  * Correctly typecast signals being sent through the api.
  * Properly initialize structures throughout Slurm.
  * Sync "numtask" squeue format option for jobs and steps to "numtasks".
  * Fix sacct -PD to avoid CA before start jobs.
  * Fix potential deadlock with backup slurmctld.
  * Fixed issue with jobs not appearing in sacct after dependency satisfied.
  * Fix showing non-eligible jobs when asking with -j and not -s.
  * Fix issue with backfill scheduler scheduling tasks of an array
    when not the head job.
  * accounting_storage/mysql - fix SIGABRT in the archive load logic.
  * accounting_storage/mysql - fix memory leak in the archive load logic.
  * Limit records per single SQL statement when loading archived data.

OBS-URL: https://build.opensuse.org/request/show/714908
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=100
2019-07-12 18:09:50 +00:00
fa2138ebce Accepting request 714002 from home:eeich:slurm-staging
- Fix build dependency issue around libibmad-devel introduced
  in SLE-12-SP4.

OBS-URL: https://build.opensuse.org/request/show/714002
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=99
2019-07-08 08:21:33 +00:00
5a25a5ea8b Accepting request 713918 from home:eeich:slurm-staging
- Add BuildRequires to address warnings during build:
  * for libcurl-devel, libssh2-devel and rrdtool-devel
  * for libjson-c-devel and liblz4-devel where available,
    disable these with --without-json and --without-lz4
    where not.
  * disable DataWarp (--without-datawarp).

OBS-URL: https://build.opensuse.org/request/show/713918
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=98
2019-07-08 05:48:14 +00:00
d212ad0245 Accepting request 713773 from home:eeich:branches:network:cluster
- Update SLURM to 18.08.7:
  * Set debug statement to debug2 to avoid benign error messages.
  * Add SchedulerParameters option of bf_hetjob_immediate to attempt to start
    a heterogeneous job as soon as all of its components are determined able
    to do so.
  * Fix underflow causing decay thread to exit.
  * Fix main scheduler not considering hetjobs when building the job queue.
  * Fix regression for sacct to display old jobs without a start time.
  * Fix setting correct number of gres topology bits.
  * Update hetjobs pending state reason when appropriate.
  * Fix accounting_storage/filetxt's understanding of TRES.
  * Set Accrue time when not enforcing limits.
  * Fix srun segfault when requesting a hetjob with test_exec or bcast
    options.
  * Hide multipart priorities log message behind Priority debug flag.
  * sched/backfill - Make hetjobs sensitive to bf_max_job_start.
  * Fix slurmctld segfault due to job's partition pointer NULL dereference.
  * Fix issue with OR'ed job dependencies.
  * Add new job's bit_flags of INVALID_DEPEND to prevent rebuilding a job's
    dependency string when it has at least one invalid and purged dependency.
  * Promote federation unsynced siblings log message from debug to info.
  * burst_buffer/cray - fix slurmctld SIGABRT due to illegal read/writes.
  * burst_buffer/cray - fix memory leak due to unfreed job script content.
  * node_features/knl_cray - fix script_argv use-after-free.
  * burst_buffer/cray - fix script_argv use-after-free.
  * Fix invalid reads of size 1 due to non null-terminated string reads.
  * Add extra debug2 logs to identify why BadConstraints reason is set.

OBS-URL: https://build.opensuse.org/request/show/713773
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=94
2019-07-07 04:27:16 +00:00
0c8ed23dc7 Accepting request 713744 from home:eeich:branches:network:cluster
- Do not build hdf5 support where not available.

OBS-URL: https://build.opensuse.org/request/show/713744
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=93
2019-07-06 20:02:33 +00:00
2536acafc5 Accepting request 713735 from home:eeich:branches:network:cluster
- Add support for version updates on SLE: Update packages to a
  later version than the version supported originally on SLE
  will receive a version string in their package name.

OBS-URL: https://build.opensuse.org/request/show/713735
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=92
2019-07-06 17:41:00 +00:00
4a0199d836 Accepting request 679787 from home:mslacken:slurm18
- added the hdf5 job data gathering plugin

OBS-URL: https://build.opensuse.org/request/show/679787
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=90
2019-05-29 15:15:25 +00:00
2b7d9f397e Accepting request 670635 from home:eeich:branches:network:cluster
- Add backward compatibility with SLE-11 SP4

OBS-URL: https://build.opensuse.org/request/show/670635
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=88
2019-02-01 19:44:10 +00:00
acb7e0505a - Update to version 18.08.05-2:
This version obsoletes:
  Fix-contrib-perlapi-to-build-with-the-fix-for-CVE-2019-6438-750cc23ed.patch
- Fix spec file for older SUSE versions.

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=85
2019-01-31 20:33:20 +00:00
c9da5cd5a9 Accepting request 670322 from home:eeich:branches:network:cluster
- Update to version 18.08.05:
  * Add mitigation for a potential heap overflow on 32-bit systems in xmalloc.
    (CVE-2019-6438, bsc#1123304).
  * Other fixes:
    + Backfill - If a job has a time_limit guess the end time of a job better
      if OverTimeLimit is Unlimited.
    + Fix "sacctmgr show events event=cluster"
    + Fix sacctmgr show runawayjobs from sibling cluster
    + Avoid bit offset of -1 in call to bit_nclear().
    + Insure that "hbm" is a configured GresType on knl systems.
    + Fix NodeFeaturesPlugins=node_features/knl_generic to allow other gres
      other than knl.
    + cons_res: Prevent overflow on multiply.
    + Better debug for bad values in gres.conf.
    + Fix double accounting of energy at end of job.
    + Read gres.conf for cloud nodes on slurmctld.
    + Don't assume the first node of a job is the batch host when purging jobs
      from a node.
    + Better debugging when a job doesn't have a job_resrcs ptr.
    + Store ave watts in energy plugins.
    + Add XCC plugin for reading Lenovo Power.
    + Fix minor memory leak when scheduling rebootable nodes.
    + Fix debug2 prefix for sched log.
    + Fix printing correct SLURM_JOB_ACCOUNT_PACK_GROUP_* in env for a Het Job.
    + sbatch - search current working directory first for job script.
    + Make it so held jobs reset the AccrueTime and do not count against any
      AccrueTime limits.
    + Add SchedulerParameters option of bf_hetjob_prio=[min|avg|max] to alter
      the job sorting algorithm for scheduling heterogeneous jobs.
    + Fix initialization of assoc_mgr_locks and slurmctld_locks lock

OBS-URL: https://build.opensuse.org/request/show/670322
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=83
2019-01-31 11:56:59 +00:00
364aa9908a Accepting request 663733 from home:mslacken:slurm18
- Update to 18.08.04, with following highlights
  * Fix message sent to user to display preempted instead of time limit when
    a job is preempted.
  * Fix memory leak when a failure happens processing a nodes gres config.
  * Improve error message when failures happen processing a nodes gres config.
  * Don't skip jobs in scontrol hold.
  * Allow --cpu-bind=verbose to be used with SLURM_HINT environment variable.
  * Enhanced handling for runaway jobs
  * cons_res: Delay exiting cr_job_test until after cores/cpus are calculated
    and distributed.
  * Don't check existence of srun --prolog or --epilog executables when set to
    "none" and SLURM_TEST_EXEC is used.
  * Add "P" suffix support to job and step tres specifications.
  * Fix jobacct_gather/cgroup to work correctly when more than one task is
    started on a node.
  * salloc - set SLURM_NTASKS_PER_CORE and SLURM_NTASKS_PER_SOCKET in the
    environment if the corresponding command line options are used.
  * slurmd - fix handling of the -f flag to specify alternate config file
    locations.
  * Add SchedulerParameters option of bf_ignore_newly_avail_nodes to avoid
    scheduling lower priority jobs on resources that become available during
    the backfill scheduling cycle when bf_continue is enabled.
  * job_submit/lua: Add several slurmctld return codes and add user/group info
  * salloc/sbatch/srun - print warning if mutually exclusive options of --mem
    and --mem-per-cpu are both set.
 - Refreshed:
  * pam_slurm_adopt-avoid-running-outside-of-the-sshd-PA.patch

OBS-URL: https://build.opensuse.org/request/show/663733
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=81
2019-01-08 19:05:14 +00:00
9eefc8e774 Accepting request 657422 from home:mslacken:slurm18
- restarting services on update only when activated 
- added rotation of logs
- Added backported patches which harden the pam module pam_slurm_adopt
  (BOO#1116758) which will be in slurm 19.05.x
  * added pam_slurm_adopt-avoid-running-outside-of-the-sshd-PA.patch
    [PATCH 1/3] pam_slurm_adopt: avoid running outside of the sshd PAM
  * added pam_slurm_adopt-send_user_msg-don-t-copy-undefined-d.patch
    [PATCH 2/3] pam_slurm_adopt: send_user_msg: don't copy undefined data
  * added pam_slurm_adopt-use-uid-to-determine-whether-root-is.patch
    [PATCH 3/3] pam_slurm_adopt: use uid to determine whether root is
    logging on
- package slurm-pam_slurm now depends on slurm-node and not on slurm

OBS-URL: https://build.opensuse.org/request/show/657422
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=79
2018-12-12 09:28:26 +00:00
8ddf42df7f Accepting request 655364 from home:mslacken:slurm18
- fixed code in %pretrans section to be compatible with lua 5.1

OBS-URL: https://build.opensuse.org/request/show/655364
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=77
2018-12-06 09:50:36 +00:00
f21d191e3c Accepting request 650545 from home:eeich:branches:network:cluster
- Added missing perl-base dependency.

- Moved HTML docs to doc package.

- Moved config man pages to a separate package: This way, they won't
  get installed on compute nodes.                                                                                                                                  

- Update to 18.08.3
  * Add new burst buffer state of "teardown-fail" to indicate the burst
    buffer teardown operation is failing on specific buffers.
  * Multiple backup slurmctld daemons can be configured
  * Enable jobs with zero node count for creation and/or deletion of persistent
    burst buffers.
  * Add "scontrol show dwstat" command to display Cray burst buffer status.
  * Add "GetSysStatus" option to burst_buffer.conf file.
  * Add node and partition configuration options of "CpuBind" to control
    default task binding.
  * Add "NumaCpuBind" option to knl.conf
  * Add sbatch "--batch" option to identify features required on batch node.
  * Add "BatchFeatures" field to output of "scontrol show job".
  * Add support for "--bb" option to sbatch command.
  * Add new SystemComment field to job data structure and database.
  * Expand reservation "flags" field from 32 to 64 bits.
  * Add job state flag of "SIGNALING" to avoid race condition.
  * Properly handle srun --will-run option when there are jobs in COMPLETING
    state.
  * Properly report who is signaling a step.
  * Don't combine updated reservation records in sreport's reservation report.
  * node_features plugin - Add suport for XOR & XAND of job constraints (node
    feature specifications).

OBS-URL: https://build.opensuse.org/request/show/650545
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=75
2018-11-20 17:07:44 +00:00
2390a20289 Accepting request 639244 from home:eeich:branches:network:cluster
- Move config man-pages to config package.

OBS-URL: https://build.opensuse.org/request/show/639244
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=73
2018-09-30 15:33:20 +00:00
39fedd2ce8 - added correct link flags for perl bindings (bsc#1108671)
* added correct linker search path in slurm-2.4.4-rpath.patch
  * perl:Switch is required by slurm torque wrappers

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=70
2018-09-24 09:37:13 +00:00
410ad28aca Accepting request 637165 from home:eeich:branches:network:cluster
- Fix Requires(pre) and Requires(post) for slurm-config and slurm-node.
  This fixes issues with failing slurm user creation when installed
  during initial system installation (bsc#1109373).

OBS-URL: https://build.opensuse.org/request/show/637165
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=69
2018-09-22 07:50:55 +00:00
dbb82d64bd - slurmctld-rerun-agent_init-when-backup-controller-takes-over.patch:
Fix race in the slurmctld backup controller which prevents it

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=67
2018-08-23 13:54:53 +00:00
fafb5a0196 Accepting request 629226 from home:eeich:branches:network:cluster
- slurmctld-rerun-agent_init-when-backup-controller-takes-over.patch
  Fix an issue where the fallback controller will not be able to idle
  nodes after a failover when a process has terminated (bsc#1084917).

OBS-URL: https://build.opensuse.org/request/show/629226
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=65
2018-08-14 13:18:35 +00:00
d5a2e95d8c Accepting request 629222 from home:eeich:branches:network:cluster
- Update to 17.11.9
  * Fix segfault in slurmctld when a job's node bitmap is NULL during a
    scheduling cycle.  Primarily caused by EnforcePartLimits=ALL.
  * Remove erroneous unlock in acct_gather_energy/ipmi.
  * Enable support for hwloc version 2.0.1.
  * Fix 'srun -q' (--qos) option handling.
  * Fix socket communication issue that can lead to lost task completition
    messages, which will cause a permanently stuck srun process.
  * Handle creation of TMPDIR if environment variable is set or changed in
    a task prolog script.
  * Avoid node layout fragmentation if running with a fixed CPU count but
    without Sockets and CoresPerSocket defined.
  * burst_buffer/cray - Fix datawarp swap default pool overriding jobdw.
  * Fix incorrect job priority assignment for multi-partition job with
    different PriorityTier settings on the partitions.
  * Fix sinfo to print correct node state.

- When using a remote shared StateSaveLocation, slurmctld needs to
  be started after remote filesystems have become available.
  Add 'remote-fs.target' to the 'After=' directive in slurmctld.service
  (boo#1103561).

- Update to 17.11.8
  * Fix incomplete RESPONSE_[RESOURCE|JOB_PACK]_ALLOCATION building path.
  * Do not allocate nodes that were marked down due to the node not responding
    by ResumeTimeout.
  * task/cray plugin - search for "mems" cgroup information in the file
    "cpuset.mems" then fall back to the file "mems".
  * Fix ipmi profile debug uninitialized variable.
  * PMIx: fixed the direct connect inline msg sending.

OBS-URL: https://build.opensuse.org/request/show/629222
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=64
2018-08-14 13:00:16 +00:00
62ef6634bc - Shield comments between script snippets with a %{!?nil:...} to
avoid them being interpreted as scripts - in which case the update
  level is passed as argument (see chapter 'Shared libraries' in:
  https://en.opensuse.org/openSUSE:Packaging_scriptlet_snippets)
  (bsc#1100850).

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=62
2018-07-11 12:08:06 +00:00
1337fac8b2 - Add: slurmsmwd-uses-xdaemon_-for-systemd.patch
* Fixes daemoniziation in newly introduced slurmsmwd daemon.
- Rename:
  split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for-systemd-compatibilty.patch
  * remain in sync with commit messages which introduced that file

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=60
2018-06-11 14:22:14 +00:00
3e1fd5dae9 Accepting request 616031 from home:mslacken
- Fix security issue in handling of username and gid fields
  CVE-2018-10995 and bsc#1095508 what implied an 
  update from 17.11.5 to 17.11.7
- renanmed split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for-systemd-compatibilty.patch
  to split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for.patch
  in order to be in sync with commit messages which introduced that file

OBS-URL: https://build.opensuse.org/request/show/616031
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=59
2018-06-11 14:18:08 +00:00
55d6d2b0c7 Accepting request 615950 from home:mslacken
- Fix security issue in handling of username and gid fields
  CVE-2018-10995 what implied an update from 17.11.5 to 17.11.7
- Update from 17.11.5 to 17.11.7
  Highlights of 17.11.6:
  * CRAY - Add slurmsmwd to the contribs/cray dir
  * PMIX - Added the direct connect authentication.
  * Prevent the backup slurmctld from losing the active/available node
    features list on takeover.
  * Be able to force power_down of cloud node even if in power_save state.
  * Allow cloud nodes to be recognized in Slurm when booted out of band.
  * Numerous fixes - check 'NEWS' file.
  Highlights of 17.11.7:
  * Notify srun and ctld when unkillable stepd exits.
  * Numerous fixes - check 'NEWS' file.

OBS-URL: https://build.opensuse.org/request/show/615950
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=58
2018-06-11 10:31:14 +00:00
7d56316590 Accepting request 599201 from home:eeich:branches:network:cluster
- Avoid running pretrans scripts when running in an instsys:
  there may be not much installed, yet. pretrans code should
  be done in lua, this way, it will be executed by the rpm-internal
  lua interpreter and not be passed to a shell which may not be
  around at the time this scriptlet is run (bsc#1090292).

OBS-URL: https://build.opensuse.org/request/show/599201
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=56
2018-04-20 09:24:13 +00:00
df7fca5b1f - Add requires for slurm-sql to the slurmdbd package.
- Add:
  * split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for-systemd-compatibilty.patch
  * slurmctld-uses-xdaemon_-for-systemd.patch
  * slurmd-uses-xdaemon_-for-systemd.patch
  * slurmdbd-uses-xdaemon_-for-systemd.patch
  * removed-deprecated-xdaemon.patch
  Fix interaction with systemd: systemd expects that a

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=54
2018-04-13 15:08:24 +00:00
d892c59e4e - Package READMEs for pam and pam_slurm_adopt.
- Use the new %%license directive for COPYING file.

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=53
2018-04-12 17:22:25 +00:00
8d80dfc527 - Fix interaction with systemd: systemd expects that a
daemonizing process doesn't go away until the PID file
  with it PID of the daemon has bee written (bsc#1084125).

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=52
2018-04-12 16:42:36 +00:00
7dbbe8e89d - Make sure systemd services get restarted only when all
packages are in a consistent state, not in the middle
  of an 'update' transaction (bsc#1088693).
  Since the %postun scripts that run on update are from
  the old package they cannot be changed - thus we work
  around the restart breakage.
  (bsc#1086859).

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=51
2018-04-11 11:50:15 +00:00
7025591d0d Accepting request 591103 from home:mslacken:hpc
- fixed wrong log file location in slurmdbd.conf and 
  fixed pid location for slurmdbd and made slurm-slurmdbd
  depend on slurm config which provides the dir /var/run/slurm
  (bsc#1086859)

OBS-URL: https://build.opensuse.org/request/show/591103
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=49
2018-03-28 08:20:56 +00:00
003175f991 Accepting request 587822 from home:mslacken
- added comment for (bsc#1085606)

OBS-URL: https://build.opensuse.org/request/show/587822
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=47
2018-03-16 09:52:14 +00:00
cbe6c9fcaa Accepting request 587617 from home:eeich:branches:network:cluster
- Fix security issue in accounting_storage/mysql plugin by always escaping
  strings within the slurmdbd. CVE-2018-7033
  http://web.nvd.nist.gov/view/vuln/detail?vulnId=CVE-2018-7033
  (bsc#1085240).
- Update slurm to v17.11.5 (FATE#325451)
  Highlights of 17.11:
  * Support for federated clusters to manage a single work-flow 
    across a set of clusters.
  * Support for heterogeneous job allocations (various processor types,
    memory sizes, etc. by job component). Support for heterogeneous job
    steps within a single MPI_COMM_WORLD is not yet supported for most
    configurations.
  * X11 support is now fully integrated with the main Slurm code. Remove
    any X11 plugin configured in your plugstack.conf file to avoid errors
    being logged about conflicting options.
  * Added new advanced reservation flag of "flex", which permits jobs
    requesting the reservation to begin prior to the reservation's 
    start time and use resources inside or outside of the reservation.
    A typical use case is to prevent jobs not explicitly requesting the
    reservation from using those reserved resources rather than forcing
    jobs requesting the reservation to use those resources in the time
    frame reserved.
  * The sprio command has been modified to report a job's priority
    information for every partition the job has been submitted to.
  * Group ID lookup performed at job submit time to avoid lookup on
    all compute nodes. Enable with PrologFlags=SendGIDs configuration
    parameter.
  * Slurm commands and daemons dynamically link to libslurmfull.so
    instead of statically linking. This dramatically reduces the
    footprint of Slurm.

OBS-URL: https://build.opensuse.org/request/show/587617
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=46
2018-03-15 19:52:49 +00:00