SHA256
1
0
forked from pool/slurm
Commit Graph

302 Commits

Author SHA256 Message Date
f88a1f8e69 Accepting request 715348 from home:eeich:branches:network:cluster
- Fix build for SLE-11-SP4 and older.

OBS-URL: https://build.opensuse.org/request/show/715348
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=102
2019-07-14 21:25:41 +00:00
Dominique Leuenberger
8991e2f1ad Accepting request 714909 from network:cluster
- added cray depend libraries to seperate package, as they are now
  built, since json is enabled

- Updated to 18.0.7 for fixing CVE-2019-12838 and (bsc#1140709)
  * Update "xauth list" to use the same 10000ms timeout as the other xauth
    commands.
  * Fix issue in gres code to handle a gres cnt of 0.
  * Don't purge jobs if backfill is running.
  * Verify job is pending add/removing accrual time.
  * Don't abort when the job doesn't have an association that was removed
    before the job was able to make it to the database.
  * Set state_reason if select_nodes() fails job for QOS or Account.
  * Avoid seg_fault on referencing association without a valid_qos bitmap.
  * If Association/QOS is removed on a pending job set that job as ineligible.
  * When changing a jobs account/qos always make sure you remove the old limits.
  * Don't reset a FAIL_QOS or FAIL_ACCOUNT job reason until the qos or
    account changed.
  * Restore "sreport -T ALL" functionality.
  * Correctly typecast signals being sent through the api.
  * Properly initialize structures throughout Slurm.
  * Sync "numtask" squeue format option for jobs and steps to "numtasks".
  * Fix sacct -PD to avoid CA before start jobs.
  * Fix potential deadlock with backup slurmctld.
  * Fixed issue with jobs not appearing in sacct after dependency satisfied.
  * Fix showing non-eligible jobs when asking with -j and not -s.
  * Fix issue with backfill scheduler scheduling tasks of an array
    when not the head job.
  * accounting_storage/mysql - fix SIGABRT in the archive load logic.
  * accounting_storage/mysql - fix memory leak in the archive load logic.
  * Limit records per single SQL statement when loading archived data. (forwarded request 714908 from mslacken)

OBS-URL: https://build.opensuse.org/request/show/714909
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=28
2019-07-13 11:50:15 +00:00
257676d4f2 Accepting request 714908 from home:mslacken:branches:network:cluster
- added cray depend libraries to seperate package, as they are now
  built, since json is enabled

- Updated to 18.0.7 for fixing CVE-2019-12838 and (bsc#1140709)
  * Update "xauth list" to use the same 10000ms timeout as the other xauth
    commands.
  * Fix issue in gres code to handle a gres cnt of 0.
  * Don't purge jobs if backfill is running.
  * Verify job is pending add/removing accrual time.
  * Don't abort when the job doesn't have an association that was removed
    before the job was able to make it to the database.
  * Set state_reason if select_nodes() fails job for QOS or Account.
  * Avoid seg_fault on referencing association without a valid_qos bitmap.
  * If Association/QOS is removed on a pending job set that job as ineligible.
  * When changing a jobs account/qos always make sure you remove the old limits.
  * Don't reset a FAIL_QOS or FAIL_ACCOUNT job reason until the qos or
    account changed.
  * Restore "sreport -T ALL" functionality.
  * Correctly typecast signals being sent through the api.
  * Properly initialize structures throughout Slurm.
  * Sync "numtask" squeue format option for jobs and steps to "numtasks".
  * Fix sacct -PD to avoid CA before start jobs.
  * Fix potential deadlock with backup slurmctld.
  * Fixed issue with jobs not appearing in sacct after dependency satisfied.
  * Fix showing non-eligible jobs when asking with -j and not -s.
  * Fix issue with backfill scheduler scheduling tasks of an array
    when not the head job.
  * accounting_storage/mysql - fix SIGABRT in the archive load logic.
  * accounting_storage/mysql - fix memory leak in the archive load logic.
  * Limit records per single SQL statement when loading archived data.

OBS-URL: https://build.opensuse.org/request/show/714908
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=100
2019-07-12 18:09:50 +00:00
fa2138ebce Accepting request 714002 from home:eeich:slurm-staging
- Fix build dependency issue around libibmad-devel introduced
  in SLE-12-SP4.

OBS-URL: https://build.opensuse.org/request/show/714002
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=99
2019-07-08 08:21:33 +00:00
5a25a5ea8b Accepting request 713918 from home:eeich:slurm-staging
- Add BuildRequires to address warnings during build:
  * for libcurl-devel, libssh2-devel and rrdtool-devel
  * for libjson-c-devel and liblz4-devel where available,
    disable these with --without-json and --without-lz4
    where not.
  * disable DataWarp (--without-datawarp).

OBS-URL: https://build.opensuse.org/request/show/713918
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=98
2019-07-08 05:48:14 +00:00
5f6fddfc21 - Remove stray BuildRequires for infiniband-diags-devel
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=97
2019-07-07 19:03:24 +00:00
db5ace2fb9 - Fix test for Factory
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=96
2019-07-07 14:57:15 +00:00
69c4464cd5 - Fix test for oS Factory
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=95
2019-07-07 12:42:55 +00:00
d212ad0245 Accepting request 713773 from home:eeich:branches:network:cluster
- Update SLURM to 18.08.7:
  * Set debug statement to debug2 to avoid benign error messages.
  * Add SchedulerParameters option of bf_hetjob_immediate to attempt to start
    a heterogeneous job as soon as all of its components are determined able
    to do so.
  * Fix underflow causing decay thread to exit.
  * Fix main scheduler not considering hetjobs when building the job queue.
  * Fix regression for sacct to display old jobs without a start time.
  * Fix setting correct number of gres topology bits.
  * Update hetjobs pending state reason when appropriate.
  * Fix accounting_storage/filetxt's understanding of TRES.
  * Set Accrue time when not enforcing limits.
  * Fix srun segfault when requesting a hetjob with test_exec or bcast
    options.
  * Hide multipart priorities log message behind Priority debug flag.
  * sched/backfill - Make hetjobs sensitive to bf_max_job_start.
  * Fix slurmctld segfault due to job's partition pointer NULL dereference.
  * Fix issue with OR'ed job dependencies.
  * Add new job's bit_flags of INVALID_DEPEND to prevent rebuilding a job's
    dependency string when it has at least one invalid and purged dependency.
  * Promote federation unsynced siblings log message from debug to info.
  * burst_buffer/cray - fix slurmctld SIGABRT due to illegal read/writes.
  * burst_buffer/cray - fix memory leak due to unfreed job script content.
  * node_features/knl_cray - fix script_argv use-after-free.
  * burst_buffer/cray - fix script_argv use-after-free.
  * Fix invalid reads of size 1 due to non null-terminated string reads.
  * Add extra debug2 logs to identify why BadConstraints reason is set.

OBS-URL: https://build.opensuse.org/request/show/713773
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=94
2019-07-07 04:27:16 +00:00
0c8ed23dc7 Accepting request 713744 from home:eeich:branches:network:cluster
- Do not build hdf5 support where not available.

OBS-URL: https://build.opensuse.org/request/show/713744
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=93
2019-07-06 20:02:33 +00:00
2536acafc5 Accepting request 713735 from home:eeich:branches:network:cluster
- Add support for version updates on SLE: Update packages to a
  later version than the version supported originally on SLE
  will receive a version string in their package name.

OBS-URL: https://build.opensuse.org/request/show/713735
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=92
2019-07-06 17:41:00 +00:00
Dominique Leuenberger
aa393a5ef3 Accepting request 706361 from network:cluster
OBS-URL: https://build.opensuse.org/request/show/706361
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=27
2019-06-01 07:55:54 +00:00
4a0199d836 Accepting request 679787 from home:mslacken:slurm18
- added the hdf5 job data gathering plugin

OBS-URL: https://build.opensuse.org/request/show/679787
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=90
2019-05-29 15:15:25 +00:00
Stephan Kulow
3f77b9a7fc Accepting request 670636 from network:cluster
OBS-URL: https://build.opensuse.org/request/show/670636
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=26
2019-02-02 20:50:18 +00:00
2b7d9f397e Accepting request 670635 from home:eeich:branches:network:cluster
- Add backward compatibility with SLE-11 SP4

OBS-URL: https://build.opensuse.org/request/show/670635
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=88
2019-02-01 19:44:10 +00:00
Dominique Leuenberger
d8bad37648 Accepting request 670462 from network:cluster
- Update to version 18.08.05-2:
  This version obsoletes:
  Fix-contrib-perlapi-to-build-with-the-fix-for-CVE-2019-6438-750cc23ed.patch
- Fix spec file for older SUSE versions.

- Update to version 18.08.05:
  * Add mitigation for a potential heap overflow on 32-bit systems in xmalloc.
    (CVE-2019-6438, bsc#1123304).
  * Other fixes:
    + Backfill - If a job has a time_limit guess the end time of a job better
      if OverTimeLimit is Unlimited.
    + Fix "sacctmgr show events event=cluster"
    + Fix sacctmgr show runawayjobs from sibling cluster
    + Avoid bit offset of -1 in call to bit_nclear().
    + Insure that "hbm" is a configured GresType on knl systems.
    + Fix NodeFeaturesPlugins=node_features/knl_generic to allow other gres
      other than knl.
    + cons_res: Prevent overflow on multiply.
    + Better debug for bad values in gres.conf.
    + Fix double accounting of energy at end of job.
    + Read gres.conf for cloud nodes on slurmctld.
    + Don't assume the first node of a job is the batch host when purging jobs
      from a node.
    + Better debugging when a job doesn't have a job_resrcs ptr.
    + Store ave watts in energy plugins.
    + Add XCC plugin for reading Lenovo Power.
    + Fix minor memory leak when scheduling rebootable nodes.
    + Fix debug2 prefix for sched log.
    + Fix printing correct SLURM_JOB_ACCOUNT_PACK_GROUP_* in env for a Het Job.
    + sbatch - search current working directory first for job script.

OBS-URL: https://build.opensuse.org/request/show/670462
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=25
2019-02-01 10:48:34 +00:00
a857bd00b6 - Fix build.
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=86
2019-01-31 21:19:18 +00:00
acb7e0505a - Update to version 18.08.05-2:
This version obsoletes:
  Fix-contrib-perlapi-to-build-with-the-fix-for-CVE-2019-6438-750cc23ed.patch
- Fix spec file for older SUSE versions.

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=85
2019-01-31 20:33:20 +00:00
2ff256ff3d - Structural fixes to build on older openSUSE and SLE versions.
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=84
2019-01-31 20:14:27 +00:00
c9da5cd5a9 Accepting request 670322 from home:eeich:branches:network:cluster
- Update to version 18.08.05:
  * Add mitigation for a potential heap overflow on 32-bit systems in xmalloc.
    (CVE-2019-6438, bsc#1123304).
  * Other fixes:
    + Backfill - If a job has a time_limit guess the end time of a job better
      if OverTimeLimit is Unlimited.
    + Fix "sacctmgr show events event=cluster"
    + Fix sacctmgr show runawayjobs from sibling cluster
    + Avoid bit offset of -1 in call to bit_nclear().
    + Insure that "hbm" is a configured GresType on knl systems.
    + Fix NodeFeaturesPlugins=node_features/knl_generic to allow other gres
      other than knl.
    + cons_res: Prevent overflow on multiply.
    + Better debug for bad values in gres.conf.
    + Fix double accounting of energy at end of job.
    + Read gres.conf for cloud nodes on slurmctld.
    + Don't assume the first node of a job is the batch host when purging jobs
      from a node.
    + Better debugging when a job doesn't have a job_resrcs ptr.
    + Store ave watts in energy plugins.
    + Add XCC plugin for reading Lenovo Power.
    + Fix minor memory leak when scheduling rebootable nodes.
    + Fix debug2 prefix for sched log.
    + Fix printing correct SLURM_JOB_ACCOUNT_PACK_GROUP_* in env for a Het Job.
    + sbatch - search current working directory first for job script.
    + Make it so held jobs reset the AccrueTime and do not count against any
      AccrueTime limits.
    + Add SchedulerParameters option of bf_hetjob_prio=[min|avg|max] to alter
      the job sorting algorithm for scheduling heterogeneous jobs.
    + Fix initialization of assoc_mgr_locks and slurmctld_locks lock

OBS-URL: https://build.opensuse.org/request/show/670322
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=83
2019-01-31 11:56:59 +00:00
Dominique Leuenberger
74b4d5ddb3 Accepting request 663813 from network:cluster
OBS-URL: https://build.opensuse.org/request/show/663813
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=24
2019-01-21 09:47:31 +00:00
364aa9908a Accepting request 663733 from home:mslacken:slurm18
- Update to 18.08.04, with following highlights
  * Fix message sent to user to display preempted instead of time limit when
    a job is preempted.
  * Fix memory leak when a failure happens processing a nodes gres config.
  * Improve error message when failures happen processing a nodes gres config.
  * Don't skip jobs in scontrol hold.
  * Allow --cpu-bind=verbose to be used with SLURM_HINT environment variable.
  * Enhanced handling for runaway jobs
  * cons_res: Delay exiting cr_job_test until after cores/cpus are calculated
    and distributed.
  * Don't check existence of srun --prolog or --epilog executables when set to
    "none" and SLURM_TEST_EXEC is used.
  * Add "P" suffix support to job and step tres specifications.
  * Fix jobacct_gather/cgroup to work correctly when more than one task is
    started on a node.
  * salloc - set SLURM_NTASKS_PER_CORE and SLURM_NTASKS_PER_SOCKET in the
    environment if the corresponding command line options are used.
  * slurmd - fix handling of the -f flag to specify alternate config file
    locations.
  * Add SchedulerParameters option of bf_ignore_newly_avail_nodes to avoid
    scheduling lower priority jobs on resources that become available during
    the backfill scheduling cycle when bf_continue is enabled.
  * job_submit/lua: Add several slurmctld return codes and add user/group info
  * salloc/sbatch/srun - print warning if mutually exclusive options of --mem
    and --mem-per-cpu are both set.
 - Refreshed:
  * pam_slurm_adopt-avoid-running-outside-of-the-sshd-PA.patch

OBS-URL: https://build.opensuse.org/request/show/663733
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=81
2019-01-08 19:05:14 +00:00
Dominique Leuenberger
5f5fe54c27 Accepting request 657426 from network:cluster
OBS-URL: https://build.opensuse.org/request/show/657426
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=23
2018-12-12 16:31:01 +00:00
9eefc8e774 Accepting request 657422 from home:mslacken:slurm18
- restarting services on update only when activated 
- added rotation of logs
- Added backported patches which harden the pam module pam_slurm_adopt
  (BOO#1116758) which will be in slurm 19.05.x
  * added pam_slurm_adopt-avoid-running-outside-of-the-sshd-PA.patch
    [PATCH 1/3] pam_slurm_adopt: avoid running outside of the sshd PAM
  * added pam_slurm_adopt-send_user_msg-don-t-copy-undefined-d.patch
    [PATCH 2/3] pam_slurm_adopt: send_user_msg: don't copy undefined data
  * added pam_slurm_adopt-use-uid-to-determine-whether-root-is.patch
    [PATCH 3/3] pam_slurm_adopt: use uid to determine whether root is
    logging on
- package slurm-pam_slurm now depends on slurm-node and not on slurm

OBS-URL: https://build.opensuse.org/request/show/657422
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=79
2018-12-12 09:28:26 +00:00
Dominique Leuenberger
d456564fc2 Accepting request 655559 from network:cluster
OBS-URL: https://build.opensuse.org/request/show/655559
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=22
2018-12-07 13:34:03 +00:00
8ddf42df7f Accepting request 655364 from home:mslacken:slurm18
- fixed code in %pretrans section to be compatible with lua 5.1

OBS-URL: https://build.opensuse.org/request/show/655364
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=77
2018-12-06 09:50:36 +00:00
Dominique Leuenberger
ce6f6d350e Accepting request 653720 from network:cluster
Automatic submission by obs-autosubmit

OBS-URL: https://build.opensuse.org/request/show/653720
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=21
2018-12-04 19:57:26 +00:00
f21d191e3c Accepting request 650545 from home:eeich:branches:network:cluster
- Added missing perl-base dependency.

- Moved HTML docs to doc package.

- Moved config man pages to a separate package: This way, they won't
  get installed on compute nodes.                                                                                                                                  

- Update to 18.08.3
  * Add new burst buffer state of "teardown-fail" to indicate the burst
    buffer teardown operation is failing on specific buffers.
  * Multiple backup slurmctld daemons can be configured
  * Enable jobs with zero node count for creation and/or deletion of persistent
    burst buffers.
  * Add "scontrol show dwstat" command to display Cray burst buffer status.
  * Add "GetSysStatus" option to burst_buffer.conf file.
  * Add node and partition configuration options of "CpuBind" to control
    default task binding.
  * Add "NumaCpuBind" option to knl.conf
  * Add sbatch "--batch" option to identify features required on batch node.
  * Add "BatchFeatures" field to output of "scontrol show job".
  * Add support for "--bb" option to sbatch command.
  * Add new SystemComment field to job data structure and database.
  * Expand reservation "flags" field from 32 to 64 bits.
  * Add job state flag of "SIGNALING" to avoid race condition.
  * Properly handle srun --will-run option when there are jobs in COMPLETING
    state.
  * Properly report who is signaling a step.
  * Don't combine updated reservation records in sreport's reservation report.
  * node_features plugin - Add suport for XOR & XAND of job constraints (node
    feature specifications).

OBS-URL: https://build.opensuse.org/request/show/650545
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=75
2018-11-20 17:07:44 +00:00
Dominique Leuenberger
86c9afa17d Accepting request 639245 from network:cluster
- Move config man-pages to config package. (forwarded request 639244 from eeich)

OBS-URL: https://build.opensuse.org/request/show/639245
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=20
2018-10-01 07:08:06 +00:00
2390a20289 Accepting request 639244 from home:eeich:branches:network:cluster
- Move config man-pages to config package.

OBS-URL: https://build.opensuse.org/request/show/639244
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=73
2018-09-30 15:33:20 +00:00
Dominique Leuenberger
d570b4c591 Accepting request 637642 from network:cluster
- added correct link flags for perl bindings (bsc#1108671)
  * added correct linker search path in slurm-2.4.4-rpath.patch
  * perl:Switch is required by slurm torque wrappers

OBS-URL: https://build.opensuse.org/request/show/637642
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=19
2018-09-25 13:42:36 +00:00
Dominique Leuenberger
e67629e7ac Accepting request 637167 from network:cluster
- Fix Requires(pre) and Requires(post) for slurm-config and slurm-node.
  This fixes issues with failing slurm user creation when installed
  during initial system installation (bsc#1109373). (forwarded request 637165 from eeich)

OBS-URL: https://build.opensuse.org/request/show/637167
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=18
2018-09-24 11:13:28 +00:00
39fedd2ce8 - added correct link flags for perl bindings (bsc#1108671)
* added correct linker search path in slurm-2.4.4-rpath.patch
  * perl:Switch is required by slurm torque wrappers

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=70
2018-09-24 09:37:13 +00:00
410ad28aca Accepting request 637165 from home:eeich:branches:network:cluster
- Fix Requires(pre) and Requires(post) for slurm-config and slurm-node.
  This fixes issues with failing slurm user creation when installed
  during initial system installation (bsc#1109373).

OBS-URL: https://build.opensuse.org/request/show/637165
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=69
2018-09-22 07:50:55 +00:00
Dominique Leuenberger
787af67337 Accepting request 631120 from network:cluster
- slurmctld-rerun-agent_init-when-backup-controller-takes-over.patch:
  Fix race in the slurmctld backup controller which prevents it

OBS-URL: https://build.opensuse.org/request/show/631120
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=17
2018-08-24 15:11:07 +00:00
dbb82d64bd - slurmctld-rerun-agent_init-when-backup-controller-takes-over.patch:
Fix race in the slurmctld backup controller which prevents it

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=67
2018-08-23 13:54:53 +00:00
Dominique Leuenberger
86275b2ca6 Accepting request 629227 from network:cluster
- slurmctld-rerun-agent_init-when-backup-controller-takes-over.patch
  Fix an issue where the fallback controller will not be able to idle
  nodes after a failover when a process has terminated (bsc#1084917). (forwarded request 629226 from eeich)

OBS-URL: https://build.opensuse.org/request/show/629227
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=16
2018-08-17 22:02:08 +00:00
fafb5a0196 Accepting request 629226 from home:eeich:branches:network:cluster
- slurmctld-rerun-agent_init-when-backup-controller-takes-over.patch
  Fix an issue where the fallback controller will not be able to idle
  nodes after a failover when a process has terminated (bsc#1084917).

OBS-URL: https://build.opensuse.org/request/show/629226
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=65
2018-08-14 13:18:35 +00:00
d5a2e95d8c Accepting request 629222 from home:eeich:branches:network:cluster
- Update to 17.11.9
  * Fix segfault in slurmctld when a job's node bitmap is NULL during a
    scheduling cycle.  Primarily caused by EnforcePartLimits=ALL.
  * Remove erroneous unlock in acct_gather_energy/ipmi.
  * Enable support for hwloc version 2.0.1.
  * Fix 'srun -q' (--qos) option handling.
  * Fix socket communication issue that can lead to lost task completition
    messages, which will cause a permanently stuck srun process.
  * Handle creation of TMPDIR if environment variable is set or changed in
    a task prolog script.
  * Avoid node layout fragmentation if running with a fixed CPU count but
    without Sockets and CoresPerSocket defined.
  * burst_buffer/cray - Fix datawarp swap default pool overriding jobdw.
  * Fix incorrect job priority assignment for multi-partition job with
    different PriorityTier settings on the partitions.
  * Fix sinfo to print correct node state.

- When using a remote shared StateSaveLocation, slurmctld needs to
  be started after remote filesystems have become available.
  Add 'remote-fs.target' to the 'After=' directive in slurmctld.service
  (boo#1103561).

- Update to 17.11.8
  * Fix incomplete RESPONSE_[RESOURCE|JOB_PACK]_ALLOCATION building path.
  * Do not allocate nodes that were marked down due to the node not responding
    by ResumeTimeout.
  * task/cray plugin - search for "mems" cgroup information in the file
    "cpuset.mems" then fall back to the file "mems".
  * Fix ipmi profile debug uninitialized variable.
  * PMIx: fixed the direct connect inline msg sending.

OBS-URL: https://build.opensuse.org/request/show/629222
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=64
2018-08-14 13:00:16 +00:00
Dominique Leuenberger
1a766a5938 Accepting request 622077 from network:cluster
- Shield comments between script snippets with a %{!?nil:...} to
  avoid them being interpreted as scripts - in which case the update
  level is passed as argument (see chapter 'Shared libraries' in:
  https://en.opensuse.org/openSUSE:Packaging_scriptlet_snippets)
  (bsc#1100850).

OBS-URL: https://build.opensuse.org/request/show/622077
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=15
2018-07-13 08:20:52 +00:00
62ef6634bc - Shield comments between script snippets with a %{!?nil:...} to
avoid them being interpreted as scripts - in which case the update
  level is passed as argument (see chapter 'Shared libraries' in:
  https://en.opensuse.org/openSUSE:Packaging_scriptlet_snippets)
  (bsc#1100850).

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=62
2018-07-11 12:08:06 +00:00
Yuchen Lin
502ac7ba66 Accepting request 616050 from network:cluster
- Update from 17.11.5 to 17.11.7
- Fix security issue in handling of username and gid fields
  CVE-2018-10995 and bsc#1095508 what implied an 
  update from 17.11.5 to 17.11.7
  Highlights of 17.11.6:
  * CRAY - Add slurmsmwd to the contribs/cray dir
  * PMIX - Added the direct connect authentication.
  * Prevent the backup slurmctld from losing the active/available node
    features list on takeover.
  * Be able to force power_down of cloud node even if in power_save state.
  * Allow cloud nodes to be recognized in Slurm when booted out of band.
  * Numerous fixes - check 'NEWS' file.
  Highlights of 17.11.7:
  * Notify srun and ctld when unkillable stepd exits.
  * Numerous fixes - check 'NEWS' file.
- Add: slurmsmwd-uses-xdaemon_-for-systemd.patch
  * Fixes daemoniziation in newly introduced slurmsmwd daemon.
- Rename:
  split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for-systemd-compatibilty.patch
  to split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for.patch
  * remain in sync with commit messages which introduced that file

OBS-URL: https://build.opensuse.org/request/show/616050
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=14
2018-06-13 13:39:46 +00:00
1337fac8b2 - Add: slurmsmwd-uses-xdaemon_-for-systemd.patch
* Fixes daemoniziation in newly introduced slurmsmwd daemon.
- Rename:
  split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for-systemd-compatibilty.patch
  * remain in sync with commit messages which introduced that file

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=60
2018-06-11 14:22:14 +00:00
3e1fd5dae9 Accepting request 616031 from home:mslacken
- Fix security issue in handling of username and gid fields
  CVE-2018-10995 and bsc#1095508 what implied an 
  update from 17.11.5 to 17.11.7
- renanmed split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for-systemd-compatibilty.patch
  to split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for.patch
  in order to be in sync with commit messages which introduced that file

OBS-URL: https://build.opensuse.org/request/show/616031
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=59
2018-06-11 14:18:08 +00:00
55d6d2b0c7 Accepting request 615950 from home:mslacken
- Fix security issue in handling of username and gid fields
  CVE-2018-10995 what implied an update from 17.11.5 to 17.11.7
- Update from 17.11.5 to 17.11.7
  Highlights of 17.11.6:
  * CRAY - Add slurmsmwd to the contribs/cray dir
  * PMIX - Added the direct connect authentication.
  * Prevent the backup slurmctld from losing the active/available node
    features list on takeover.
  * Be able to force power_down of cloud node even if in power_save state.
  * Allow cloud nodes to be recognized in Slurm when booted out of band.
  * Numerous fixes - check 'NEWS' file.
  Highlights of 17.11.7:
  * Notify srun and ctld when unkillable stepd exits.
  * Numerous fixes - check 'NEWS' file.

OBS-URL: https://build.opensuse.org/request/show/615950
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=58
2018-06-11 10:31:14 +00:00
Dominique Leuenberger
eecd28fef6 Accepting request 599202 from network:cluster
- Avoid running pretrans scripts when running in an instsys:
  there may be not much installed, yet. pretrans code should
  be done in lua, this way, it will be executed by the rpm-internal
  lua interpreter and not be passed to a shell which may not be
  around at the time this scriptlet is run (bsc#1090292). (forwarded request 599201 from eeich)

OBS-URL: https://build.opensuse.org/request/show/599202
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=13
2018-04-20 15:32:10 +00:00
7d56316590 Accepting request 599201 from home:eeich:branches:network:cluster
- Avoid running pretrans scripts when running in an instsys:
  there may be not much installed, yet. pretrans code should
  be done in lua, this way, it will be executed by the rpm-internal
  lua interpreter and not be passed to a shell which may not be
  around at the time this scriptlet is run (bsc#1090292).

OBS-URL: https://build.opensuse.org/request/show/599201
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=56
2018-04-20 09:24:13 +00:00
Dominique Leuenberger
648ad9864b Accepting request 596387 from network:cluster
- Add requires for slurm-sql to the slurmdbd package.

- Package READMEs for pam and pam_slurm_adopt.
- Use the new %%license directive for COPYING file.

- Add:
  * split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for-systemd-compatibilty.patch
  * slurmctld-uses-xdaemon_-for-systemd.patch
  * slurmd-uses-xdaemon_-for-systemd.patch
  * slurmdbd-uses-xdaemon_-for-systemd.patch
  * removed-deprecated-xdaemon.patch
  Fix interaction with systemd: systemd expects that a 
  daemonizing process doesn't go away until the PID file
  with it PID of the daemon has bee written (bsc#1084125).

- Make sure systemd services get restarted only when all
  packages are in a consistent state, not in the middle
  of an 'update' transaction (bsc#1088693).
  Since the %postun scripts that run on update are from
  the old package they cannot be changed - thus we work
  around the restart breakage.

  (bsc#1086859).

OBS-URL: https://build.opensuse.org/request/show/596387
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/slurm?expand=0&rev=12
2018-04-16 10:49:00 +00:00
df7fca5b1f - Add requires for slurm-sql to the slurmdbd package.
- Add:
  * split-xdaemon-in-xdaemon_init-and-xdaemon_finish-for-systemd-compatibilty.patch
  * slurmctld-uses-xdaemon_-for-systemd.patch
  * slurmd-uses-xdaemon_-for-systemd.patch
  * slurmdbd-uses-xdaemon_-for-systemd.patch
  * removed-deprecated-xdaemon.patch
  Fix interaction with systemd: systemd expects that a

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=54
2018-04-13 15:08:24 +00:00
d892c59e4e - Package READMEs for pam and pam_slurm_adopt.
- Use the new %%license directive for COPYING file.

OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=53
2018-04-12 17:22:25 +00:00