SHA256
1
0
forked from pool/slurm

Accepting request 629226 from home:eeich:branches:network:cluster

- slurmctld-rerun-agent_init-when-backup-controller-takes-over.patch
  Fix an issue where the fallback controller will not be able to idle
  nodes after a failover when a process has terminated (bsc#1084917).

OBS-URL: https://build.opensuse.org/request/show/629226
OBS-URL: https://build.opensuse.org/package/show/network:cluster/slurm?expand=0&rev=65
This commit is contained in:
Egbert Eich 2018-08-14 13:18:35 +00:00 committed by Git OBS Bridge
parent d5a2e95d8c
commit fafb5a0196
2 changed files with 4 additions and 1 deletions

View File

@ -17,6 +17,9 @@ Tue Aug 14 10:26:43 UTC 2018 - eich@suse.com
* Fix incorrect job priority assignment for multi-partition job with
different PriorityTier settings on the partitions.
* Fix sinfo to print correct node state.
- slurmctld-rerun-agent_init-when-backup-controller-takes-over.patch
Fix an issue where the fallback controller will not be able to idle
nodes after a failover when a process has terminated (bsc#1084917).
-------------------------------------------------------------------
Thu Aug 2 11:35:55 UTC 2018 - eich@suse.com

View File

@ -3,7 +3,7 @@ Date: Tue Jul 31 17:31:15 2018 +0200
Subject: slurmctld: rerun agent_init() when backup controller takes over
Patch-mainline: Not yet
Git-commit: 169d9522c89a10dcffbf1403c20b4e6249bac79b
References:
References: bsc#1084917
A slurmctld backup controller often fails to clean up jobs which have
finished, the node appears in an 'IDLE+COMPLETING' state while squeue -l