10 Commits

Author SHA256 Message Date
02e5fedcc8 Accepting request 1127173 from network:cluster
- updated to 1.4.3 with following new features:
  * toggle BASH tracing or NHC debugging via SIGUSR1/SIGUSR2, respectively
  * check_nvsmi_healthmon(): New check from CSC for GPU health monitoring via
    nvidia-smi
  * Provide added detail to tracing info (-x mode)
  * Based on feedback from Moe Jette of SchedMD, pull node job data directly
    from Slurm via squeue instead of the previous method that only worked for
    single-node jobs.
  * Support for recent additions to the Slurm node states (e.g., "planned")
  * Pathname expansion has been disabled on startup, and re-enabled only when
    being actively used, to avoid "unintended" expansions of wildcards at
    random points throughout the code.
  * Correct clobbering of BASH built-in variables and add tests to prevent future recurrence
  * Switch "system UID" boundary handling to a more accurate source of truth,
    and ensure that the code matches the math, naming, and intent.
  * Reorder resource manager detection to improve accurate detection,
    especially with respect to Slurm vs. PBS (all variants)
- removed test-test_lbnl_file.nhc-Put-all-process-substitution.patch

OBS-URL: https://build.opensuse.org/request/show/1127173
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/warewulf-nhc?expand=0&rev=2
2023-11-17 19:49:29 +00:00
67043943ac - removed test-test_lbnl_file.nhc-Put-all-process-substitution.patch
OBS-URL: https://build.opensuse.org/package/show/network:cluster/warewulf-nhc?expand=0&rev=9
2023-11-16 19:44:43 +00:00
f8257bc5f0 Accepting request 1127170 from home:mslacken:branches:network:cluster
- updated to 1.4.3 with following new features:
  * toggle BASH tracing or NHC debugging via SIGUSR1/SIGUSR2, respectively
  * check_nvsmi_healthmon(): New check from CSC for GPU health monitoring via
    nvidia-smi
  * Provide added detail to tracing info (-x mode)
  * Based on feedback from Moe Jette of SchedMD, pull node job data directly
    from Slurm via squeue instead of the previous method that only worked for
    single-node jobs.
  * Support for recent additions to the Slurm node states (e.g., "planned")
  * Pathname expansion has been disabled on startup, and re-enabled only when
    being actively used, to avoid "unintended" expansions of wildcards at
    random points throughout the code.
  * Correct clobbering of BASH built-in variables and add tests to prevent future recurrence
  * Switch "system UID" boundary handling to a more accurate source of truth,
    and ensure that the code matches the math, naming, and intent.
  * Reorder resource manager detection to improve accurate detection,
    especially with respect to Slurm vs. PBS (all variants)

OBS-URL: https://build.opensuse.org/request/show/1127170
OBS-URL: https://build.opensuse.org/package/show/network:cluster/warewulf-nhc?expand=0&rev=8
2023-11-16 19:29:07 +00:00
bc9a1ef111 Accepting request 786942 from network:cluster
node health checker which can be used by slurm

OBS-URL: https://build.opensuse.org/request/show/786942
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/warewulf-nhc?expand=0&rev=1
2020-03-23 11:50:20 +00:00
219048a71b now disabled
OBS-URL: https://build.opensuse.org/package/show/network:cluster/warewulf-nhc?expand=0&rev=6
2020-03-20 15:51:16 +00:00
3bd0c49d4e no test for tw
OBS-URL: https://build.opensuse.org/package/show/network:cluster/warewulf-nhc?expand=0&rev=5
2020-03-20 15:48:38 +00:00
93230d12bc added source
OBS-URL: https://build.opensuse.org/package/show/network:cluster/warewulf-nhc?expand=0&rev=4
2020-03-20 15:34:13 +00:00
5c56ad6ab0 - updated to 1.4.2 with following new features:
* Support for negating *any* match string anywhere
  * check_net_ping():  New check for monitoring of network connectivity
  * check_ps_*():  Process owner parameters now accept match strings
  * check_cmd_dmesg():  New check to validate/verify or catch/flag
  * check_fs_mount():  Create missing mount points as necessary
  * New command-line flag:  "-e <check>" will override config file,
- added patch to fix error during test phase
  * test-test_lbnl_file.nhc-Put-all-process-substitution.patch

OBS-URL: https://build.opensuse.org/package/show/network:cluster/warewulf-nhc?expand=0&rev=3
2020-03-20 15:27:01 +00:00
80a267c995 Accepting request 576267 from home:mslacken
- version 1.4.1 
 * Too many changes. See ChangeLog file for details

OBS-URL: https://build.opensuse.org/request/show/576267
OBS-URL: https://build.opensuse.org/package/show/network:cluster/warewulf-nhc?expand=0&rev=2
2018-09-26 14:21:58 +00:00
Tobias Burnus
aa65183872 Accepting request 141678 from home:scorot:branches:network:cluster
Warewulf Node Health Check (NHC)

OBS-URL: https://build.opensuse.org/request/show/141678
OBS-URL: https://build.opensuse.org/package/show/network:cluster/warewulf-nhc?expand=0&rev=1
2013-03-31 22:13:21 +00:00