factory
158 Commits
| Author | SHA256 | Message | Date | |
|---|---|---|---|---|
| 10f84dfbfc |
- Update to v1.23.2:
* Fixed a buffer overrun vulnerability in Netdata's JSON parsing
code. This vulnerability could be used to crash Agents
remotely, and in some circumstances, could be used in an
arbitrary code execution (ACE) exploit.
Improvements:
* Add support for multiple ACLK query processing threads
* Add Infiniband monitoring to collector proc.plugin
* Change the HTTP method to make the IPFS collector compatible
with 0.5.0+
* Add support for returning headers using python.d's UrlService
Bug fixes:
* Fix vulnerability in JSON parsing
* Fixed stored number accuracy
* Fix transition from archived to active charts not generating
alarms
* Fix PyMySQL library to respect my.cnf parameter
* Remove health from archived metrics
* Update exporting engine to read the prefix option from instance
config sections
* Fix display error in Swagger API documentation
* Wrap exporting engine header definitions in compilation
conditions
* Improve cgroups collector to autodetect unified cgroups
* Fix CMake build failing if ACLK is disabled
* Fix now_ms in charts.d collector to prevent tc-qos-helper
crashes
* Fix python.d crashes by adding a lock to stdout write function
* Fix an issue with random crashes when updating a chart's
metadata on the fly
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=58
|
|||
| 41ba39f1b0 |
Accepting request 819067 from network
OBS-URL: https://build.opensuse.org/request/show/819067 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=19 |
|||
| a645c001cf |
Accepting request 818990 from home:seife:branches:network
update to v1.23.1 OBS-URL: https://build.opensuse.org/request/show/818990 OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=56 |
|||
| 587a90c4f9 |
Accepting request 805922 from network
- Update to v1.22.1:
Bug fixes:
- Fixed the latency issue on the ACLK and suppress the diagnostics.
- Restored old semantics of "netdata -W set" command.
- Removed check for old alarm status.
- Changes for v1.22.0:
Breaking Changes:
* The previous iteration of Netdata Cloud, accessible through
various Sign in and Nodes view (beta) buttons on the Agent
dashboard, is deprecated in favor of the new Cloud experience.
* Our old documentation site (docs.netdata.cloud) was replaced
with Netdata Learn. All existing backlinks redirect to the new
site.
* Our localization project is no longer actively maintained.
We're grateful for the hard work of its contributors.
Improvements:
* Netdata Cloud:
- Enabled support for Netdata Cloud.
- Added TTL headers to ACLK responses.
- Improved the thread exit fixes in #8750.
- Improved ACLK reconnection sequence.
- Improved ACLK memory management and shutdown sequence.
- Added session-id to ACLK using connect timestamp.
* Collectors:
- Improved the index size for the eBPF collector.
- Added health alarm templates for the whoisquery collector.
- Added a whoisquery collector.
- Removed an automatic restart of apps.plugin.
* Exporting:
- Enabled internal statistics for the exporting engine in the
Agent dashboard.
- Implemented a Prometheus exporter web API endpoint.
* Notifications:
- Added a certificate revocation alarm for the x509check
collector.
- Added the ability to send Agent alarm notifications to
Dynatrace.
* Other:
- Updated main copyright and links for the year 2020 in daemon
help output.
- Moved bind to to [web] section and update
netdata.service.v235.in to sync it with recent changes.
- Put old dashboard behind a prefix instead of using a script
to switch.
- Enabled the truthy rule in yamllint.
- Added Borg backup, Squeezebox servers, Hiawatha web server,
and Microsoft SQL to apps.plugin so that it can appropriately
group them by type of service.
* Bug fixes:
- Fixed mdstat failed devices alarm.
- Fixed rare race condition in old Cloud iframe.
- Removed no-clear-notification options from portcheck health
templates.
- Fixed old URLs to silence Netlify's mixed content warnings.
- Fixed master streaming fatal exits.
- Fixed email authentiation to Cloud/Nodes View.
- Fixed non-escaped characters in private registry URLs.
- Fixed crash when shutting down an Agent with the ACLK
disabled.
- Fixed status checks for UPS devices using the apcupsd
collector.
- Fixed alarm notification script by adding a check to the
Dynatrace notification method.
- Fixed threads_creation_rate chart context in the
python.d MySQL collector.
- Fixed sudo check in charts.d libreswan collector to prevent
daily security notices.
OBS-URL: https://build.opensuse.org/request/show/805922
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=18
|
|||
| 3d302d8611 |
- Update to v1.22.1:
Bug fixes:
- Fixed the latency issue on the ACLK and suppress the diagnostics.
- Restored old semantics of "netdata -W set" command.
- Removed check for old alarm status.
- Changes for v1.22.0:
Breaking Changes:
* The previous iteration of Netdata Cloud, accessible through
various Sign in and Nodes view (beta) buttons on the Agent
dashboard, is deprecated in favor of the new Cloud experience.
* Our old documentation site (docs.netdata.cloud) was replaced
with Netdata Learn. All existing backlinks redirect to the new
site.
* Our localization project is no longer actively maintained.
We're grateful for the hard work of its contributors.
Improvements:
* Netdata Cloud:
- Enabled support for Netdata Cloud.
- Added TTL headers to ACLK responses.
- Improved the thread exit fixes in #8750.
- Improved ACLK reconnection sequence.
- Improved ACLK memory management and shutdown sequence.
- Added session-id to ACLK using connect timestamp.
* Collectors:
- Improved the index size for the eBPF collector.
- Added health alarm templates for the whoisquery collector.
- Added a whoisquery collector.
- Removed an automatic restart of apps.plugin.
* Exporting:
- Enabled internal statistics for the exporting engine in the
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=54
|
|||
| e59c1d429c |
Accepting request 793692 from network
- Update to v1.21.1: Release v1.21.1 is a hotfix release to improve the performance of the new React dashboard, which was merged and enabled by default in v1.21.0. The React dashboard shipped in v1.21.0 did not properly freeze charts that were outside of the browser's viewport. If a user who loaded many charts by scrolling through the dashboard, charts outside of their browser's viewport continued updating. This excess of chart updates caused all charts to update more slowly than every second. v1.21.1 includes improvements to the way the Netdata dashboard freezes, maintains state, and restores charts as users scroll. OBS-URL: https://build.opensuse.org/request/show/793692 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=17 |
|||
| ea540c3b70 |
- Update to v1.21.1:
Release v1.21.1 is a hotfix release to improve the performance of the new React dashboard, which was merged and enabled by default in v1.21.0. The React dashboard shipped in v1.21.0 did not properly freeze charts that were outside of the browser's viewport. If a user who loaded many charts by scrolling through the dashboard, charts outside of their browser's viewport continued updating. This excess of chart updates caused all charts to update more slowly than every second. v1.21.1 includes improvements to the way the Netdata dashboard freezes, maintains state, and restores charts as users scroll. OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=52 |
|||
| 04cdda6976 |
Accepting request 792974 from network
- Update to v1.21.0 (go.d.plugin v0.18.0):
Improvements:
* Extended TLS support for 1.3.
* Switched to the React dashboard code as the default dashboard.
* Collectors:
- Added a new Pulsar collector.
- Added a new VerneMQ collector.
- Added high precision timer support for plugins such as
idlejitter.
- Added an alarm to the dns_query collector that detects DNS
query failure.
- Added the ability to get the pod name from cgroup with
kubectl in bare-metal deployments.
- Added the ability to connect to non-admin user IDs for a Ceph
storage cluster.
- Added connections (backend) usage to Postgres monitoring.
* Exporting:
- Added a MongoDB connector to the exporting engine.
- Added a Prometheus Remote Write connector to the exporting
engine.
- Added an AWS Kinesis connector to the exporting engine.
Bug fixes:
* Removed notifications from the dashboard and fixed the
/default.html route.
* Fixed help-tooltips styling, private registry node deletion,
and the right-hand sidebar "jumping" on document clicks.
* Fixed errors reported by Coverity.
* Fixed broken pipe ignoring in apps.plugin.
* Fixed the bytespersec chart context in the Python Apache
collector.
* Fixed charts.d.plugin to exit properly during Netdata
service restart
* Fixed minimist dependency vulnerability.
* Fixed how SimpleService truncates Python module names.
* Added proper prefix to Python module names during loading.
* Fixed the flushing error threshold with the database engine.
* Fixed memory leak for host labels streaming from slaves to
master.
* Fixed streaming scaling.
* Fixed missing characters in kernel version field by encoding
slave fields.
* Fixed Ceph collector to get osd_perf_infos in versions 14.2
and higher.
* Removed extraneous commas from chart information in dashboard.
* Removed tmem collection from xenstat_plugin to allow Netdata
on Xen 4.13 to compile successfully.
* Restricted messages to Google Analytics.
* Fixed Python 3 dict access in OpenLDAP collector module.
OBS-URL: https://build.opensuse.org/request/show/792974
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=16
|
|||
| 5f1cca7857 | OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=50 | |||
| ce918550fa |
Accepting request 791781 from home:polslinux:branches:network
- Update to v1.21.0 (go.d.plugin v0.18.0): Improvements * Extended TLS support for 1.3. (#8505) by thiagoftsm * Switched to the React dashboard code as the default dashboard. (#8363) by Ferroin Collectors * Added a new Pulsar collector. (#8364) by ilyam8 * Added a new VerneMQ collector. (#8236) by ilyam8 * Added high precision timer support for plugins such as idlejitter. (#8441) by mfundul * Added an alarm to the dns_query collector that detects DNS query failure. (#8434) by ilyam8 * Added the ability to get the pod name from cgroup with kubectl in bare-metal deployments. (#7416) by WoozyMasta * Added the ability to connect to non-admin user IDs for a Ceph storage cluster. (#8276) by paulmezz * Added connections (backend) usage to Postgres monitoring. (#8126) by ManuelPombo * eBPF: Added support for additional Linux kernels found in Debian 10.2 and Ubuntu 18.04. (#8192) by thiagoftsm Exporting * Added a MongoDB connector to the exporting engine. (#8416) by vlvkobal * Added a Prometheus Remote Write connector to the exporting engine. (#8292) by vlvkobal * Added an AWS Kinesis connector to the exporting engine. (#8145) by vlvkobal Bug fixes * Removed notifications from the dashboard and fixed the /default.html route. (#8599 by jacekkolasa * Fixed help-tooltips styling, private registry node deletion, and the right-hand sidebar "jumping" on document clicks. (#8553 by jacekkolasa * Fixed errors reported by Coverity. (#8593) by thiagoftsm, (#8579) by amoss, and (#8586) by thiagoftsm * Added netdata.service.* to .gitignore to hide system/netdata.service.v235 file. (#8556) by vlvkobal * Fixed Debian 8 (Jessie) support. (#8590) and (#8593) by prologic * Fixed broken Fedora 30/31 RPM builds. (#8572) by prologic * Fixed broken pipe ignoring in apps.plugin. (#8554) by vlvkobal * Fixed the bytespersec chart context in the Python Apache collector. (#8550) by ilyam8 * Fixed charts.d.plugin to exit properly during Netdata service restart. (#8529) by ilyam8 * Fixed minimist dependency vulnerability. (#8537) by jacekkolasa * Fixed our Debian/Ubuntu packages to package the expected systemd unit files. (#8468) by prologic * Fixed auto-updates for static (kickstart-static64.sh) installs. (#8507) by prologic * Fixed openSUSE 15.1 RPM package builds. (#8494) by prologic * Fixed how SimpleService truncates Python module names. (#8492) by ilyam8 * Removed erroneous \n in uninstaller output. (#8446) by prologic * Fixed install-required-packages script to self-update apt. (#8491) by prologic * Added proper prefix to Python module names during loading. (#8474) by ilyam8 * Fixed how the Netdata updater script cleans up after being run. (#8414) by prologic * Fixed the flushing error threshold with the database engine. (#8425) by mfundul * Fixed memory leak for host labels streaming from slaves to master. (#8460) by thiagoftsm * Fixed support for uninstalling the eBPF collector in the uninstaller. (#8444) by prologic * Fixed a bug involving stop_all_netdata uv_pipe_connect() in the installer. (#8444) by prologic * Fixed installer output regarding newlines. (#8447) by prologic * Fixed broken dependencies for Ubuntu 19.10. (#8397) by prologic * Fixed streaming scaling. (#8375) by mfundul * Fixed missing characters in kernel version field by encoding slave fields. (#8216) by thiagoftsm * Fixed installation for Ubuntu 14.04 (#7690) by Ehekatl * Fixed dependencies for Debian Jessie. (#8290) by Ferroin * Fixed dependency names for Arch Linux. (#8334) by Ferroin * Removed extra printed \n in various installers. (#8324)/(#8325)/(#8326) by Jiab77 * Fixed missing folders in /var/ for .deb packages. (#8314) by SamK * Fixed Ceph collector to get osd_perf_infos in versions 14.2 and higher. (#8248) by ilyam8 * Fixed RHEL / CentOS 8.x dependencies for Judy-devel and others.(#8202) by prologic * Removed extraneous commas from chart information in dashboard. (#8266) by FlyingSixtySix * Removed tmem collection from xenstat_plugin to allow Netdata on Xen 4.13 to compile successfully. (#7951) by rushikeshjadhav * Fixed get_latest_version for nightly channel update script. (#8172) by ilyam8 * Restricted messages to Google Analytics. (#8161) by thiagoftsm * Fixed Python 3 dict access in OpenLDAP collector module. (#8162) by Mic92 OBS-URL: https://build.opensuse.org/request/show/791781 OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=49 |
|||
| 3e7eb9d128 |
Accepting request 779690 from network
- Update to v1.20.0 (go.d.plugin v0.15.0)
Breaking Changes:
* Removed deprecated bash collectors apache, cpu_apps, cpufreq,
exim, hddtemp, load_average, mem_apps, mysql, nginx, phpfpm,
postfix, squid, tomcat. If you were still using one of these
collectors with custom configurations, you can find the new
collector that replaces it in the supported collectors list.
Improvements:
* Host labels:
+ Added support for host labels
+ Improved the monitored system information detection.
Added CPU freq & cores, RAM and disk space.
+ Started distinguishing the monitored system's (host)
OS/Kernel etc. from those of the docker container's
+ Started creating host labels from collected system info
+ Started passing labels and container environment variables
via the streaming protocol
+ Started sending host labels via exporting connectors
+ Added label support to alarm definitions and started
recording them in alarm logs
+ Added support for host labels to the API responses
+ Added configurable host labels to netdata.conf
+ Added kubernetes labels
* New collectors:
+ eBPF kernel collector
+ CockroachDB
+ squidlog: squid access log parser
* Collector improvements:
+ apps.plugin: Created dns group, improved database group,
improved ceph & samba groups
+ varnish: Added SMF metrics (cache on disk)
+ phpfpm: Fixed per process chart titles and readme
+ python.d: Formatted the code in all modules
+ node.d/snmp: - Added snmpv3 support, formatted the code in
snmp.node.js
+ cgroups: Improved LXC monitoring by filtering out irrelevant
LXC cgroups
+ litespeed: Added support for different .rtreport format
+ proc.plugin: Added pressure stall information
+ sensors: Improved collection logic
+ proc: Started monitoring network interface speed, duplex,
operstate
+ smartd_log: Fixed the setting in the reallocated sectors
count, by setting ATTR5 chart algorithm to absolute
+ nvidia-smi: Allow executing nvidia-smi in normal instead of
loop mode
+ wmi: collect logon metrics, collect logical_disk disk
latency metrics
+ weblog: handle MKCOL, PROPFIND, MOVE, SEARCH http request
methods
+ scaleio: storage pools and sdcs metrics
* Exporting engine:
+ Implemented the main flow for the Exporting Engine
* Streaming:
+ Add versioning to the streaming protocol
* Installation/Packages:
+ Improved the systemd service files, by removing unecessary
ExecStartPre lines and moving global options to netdata.conf
* Privacy:
+ Added support for opting out of telemetry via the
DO_NOT_TRACK environment variable (telemetry is disabled by
default on openSUSE)
* Other:
+ Preparations for the next netdata cloud release. Added custom
libmosquitto, netdata-cli and other prerequisites.
+ Applied linter fixes in shell scripts
+ Started supporting -fno-common in CFLAGS
+ Completely removed the unbound python collector (dead code)
+ Added possibility to change badges' text font color
+ Small updates to sample multi-host dashboard, dash.html
+ Added missing quoting in shell scripts
+ Bump handlebars from 4.2.0 to 4.5.3
+ Reduce log level for uv_thread_set_name_np from error to info
+ Set standard name to non-libnetdata threads (libuv, pthread)
* Bug fixes:
+ Fixed problems reported by Coverity for eBPF collector plugin
+ Fixed invalid literal for float\(\): NN.NNt error in the
elasticsearch python plugin, by adding terabyte unit parsing
+ Fixed timeout failing in docker containers which broke some
python.d collectors
+ Fixed problem with the httpcheck python collector not being
able to check URLs with the POST method, by adding body to
the URLService. Also record the new options in httpcheck.conf
+ Fixed dbengine repeated global flushing errors and collectors
being blocked, by dropping dirty dbengine pages if the disk
cannot keep up
+ Fixed issue with alarm notifications occasionally ignoring
the configured severity filter when the ROLE was set to root
+ Fixed Netlink Connection Tracker charts in the nfacct plugin
+ Fixed metric values sent via remote write to Prometheus
backends, when using average/sum
+ Fixed unclosed brackets in softnet alarm
+ Fixed SEGFAULT when localhost initialization failed
+ Reduced the number of broken pipe error log entries,
after a SIGKILL
+ Fixed Coverity errors by restoring support for protobuf 3.0
+ Fixed inability to disable Prometheus remote API
+ Fixed SEGFAULT from the cpuidle plugin
+ Fixed samba collector not working, due to inability to run
sudo
+ Fixed invalid css/js resource errors when URL for slave node
has no final / on streaming master
+ Fixed keys_redis chart in the redis collector, by populating
keys at runtime
+ Fixed UrlService bytes decoding and logger unicode encoding
in the python.d plugin
+ Fixed a warning in the prometheus remote write backend
+ Fixed not detecting more than one adapter in the hpssa
collector
+ Fixed race conditions in dbengine
+ Fixed race condition with the dbenging page cache descriptors
+ Fixed dbengine dirty page flushing warning
+ Fixed missing parenthesis on alarm softnet.conf
+ Fixed "Master thread EXPORTING takes too long to exit.
Giving up" error, by cleaning up the main exporting engine
thread on exit
+ Fixed rabbitmq error "update() unhandled exception: invalid
literal for int() with base 10"
+ Fixed some LGTM alerts
+ Fixed valgrind errors
+ Fixed monit collector LGTM warnings
+ Fixed the following go.d.plugin collector issues:
. mysql: panic in Cleanup (#326)
. unbound: gather metrics via unix socket (#319)
. logstash: pipelines chart (#317)
. unbound: configuration file parsing.
. Support include mechanism. (#298)
. logstash: pipelines metrics parsing (#293)
. phpfpm: processes metrics parsing (#297)
OBS-URL: https://build.opensuse.org/request/show/779690
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=15
|
|||
|
|
3cf8053b75 |
- Update to v1.20.0 (go.d.plugin v0.15.0)
Breaking Changes:
* Removed deprecated bash collectors apache, cpu_apps, cpufreq,
exim, hddtemp, load_average, mem_apps, mysql, nginx, phpfpm,
postfix, squid, tomcat. If you were still using one of these
collectors with custom configurations, you can find the new
collector that replaces it in the supported collectors list.
Improvements:
* Host labels:
+ Added support for host labels
+ Improved the monitored system information detection.
Added CPU freq & cores, RAM and disk space.
+ Started distinguishing the monitored system's (host)
OS/Kernel etc. from those of the docker container's
+ Started creating host labels from collected system info
+ Started passing labels and container environment variables
via the streaming protocol
+ Started sending host labels via exporting connectors
+ Added label support to alarm definitions and started
recording them in alarm logs
+ Added support for host labels to the API responses
+ Added configurable host labels to netdata.conf
+ Added kubernetes labels
* New collectors:
+ eBPF kernel collector
+ CockroachDB
+ squidlog: squid access log parser
* Collector improvements:
+ apps.plugin: Created dns group, improved database group,
improved ceph & samba groups
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=47
|
||
| 916540f7e3 |
Accepting request 765606 from network
- Also package go.d.plugin (v0.14.1)
- Update to v1.19.0
Improvements:
* New collectors:
+ AM2320 sensor collector plugin
+ Added parsing of /proc/pagetypeinfo to provide metrics on
fragmentation of free memory pages
+ The unbound collector module was completely rewritten, in Go
* Collector improvements:
+ We rewrote our web log parser in Go, drastically improving
its flexibility and performance
+ The Kubernetes kubelet collector now reads the service
account token and uses it for authorization. We also added a
new default job to collect metrics from
https://localhost:10250/metrics
+ Added a new default job to the Kubernetes coredns collector
to collect metrics from
http://kube-dns.kube-system.svc.cluster.local:9153/metrics
+ apps.plugin: Synced FRRouting daemons configuration with the
frr 7.2 release
+ apps.plugin: Added process group for git-related processes
+ apps.plugin: Added balena to the container-engines
application group
+ web_log: Treat 401 Unauthorized requests as successful
+ xenstat.plugin: Prepare for xen 4.13 by checking for check
xenstat_vbd_error presence
+ mysql: Added galera cluster_status alarm
* Metrics database:
+ Netdata generates alarms if the disk cannot keep up with
data collection
* Health:
+ Fine tune various default alarm configurations
+ Update SYN cookie alarm to be less aggressive
+ Added support for IRC alarm notifications
Bug fixes:
* Prevented freezes due to isolated CPUs
* Fixed missing streaming when slave has SSL activated
* Fixed error 421 in IRC notifications, by removing a line break
from the message
* proc/pagetypeinfo collection could under particular
circumstances cause high CPU load. As a workaround,
we disabled pagetypeinfo by default
* Fixed incorrect memory allocation in proc plugin’s
pagetypeinfo collector
* Eliminated cached responses from the postgres collector
* rabbitmq: Fixed "disk_free": "disk_free_monitoring_disabled"
error
* Fixed Apache module not working with letsencrypt certificate
by allowing the python UrlService to skip tls_verify for http
scheme
* Fixed invalid spikes appearing in certain charts, by improving
the incremental counter reset/wraparound detection algorithm
* Fixed issue with unknown variables in alarm configuration
expressions always being evaluated to zero
* Fixed issue of automatically picking up Pi-hole stats from a
Pi-hole instance installed on another device by disabling the
default job that collects metrics from http://pi.hole
OBS-URL: https://build.opensuse.org/request/show/765606
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=14
|
|||
|
|
e62fa64b06 | OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=45 | ||
|
|
8a2c221fb4 | OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=44 | ||
|
|
2d00dc7edf | OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=43 | ||
|
|
fd25c57bf7 | OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=42 | ||
|
|
e3f59e33a7 |
- Also package go.d.plugin
- Update to v1.19.0
Improvements:
* New collectors:
+ AM2320 sensor collector plugin
+ Added parsing of /proc/pagetypeinfo to provide metrics on
fragmentation of free memory pages
+ The unbound collector module was completely rewritten, in Go
* Collector improvements:
+ We rewrote our web log parser in Go, drastically improving
its flexibility and performance
+ The Kubernetes kubelet collector now reads the service
account token and uses it for authorization. We also added a
new default job to collect metrics from
https://localhost:10250/metrics
+ Added a new default job to the Kubernetes coredns collector
to collect metrics from
http://kube-dns.kube-system.svc.cluster.local:9153/metrics
+ apps.plugin: Synced FRRouting daemons configuration with the
frr 7.2 release
+ apps.plugin: Added process group for git-related processes
+ apps.plugin: Added balena to the container-engines
application group
+ web_log: Treat 401 Unauthorized requests as successful
+ xenstat.plugin: Prepare for xen 4.13 by checking for check
xenstat_vbd_error presence
+ mysql: Added galera cluster_status alarm
* Metrics database:
+ Netdata generates alarms if the disk cannot keep up with
data collection
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=41
|
||
| 689f59acc8 |
Accepting request 741631 from network
- Update to v1.18.1
Improvements:
* Disable slabinfo plugin by default to reduce the total number
of metrics collected
* Add dbengine RAM usage statistics
* Support Google Hangouts chat notifications
* Add CMocka unit tests
* Add prerequisites to enable automatic updates for installations
via the static binary (kickstart-static64.sh)
Bug fixes:
* Fix unbound collector timings: Convert recursion timings to
milliseconds.
* Fix unbound collector unhandled exceptions
* Fix megacli collector binary search and sudo check
* Fix Clang warnings
* Fix python.d error logging: change chart suppress msg level
from ERROR to INFO
* Fix freeipmi update frequency check: was warning that 5 was too
frequent and it was setting it to 5.
* Fix alarm configurations not getting loaded, via better
handling of chart names with special characters
* Don't write HTTP response 204 messages to the logs
* Fix build when CMocka isn't installed
* Prevent zombie processes when a child is re-parented to netdata
when its running in a container, by adding child process reaper
- Update to v1.18.0
Improvements:
* Database engine:
+ Make dbengine the default memory mode
+ Increase dbengine default cache size
OBS-URL: https://build.opensuse.org/request/show/741631
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=13
|
|||
|
|
1f4220058f |
- Update to v1.18.1
Improvements:
* Disable slabinfo plugin by default to reduce the total number
of metrics collected
* Add dbengine RAM usage statistics
* Support Google Hangouts chat notifications
* Add CMocka unit tests
* Add prerequisites to enable automatic updates for installations
via the static binary (kickstart-static64.sh)
Bug fixes:
* Fix unbound collector timings: Convert recursion timings to
milliseconds.
* Fix unbound collector unhandled exceptions
* Fix megacli collector binary search and sudo check
* Fix Clang warnings
* Fix python.d error logging: change chart suppress msg level
from ERROR to INFO
* Fix freeipmi update frequency check: was warning that 5 was too
frequent and it was setting it to 5.
* Fix alarm configurations not getting loaded, via better
handling of chart names with special characters
* Don't write HTTP response 204 messages to the logs
* Fix build when CMocka isn't installed
* Prevent zombie processes when a child is re-parented to netdata
when its running in a container, by adding child process reaper
- Update to v1.18.0
Improvements:
* Database engine:
+ Make dbengine the default memory mode
+ Increase dbengine default cache size
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=39
|
||
| e776956add |
Accepting request 728171 from network
- Update to v1.17.0
Improvements:
* Database engine:
+ Variable granularity support for data collection
+ Added tips on the UI to encourage users to try the new DB
Engine, when they reach the end of their metrics history
* Health:
+ Added support for plain text only email notifications
+ Started showing “hidden” alarm variables in the responses of
the chart and data API calls
+ Added a new API call for alarm status counters, as a first
step towards badges that will show the total number of alarms
* Security:
+ Added configurable default locations for trusted CA
certificates
+ Added safer way to get container names
+ Added SSL connection support to the python mongodb collector
* New collectors:
+ VSphere collector
* Archiving:
+ Added a new MongoDB backend
* Other:
+ Added apps grouping debug messages
+ GCC warning and linting improvements
+ Added global configuration option to show charts with zero
metrics
+ Improved the way we parse HTTP requests, so we can avoid
issues from edge cases
* Bug fixes:
+ Fixed sensor chips filtering in python sensors collector
OBS-URL: https://build.opensuse.org/request/show/728171
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=12
|
|||
|
|
6aa8e2818e |
- Update to v1.17.0
Improvements:
* Database engine:
+ Variable granularity support for data collection
+ Added tips on the UI to encourage users to try the new DB
Engine, when they reach the end of their metrics history
* Health:
+ Added support for plain text only email notifications
+ Started showing “hidden” alarm variables in the responses of
the chart and data API calls
+ Added a new API call for alarm status counters, as a first
step towards badges that will show the total number of alarms
* Security:
+ Added configurable default locations for trusted CA
certificates
+ Added safer way to get container names
+ Added SSL connection support to the python mongodb collector
* New collectors:
+ VSphere collector
* Archiving:
+ Added a new MongoDB backend
* Other:
+ Added apps grouping debug messages
+ GCC warning and linting improvements
+ Added global configuration option to show charts with zero
metrics
+ Improved the way we parse HTTP requests, so we can avoid
issues from edge cases
* Bug fixes:
+ Fixed sensor chips filtering in python sensors collector
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=37
|
||
| 6acaf72731 |
Accepting request 714120 from network
- Update to v1.16.0
Improvements:
* Health:
+ Easily disable alarms, by persisting the silencers
configuration
+ Repeating alarm notifications
+ Simplified the health cmdapi tester - no setup/cleanup needed
+ Αdd last_collected alarm to the x509check collector
+ New alarm for abnormally high number of active processes.
* Security:
+ SSL support in the web server and streaming/replication
+ Support encrypted connections to OpenTSDB backends
* New collectors:
+ Go.d collector modules for WMI, Dnsmasq DHCP leases
and Pihole
+ Riak KV instances collector
+ CPU performance statistics using Performance Monitoring Units
via the perf_event_open() system call. (perf plugin)
* Collector improvements:
+ Handle different sensor IDs for the same element in the
freeipmi plugin
+ Increase the cpu_limit chart precision in cgroup plugin
+ Added userstats and deadlocks charts to the python mysql
collector
+ Add perforce server process monitoring to the apps plugin
* Backends:
+ Prometheus remote write backend
* DB engine improvements:
+ Reduced memory requirements by 40-50%
+ Reduced the number of pages needed to be stored and indexed
when using memory mode = dbengine, by adding empty page
detection
* Rebranding:
+ Updated the netdata logo and changed links to point to the
new website
* Other:
+ Pass the the cloud base url parameter to the notifications
mechanism, so that modifications to the configuration are
respected when creating the link to the alarm
+ Improved logging, to be able to trace the
CRITICAL: main[main] SIGPIPE received. error
Bug fixes:
* Fixed ram_available alarm
* Stop monitoring /dev and /run in the disk space and inode
usage charts
* Fixed the monitoring of the “time” group of processes
* Fixed compilation error PERF_COUNT_HW_REF_CPU_CYCLES'
undeclared here in old Linux kernels (perf plugin)
* Fixed invalid XML page error (tomcat plugin)
* Remove obsolete monit metrics
* Fixed Failed to parse error in adaptec_raid
* Fixed cluster_health_nodes and cluster_stats_nodes charts in
the elasticsearch collector
* A modified slave chart's "name" was not properly transferred
to the master
* Netdata could run out of file descriptors when using the
new DB engine
* Fixed UI behavior when pressing the End key
* Fixed UI link to check the configuration file,
to open in a new tab
* Prevented Error: 'module' object has no attribute 'Retry'
messages from python collectors, by enforcing minimum version
check for the UrlService library
* Fixed typo that causes nfacct.plugin log messages to
incorrectly show freeipmi
* The daemon could get stuck during collection or during
shutdown, when using the new dbengine. Reduced new dbengine IO
utilization by forcing page alignment per dimension of chart.
* Properly handle timeouts/no response in dns_query_time
python collector
* When a collector restarted after having stopped for a long
time, the new dbengine would consume a lot of CPU resources.
* Fixed error Assertion old_state & PG_CACHE_DESCR_ALLOCATED'
failed` of the new dbengine. Eliminated a page cache descriptor
race condition
* tv.html failed to load the three left charts when accessed via
https. Turn tv.html links to https
* Change print level from error to info for messages about
clearing old files from the database
* Fixed warning regarding the x509check_last_collected_secs
alarms. Changed the template update frequency to 60s, to match
the chart’s update frequency
* Email notification header lines were not terminated with \r\n
as per the RFC
* Some log entries would not be caught by the python web_log
plugin. Fixed the regular expressions
* Corrected the date used in pushbullet notifications
* Fixed FATAL error when using the new dbengine with no direct
I/O support, by falling back to buffered I/O
* Fixed compatibility issues with varnish v4 (varnish collector)
* The total number of disks in mdstat.XX_disks chart was
displayed incorrectly.
Fixed the "inuse" and "down" disks stacking.
* The config option --disable-telemetry was being checked after
restarting netdata, which means that we would still send
anonymous statistics the first time netdata was started.
(NOTE: Telemetry is disabled by default on openSUSE.)
* Fixed apcupsd collector errors, by passing correct info to the
run function.
* apcupsd and libreswan were not enabled by default
* Fixed incorrect module name: energi to energid
* The nodes view did not work properly when a reverse proxy was
configured to access netdata via paths containing subpaths
(e.g. myserver/netdata)
* Fix error message PLUGINSD : cannot open plugins directory
* Corrected invalid links to web_log.conf that appear on the
agent UI
* Fixed ScaleIO collector endpoint paths
* Fixed web client timeout handling in the go.d plugin httpcheck
collector
OBS-URL: https://build.opensuse.org/request/show/714120
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=11
|
|||
|
|
8836025eed |
- Update to v1.16.0
Improvements:
* Health:
+ Easily disable alarms, by persisting the silencers
configuration
+ Repeating alarm notifications
+ Simplified the health cmdapi tester - no setup/cleanup needed
+ Αdd last_collected alarm to the x509check collector
+ New alarm for abnormally high number of active processes.
* Security:
+ SSL support in the web server and streaming/replication
+ Support encrypted connections to OpenTSDB backends
* New collectors:
+ Go.d collector modules for WMI, Dnsmasq DHCP leases
and Pihole
+ Riak KV instances collector
+ CPU performance statistics using Performance Monitoring Units
via the perf_event_open() system call. (perf plugin)
* Collector improvements:
+ Handle different sensor IDs for the same element in the
freeipmi plugin
+ Increase the cpu_limit chart precision in cgroup plugin
+ Added userstats and deadlocks charts to the python mysql
collector
+ Add perforce server process monitoring to the apps plugin
* Backends:
+ Prometheus remote write backend
* DB engine improvements:
+ Reduced memory requirements by 40-50%
+ Reduced the number of pages needed to be stored and indexed
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=35
|
||
| 2a74e9e057 |
Accepting request 705267 from network
- Update to v1.15.0
Bug Fixes:
* Prowl notifications were not being sent, unless another
notification method was also active
* Fix exception handling in the python.d plugin
* The node applications group did not include all node processes.
* The nvidia_smi collector displayed incorrect power usage
* The python.d plugin would sometimes hang, because it lacked
a connect timeout
* The mongodb collector raised errors due to various KeyErrors
* The smartd_log collector would show incorrect temperature
values
Improvements:
* Support for aggregate node view
* Database engine
* New collector modules:
+ Go.d collectors for OpenVPN, the Tengine web server and
ScaleIO (VxFlex OS) instances
+ Monitor disk access latency like ioping does
* Energi Core daemon monitoring, suits other Bitcoin forks
* Collector improvements:
+ Add docker swarm manager metrics to the go.d docker_engine
collector
+ Implement unified cgroup cpu limit
+ python.d.plugin: Allow monitoring of HTTP(S) endpoints which
require POST data and make the UrlService more flexible
* Support the AWS Kinesis backend for long-term storage
* Add a new "text-only" chart renderer
* API Improvements:
+ Smarter caching of API calls. Do not cache alarms and info
api calls and extend no-cache headers.
+ Extend the api/v1/info call response with system and
collector information
+ k6 script for API load testing
* Kubernetes helmchart improvements:
+ Added the init container, where sysctl params could be
managed, to bypass the Cannot allocate memory issue
+ Better startup/shutdown of slaves and reduced memory usage
with liveness/readiness probes and default memory mode none
+ Added the option of overriding the default settings for
kubelet, kubeproxy and coredns collectors via values.yaml
+ Make the use of persistent volumes optional, add apiVersion
to fix linting errors and correct the location of the env
field
- Update to v1.14.0
The release introduces major additions to Kubernetes monitoring,
with tens of new charts for Kubelet, kube-proxy and coredns
metrics, as well as significant improvements to the netdata
helm chart.
Two new collectors were added, to monitor Docker hub and Docker
engine metrics.
Finally, v1.14 adds support for version 2 cgroups, OpenLDAP
over TLS, NVIDIA SMI free and per process memory and configurable
syslog facilities.
Bug Fixes:
* Fixed problem autodetecting failed jobs in python.d plugin.
It now properly restarts jobs that are being rechecked,
as soon as they are able to run.
* CouchdDB monitoring would stop sometimes with an exception.
Fixed the unhandled exception causing the issue.
* The netdata api deliberately returned http error 400 when
netdata ran in memory mode none. Modified the behavior to
return responses, regardless of the memory mode
* The python.d plugin sometimes does not receive SIGTERM when
netdata exits, resulting in zombie processes. Added a heartbeat
so that the process can exit on SIGPIPE.
* The new SMS Server Tools notifications did not handle errors
well, resulting in cryptic error messages.
Improved error handling.
* Fix segmentation fault in streaming, when two dimensions had
similar names.
* Kubernetes Helm Chart: Fixed incorrect use of namespaces in
ServiceAccount and ClusterRoleBinding RBAC fixes.
* Elastic search: The option to enable HTTPS was not included in
the config file, giving the erroneous impression that HTTPS was
not supported. The option was added.
* RocketChat notifications were not being sent properly.
Added default recipients for roles in the health alarm
notification configuration.
Improvements:
* go.d.plugin v0.4.0 : Docker Hub and k8s coredns collectors,
springboot2 URI filters support.
* go.d.plugin v0.3.1 : Add default job to run k8s_kubelet.conf,
k8s_kubeproxy, activemq modules
* go.d.plugin v0.3.0 : Docker engine, kubelet and kub-proxy
collectors. x509check module reading certs from file support
* Added unified cgroup support that includes v2 cgroups
* Disk stats: Added preferred disk id pattern, so that users can
see the id they prefer, when multiple ids appear for the same
device
* NVIDIA SMI: Added memory free and per process memory usage
charts to the collector
* OpenLDAP: Added TLS support, to allow monitoring of LDAPS.
* PHP-FPM: Add health check to raise alarms when the phpfm server
is unreachable
* PostgreSQL: Our configuration options to connect to a DB did
not support all possible option. Added option to connect to a
PostreSQL instance by defining a connection string (URI).
* python.d.plugin: There was no way to delete obsolete dimensions
in charts created by the python.d plugin. The plugin can now
delete dimension at runtime.
* netdata supports sending its logs to Syslog, but the facility
was hard-coded. We now support configurable Syslog facilities
in netdata.conf.
* Kubernetes Helm Chart improvements:
+ Added serviceName in statefulset spec to align with the k8s
documentation
+ Added preStart command to persist slave machine GUIDs,
so that pod deletion/addition during upgrades doesn't lose
the slave history.
+ Disabled non-essential master netdata collector plugins to
avoid duplicate data
+ Added preStop command to wait for netdata to exit gracefully
before removing the container
+ Extended configuration file support to provide more control
from the helm command line
+ Added option to disable Role-based access control
+ Added liveness and readiness probes.
- Update to v1.13.0
netdata has taken the first step into the world of Kubernetes,
with a beta version of a Helm chart for deployment to a k8s
cluster and proper naming of the cgroup containers.
We have big plans for Kubernetes, so stay tuned!
A major refactoring of the python.d plugin has resulted in a
dramatic decrease of the required memory, making netdata even
more resource efficient.
We also added charts for IPC shared memory segments and total
memory used.
Improvements:
* Kubernetes: Helm chart and proper cgroup naming
* python.d.plugin: Reduce memory usage with separate process
for initial module checking and loaders cleanup
* IPC shared memory charts
* mysql module add ssl connection support
* FreeIPMI: Have the debug option apply the internal freeipmi
debug flags
* Prometheus backend: Support legacy metric names for source=avg
* Registry: Allow deleting the host we are looking at
* SpigotMC: Use regexes for parsing.
Bug Fixes:
* Postgres: fix connection issues
* Proxmox container: Fix cgroup naming and use total_* memory
counters for cgroups
* proc.plugin and plugins.d: Fix memory leaks
* SpigotMC: Fix UnicodeDecodeError and py2 compatibility fix
* Fix non-obsolete dimension deletion
* UI: Fix incorrect icon for the streaming master
* Docker container names: Retry renaming when a name is not found
* apps.plugin: Don't send zeroes for empty process groups
* go.d.plugin: Correct sha256sum check
* Unbound module: Documentation corrected with troubleshooting
section.
* Streaming: Prevent UI issues upon GUID duplication between
master and slave netdata instances
* Linux power supply module: Fix missing zero dimensions
* Minor fixes around plugin_directories initialization
- Update to v1.12.2
Bug Fixes:
* Zombie processes exist after restart netdata - add heartbeat
to python.d plugin
* RocketChat notifications not working
* SIGSEGV crash during shutdown of tc plugin
* CMake warning for nfacct plugin
Improvements:
* Oracledb python module
* Show streamed servers even for users that are not signed in
- Drop GPG signature (no longer used)
- Drop spec compatibility with old distro versions
- Drop netdata-automake-no-dist-xz.patch
- Refresh netdata-smartd-log-path.patch
- Update to v1.12.1
Fixes:
* Fix SIGSEGV at startup: Don't free vars of charts that do not
exist #5455
* Prevent invalid Linux power supply alarms during startup #5447
* Correct duplicate flag enum in health.h #5441
* Remove extra 'v' for netdata version from Server response
header #5440 and spec URL #5427
* apcupsd - Treat ONBATT status the same as ONLINE #5435
* Fix #5430 - LogService._get_raw_data under python3 fails on
undecodable data #5431
* Correct version check in UI #5429
* Fix ERROR 405: Cannot download charts index from server -
cpuidle handle newlines in names #5425
* Fix clock_gettime() failures with the CLOCK_BOOTTIME argument
#5415
* Use netnsid for detecting cgroup networks; #5413
* Python module sensors fix #5406
* Fix ceph.chart.py for Python3 #5396 (GaetanF)
* Fix warning condition for mem.available #5353
* cups.plugin: Support older versions #5350
Improvements:
* Add driver-type option to the freeipmi plugin #5384
* Add support of tera-byte size for Linux bcache. #5373
* Split nfacct plugin into separate process #5361
* Add cgroup cpu and memory limits and alarms #5172
* Add message queue statistics #5115
- Update to v1.12.0
Key improvements:
* Introducing netdata.cloud, the free netdata service for all
netdata users
* High performance plugins with go.d.plugin (data collection
orchestrator written in Go)
* 7 new data collectors and 11 rewrites of existing data
collectors for improved performance
* A new management API for all netdata servers
* Bind different functions of the netdata APIs to different ports
Management API:
Netdata now has a management API. We plan to provide a full set
of configuration commands using this API.
In this release, the management API supports disabling or
silencing alarms during maintenance periods.
For more information about the management API, check
https://docs.netdata.cloud/web/api/health/#health-management-api
Anonymous statistics:
Anonymous usage information can be collected and sent to Google
Analytics. This functionality is disabled by default in openSUSE.
Remove /etc/netdata/.opt-out-from-anonymous-statistics to enable.
The statistics calculated from this information will be used for:
1. Quality assurance, to help us understand if netdata behaves as
expected and help us identify repeating issues for certain
distributions or environment.
2. Usage statistics, to help us focus on the parts of netdata
that are used the most, or help us identify the extend our
development decisions influence the community.
Information is sent to Netdata via two different channels:
* Google Tag Manager is used when an agent's dashboard is
accessed.
* The script anonymous-statistics.sh is executed by the Netdata
daemon, when Netdata starts, stops cleanly, or fails.
Both methods are controlled via the same opt-out mechanism.
For more information, check
https://docs.netdata.cloud/docs/anonymous-statistics/
Data collection:
This release introduces a new Go plugin orchestrator.
This plugin has its own github repo
(https://github.com/netdata/go-orchestrator). It is open-source,
using the same license and we welcome contributions.
The orchestrator can also be used to build custom data collection
plugins written in Go. We have used the orchestrator to write
many new Go plugins in our go.d plugin github repo.
For more information, check
https://github.com/netdata/go-orchestrator#go-orchestrator-wip
New data collectors:
* Activemq (Go)
* Consul (Go)
* Lighttpd2 (Go)
* Solr (Go)
* Springboot2 (Go)
* mdstat - nonredundant arrays (C)
* CUPS printing system (C)
High performance versions of older data collectors:
* apache (Go)
* dns_query (Go)
* Freeradius (Go)
* Httpcheck (Go)
* Lighttpd (Go)
* Portcheck (Go)
* Nginx (Go)
* cpufreq (C)
* cpuidle (C)
* mdstat (C)
* power supply (C)
Other improved data collectors:
* Fix the python plugin clock (collectors falling behind).
* adaptec_raid: add to python.d.conf.
* apcupsd: Detect if UPS is online.
* apps: Fix process statistics collection for FreeBSD.
* apps: Properly lookup docker container name when running in ECS
* fail2ban: Add 'Restore Ban' action.
* go_expavar: Don't check for duplicate expvars.
* hddtemp: Don't use disk model as dim name.
* megacli: add to python.d.conf.
* nvidia_smi: handle N/A values.
* postgres: Fix integer out of range error on Postgres 11,
fix locks count.
* proc: Don't show zero charts for ZFS filesystem.
* proc; Fix cached memory calculation.
* sensors: Don't ignore 0 RPM fans on start.
* smartd_log: check() unhandled exception: list index out of
range.
* SNMP: Gracefully ignore the offset if the value is not a number
Health Monitoring:
* Add Prowl notifications for iOS users.
* Show count of active alarms per state in email notifications.
* Show evaluated expression and expression variable values in
email notifications.
* Improve support for slack recipients (channels/users).
* Custom notifications: Fix bug with alarm role recipients.
Dashboards:
* Server filtering in my-netdata menu when signed in to
netdata.cloud
* All units are now IEC-compliant abbreviations (KiB, MiB etc.).
* GUI: Make entire row clickable in the registry menu showing
the list of servers.
Backends:
* Do not report stale metrics to prometheus.
Other:
* Treat DT_UNKNOWN files as regular files.
* API: Stricter rules for URL separators.
- Update to v1.11.1
Improved internal database:
Overflown incremental values (counters) do not show a zero point
at the charts. Netdata detects the width (8bit, 16bit, 32bit,
64bit) of each counter and properly calculates the delta when
the counter overflows.
The internal database format has been extended to support values
above 64bit.
New data collection plugins:
* openldap, to collect performance statistics from OpenLDAP
servers.
* tor, to collect traffic statistics from Tor.
* nvidia_smi to monitor NVIDIA GPUs.
Improved data collection plugins:
* BUG FIX: network interface names with colon (:) in them were
incorrectly parsed and resulted in faulty data collection
values.
* BUG FIX: smartd_log has been refactored, has better python v2
compatibility, and now supports SCSI smart attributes
* cpufreq has been re-written in C - since this module if common,
we decided to convert to an internal plugin to lower the
pressure on the python ones. There are a few more that will be
transitioned to C in the next release.
* BUG FIX: sensors got some compatibility fixes and improved
handling for lm-sensors errors.
Health monitoring:
* BUG FIX: max network interface speed data collection was
faulty, which resulted in false-positive alarms on systems with
multiple interfaces using different speeds (the speed of the
first network interface was used for all network interfaces).
Now the interface speed is shown as a badge.
* alerta.io notifications got a few improvements
* BUG FIX: conntrack_max alarm has been restored (was not working
due to an invalid variable name referenced)
Registry (my-netdata menu):
* It has been refactored a bit to reveal the URLs known for each
node and now it supports deleting individual URLs.
- Update to 1.11.0
* Stock config files are now in /usr/lib/netdata; use the
/etc/netdata/edit-config script to copy and edit them.
* The query engine of netdata has been re-written to support
query plugins. We have already added the following algorithms
that are available for alarm, charts and badges:
+ stddev, for calculating the standard deviation on any
time-frame.
+ ses or ema or ewma, for calculating the exponential weighted
moving average, or single/simple exponential smoothing on
any time-frame.
+ des, for calculating the double exponential smoothing on any
time-frame.
+ cv or rsd, for calculating the coefficient of variation for
any time-frame.
Fixed security issues:
* CVE-2018-18836 Fixed JSON Header Injection (an attacker could
send \n encoded in the request to inject a JSON fragment into
the response).
* CVE-2018-18837 Fixed HTTP Header Injection (an attacker could
send \n encoded in the request to inject an HTTP header into
the response).
* CVE-2018-18838 Fixed LOG Injection (an attacker could send \n
encoded in the request to inject a log line at access.log).
* CVE-2018-18839 Not fixed Full Path Disclosure, since these are
intended (netdata reports the absolute filename of web files,
alarm config files and alarm handlers).
* Fixed Privilege Escalation by manipulating apps.plugin or
cgroup-network error handling.
* Fixed LOG injection (by sending URLs with \n in them).
New data collection modules:
* rethinkdbs for monitoring RethinkDB performance
* proxysql for monitoring ProxySQL performance
* litespeed for monitoring LiteSpeed web server performance.
* uwsgi for monitoring uWSGI performance
* unbound for monitoring the performance of Unbound DNS servers.
* powerdns for monitoring the performance of PowerDNS servers.
* dockerd for monitoring the health of dockerd
* puppet for monitoring Puppet Server and Puppet DB.
* logind for monitoring the number of active users.
* adaptec_raid and megacli for monitoring the relevant raid
controller
* spigotmc for monitoring minecraft server statistics
* boinc for monitoring Berkeley Open Infrastructure Network
Computing clients.
* w1sensor for monitoring multiple 1-Wire temperature sensors.
* monit for collecting process, host, filesystem, etc checks
from monit.
* linux_power_supplies for monitoring Linux Power Supplies
attributes
Data collection orchestrators changes:
* node.d.plugin does not use the js command any more.
* python.d.plugin now uses monotonic clocks. There was a
discrepancy in clocks used in netdata that resulted in a shift
in time of python module after some time (it was missing 1 sec
per day).
* added MySQLService for quickly adding plugins using mysql
queries.
* URLService now supports self-signed certificates and supports
custom client certificates.
* all python.d.plugin modules that require sudo to collect
metrics, are now disabled by default, to avoid security alarms
on installations that do not need them.
Improved data collection modules:
* apps.plugin now detects changes in process file descriptors,
also fixed a couple of memory leaks. Its default configuration
has been enriched significantly, especially for IoT.
* freeipmi.plugin now supports option ignore-status to ignore
the status reported by given sensors.
* statsd.plugin (for collecting custom APM metrics)
+ The charting thread has been optimized for lowering its CPU
consumption when several millions of metrics are collected.
+ sets now report zeros instead of gaps when no data are
collected
+ histograms and timers have been optimized for lowering their
CPU consumption to support several thousands of such metrics
are collected.
+ histograms had wrong sampling rate calculations.
+ gauges now ignore sampling rate when no sign is included in
the value.
+ the minimum sampling rate supported is now 0.001.
+ netdata statsd is now drop-in replacement for datadog statsd
(although statsd tags are currently ignored by netdata).
* proc.plugin (Linux, system monitoring)
+ Unused interrupts and softirqs are not used in charts
(this saves quite some processing power and memory on systems
with dozens of CPU cores).
+ fixed /proc/net/snmp parsing of IcmpMsg lines that failed on
a few systems.
+ Veritas Volume Manager disks are now recognized and named
accordingly.
+ Now netdata collects TcpExtTCPReqQFullDrop and re-organizes
metrics in charts to properly monitor the TCP SYN queue and
the TCP Accept queue of the kernel.
+ Many charts that were previously reported as IPv4, were
actually reflecting metrics for both IPv4 and IPv6.
They have been renamed to ip.*.
+ netdata now monitors SCTP.
+ Fixed BTRFS over BCACHE sector size detection.
+ BCACHE data collection is now faster.
+ /proc/interrupts and /proc/softirqs parsing fixes.
* diskspace.plugin (Linux, disk space usage monitoring)
+ It does not stat() excluded mount points any more (it was
interfering with kerberos authenticated mount points).
+ several filesystems are now by default excluded from
disk-space monitoring, to avoid breaking suspend on
workstations.
* python.d.plugin PYTHON modules (applications monitoring)
+ web_log module now supports virtual hosts, reports http/https
metrics, support squid logs
+ nginx_plus module now handles non-continuous peer IDs
+ ipfs module is optimized, the use of its Pin API is now
disabled by default and can enabled with a netdata module
option (using the IPFS Pin API increases the load on the
IPFS server).
+ fail2ban module now supports IPv6 too.
+ ceph module now checks permissions and properly reports
issues
+ elasticsearch module got better error handling
+ nginx_plus module now uses upstream ip:port instead of
transient id to identify dimensions.
+ redis, now it supports Pika, collects evited keys, fixes
authentication issues reported and improves exception
handling.
+ beanstalk, bug fix for yaml config loading.
+ mysql, the % of active connections is now monitored,
query types are also charted.
+ varnish, now it supports versions above 5.0.0
+ couchdb
+ phpfpm, now supports IPv6 too.
+ apache, now supports IPv6 too.
+ icecast
+ mongodb, added support for connect URIs
+ postgress
+ elasticsearch, now it supports versions above 6.3.0,
fixed JSON parse errors
+ mdstat , now collects mismatch_cnt
+ openvpn_log
* node.d.plugin NODE.JS modules
+ snmp was incorrectly parsing a new OID names as float.
* charts.d.plugin BASH modules
+ nut now supports naming UPSes.
Health monitoring:
* Added variable $system.cpu.processors.
* Added alarms for detecting abnormally high load average.
* TCP SYN and TCP accept queue alarms, replacing the old softnet
dropped alarm that was too generic and reported many false
positives.
* system alarms are now enabled on FreeBSD.
* netdata now reads NIC speed and sets alarms on each interface
to detect congestion.
* Network alarms are now relaxed to avoid false positives.
* New bcache alarms.
* New mdstat alarms.
* New apcupsd alarms.
* New mysql alarms.
* New notification methods:
+ rocket.chat
+ Microsoft Teams
+ syslog
+ fleep.io
+ Amazon SNS
Backends:
* Host tags are now sent to Graphite
* Host variables are now sent to Prometheus
Streaming:
* Each netdata slave and proxy now filter the charts that are
streamed. This allows exposing netdata masters to third parties
by limiting the number of charts available at the master.
* Fixed a bug in streaming slaves that randomly prevented them
to resume streaming after network errors.
* Fixed a bug that on slaves that sent duplicated chart names
under certain conditions.
* Fixed a bug that caused slaves to consume 100% CPU (due to a
misplaced lock) when multiple threads were adding dimensions
on the same chart.
* The receiving nodes of streaming (netdata masters and proxies)
can now rate-limit the rate of inbound streaming requests
received.
* Re-worked time synchronization between netdata slaves and
masters.
API:
* Badges that report time, now show "undefined" instead of
"never".
Dashboard:
* Added UTC timezone to the list of available time-zones.
* The dashboard was sending some non-HTTP compliant characters
at the URLs that made netdata dashboards break when used under
certain proxies.
- Remove _service
- Add GPG signature
- Use %license macro
- Update to v1.10.0
* new plugins:
- BTRFS - monitor the allocations of BTRFS filesystems (netdata can now
properly detect when btrfs is going out of space)
- BCACHE - monitor the caching block layer that allows building hybrid
disks using normal HDDs and SSDs
- Ceph - monitor ceph distributed storage
- nginx plus - monitor the nginx+ web servers
- libreswan - monitor IPSEC tunnels
- Traefik - monitor traefik reverse proxies
- icecast - monitor icecast streaming servers
- ntpd - monitor NTP servers
- httpcheck - monitor any remote web server
- portcheck - monitor any remote TCP port
- spring-boot - monitor java spring boot applications
- dnsdist - monitor dnsdist name servers
- hugepages - monitor the allocation of Linux hugepages
* enhanced/improved plugins:
- statsd
+ statsd dimensions now support the options the external plugin
dimensions support (currently the only usable option is hidden to add
the dimension, but make it hidden on the dashboard - a hidden
dimension can participate in various calculations, including alarms).
+ statsd now reports the CPU usage of its threads at the netdata
section.
+ statsd metrics are logged to access.log the first time they are
encountered.
+ statsd metrics now accept the special value zinit to allow them get
initialized without altering their values (this is useful if you have
rare metrics that you need to initialize when netdata starts).
+ statsd over TCP is now a lot faster - netdata can process up to 3.5mil
statsd metrics / second using just one core. Added options to control
the timeouts of TCP statsd connections.
+ fixed the title and context of statsd private charts
+ statsd private charts can now be hidden from the dashboard
- web_log
+ Added web server response timings histogram
- containers monitoring
+ netdata now monitors systemd-nspawn containers.
+ netdata now renames charts of kubernetes containers.
+ virsh is now called with -r to avoid prompting for password
+ cgroup-network is now a lot more strict, preventing unauthorized
privilege escalation
+ cgroup-network now searches for container processes in sub-cgroups
too - this improves the mapping of network interfaces to containers
+ cgroup-network now works even when there are no veth interfaces
in the system
- system memory
+ treat slab memory as cached
+ added a new chart for monitoring the memory available for use,
before hitting swap
+ netdata now monitors Linux hugepages and (transparent) hugepages
- diskspace
+ support huge amounts of mountpoints - netdata was crashing with stack
overflow due to recursion - now it is a loop, so any number of mount
points is supported
- network interfaces
+ moved tcp passive and active opens to a separate chart, to allow the
TCP issues dimensions scale better by default
+ updated the information presented on TCP charts to match the latest
v4.15 kernel source
- postgres
+ new charts:
checkpointer
bgwriter
autovacuum
replication delta
WAL archive
WAL
temporary files
+ The postgres plugin now also works when postgres is in recovery mode.
- rabbitmq
+ added Erlang run queue chart. This is useful in conjunction with the
existing Erlang processes chart to get a better overall idea of what's
going on in the Erlang VM.
+ added rabbitmq information on the dashboard to complement the charts.
- apps.plugin
netdata prior to this version was detecting the user and group of
processes by examining the ownership of /proc/PID/stat. Unfortunately it
seems that the owneship of files in /proc do not change when the process
switches user. So, netdata could not detect the user and group of
processes that started as root and then switched to another user.
Now netdata reads /proc/PID/status:
+ process ownship information is now accurate
+ eliminated the need to read /proc/PID/statm (all the information of
/proc/PID/statm is available in /proc/PID/status)
+ allowed netdata to read VmSwap, so a new chart has been added to
monitor the swap memory usage per process, user and group.
+ The new plugin is 20% more expensive in terms of CPU. We tried hard
to optimize it, but this is as good as it can get.
- haproxy
+ hrsp_1xx, hrsp_2xx, hrsp_3xx, hrsp_4xx, hrsp_5xx, hrsp_other,
hrsp_total for backands and frontends
+ qtime, ctime, rtime, ttime metrics for backend servers
+ backend servers In UP state
- uptime
+ netdata now uses /proc/uptime when CLOCK_BOOTTIME does not report the
same uptime. In containers CLOCK_BOOTTIME reports the uptime of the
host, while /proc/uptime reports the uptime of the container, so now
netdata correctly reports the uptime of the container.
- ksm
+ removed to_scan dimension
+ the savings % reported by netdata was less than the actual - fixed it.
- mdstat
+ various fixes to better monitor rebuild time and rate
- elasticsearch
+ Added several charts for translog / indices segments statistics and
JVM buffer pool utilization, which are often helpful when evaluating
an elasticsearch node health
- apcupsd
+ netdata now supports monitoring multiple APC UPSes.
- isc-dhcpd
+ netdata now also supports monitoring IPv6 leases
- fronius
+ added a new dimension solar_consumption
+ added alarms
- stiebeleltron
+ added alarms
- python.d
+ python.d.plugin can now start even if /etc/netdata/python.d.conf
is missing
+ python.d.plugin now has an internal run counter
+ the unicode decoding of the plugin has been fixed
+ the plugin now does not validate self-signed certificates
+ the plugin can not revive obsolete charts
- charts.d
+ charts.d.plugin BASH modules can now have custom number of retries in
case of data collection failures
* new alarm notifications plugins:
- alerta
- IRC
* web server:
- netdata now has a new internal web server that supports a fixed number
of threads - we call it static web server. This web server allows
netdata to work around memory fragmentation (since the treads are fixed,
the underlying memory allocators reuse the same memory arenas) and cpu
utilization (we can control the number of threads that will be used by
netdata). This is the default now.
- now the static threads web server reports the CPU usage of each of its
threads.
- the HTTP response headers now include the netdata version
* dashboard:
- the print button now respects the URL path netdata is hosted.
- dygraphs updated to the latest version - this fixes an issue that
prevented netdata charts from being interactive under certain conditions
- added dygraph theme logscale
- fontawesome updated to version 5
- d3 updated to the latest version (this broke c3 charts that require an
older version)
- added d3pie charts
- custom dashboards can now have alarms for specific roles (all, none,
one or more).
- allow stacked charts to zoom vertically when dimensions are selected
- netdata now has a global XSS protection
- netdata now uses intersectionObserver when available - this improves the
scrolling performance of the dashboard.
- prevent date, time and units from wrapping at the charts legends
- various units scaling improvements
- added data-common-colors="NAME" chart option for custom dashboards
- added wiki page for creating custom dashboards on Atlassian's Confluence
- prevented a double click on the charts' toolbox to select the text of
the buttons.
- fixed the alignment of dashboard icons
- added a simple js, called refresh-badges.js, to update badges on a
custom web page
* badges:
- netdata badges can now be scaled
* API:
- added gtime parameter, for group time. This is used to request from
netdata to return values in a different rate (i.e. gtime=60 on a X/sec
dimension, will return X/min).
- fixed a rounding bug in JSON generation
- the dimensions= parameter now supports simple patterns and added option
values match-ids and match-names to control which matches are executed
for dimensions.
* alarms:
- "system.swap" alarms now send notifications with a 30 seconds delay,
to work-around a kernel bug that incorrectly reports all swap as
instantly used under containers
- added alarm to predict the time a mount point will run out of inodes
- added alerta.io notifications
- added available memory alarm
- removed unsupported html tags from hipchat notifications.
- pagerduty notifications have been modified to avoid incident duplication
- alarm definitions can now use both chart IDs and chart names (prior to
this version only chart IDs were allowed).
- curl options (eg for disabling SSL certificates verification) for
alarm-notify.sh can now be defined in health_alarm_notify.conf.
- netdata can now send notifications to IRC channels
* backends:
- on netdata masters, allow filtering the hosts that will be sent to
backends with send hosts matching = * pattern.
- improved connection error handling and added retries to allow netdata
connect to certain backends that failed with EALREADY or EINPROGRESS.
- json backends now receive host tags (the tags have to be formatted in a
json friendly way)
- re-worked the alarm that triggers when backend data are lost,
to avoid flip-flops.
- prometheus backends:
+ added URL option timestamps=yes|no to /api/v1/allmetrics to support
prometheus Pushgateway
+ added netdata_info variable with the version of netdata
+ renamed netdata_host_tags to netdata_host_tags_info (the old exists
but is deprecated and will be removed eventually)
+ when prometheus uses average metrics, netdata remembers the last
access time the prometheus collected metrics, on a per host basis.
* metrics streaming between netdata:
- netdata masters and proxies now expose the version of the netdata
collecting the metrics, not their own. So, now a netdata master shows
on the dashboard and sends to backends the version of the netdata
collecting the metrics
- added stream.conf option multiple "connections = accept | deny" to allow
or deny multiple connection for the same netdata host. The default
remains "accept", but it is likely to be changed to no on future
versions.
* other:
- added global option gap when lost iterations to control the number of
iterations that should be lost to show a gap on the charts.
- various fixes/improvements related to netdata logs - the main change is
that now netdata logs the thread name that logged the message, providing
helpful insights about the thread that complained.
- re-worked the exit procedure of netdata to allow it cleanup properly -
sometimes netdata was deadlocked during exit, waiting forever - now
netdata always exits promptly
- fixed compilation on ancient gcc versions
- netdata was always setting itself to the idle process scheduling
priority, even when it was configured to do otherwise. Fixed it.
- Update to v1.9.0
* new features:
- IP address ACLs
- auto-scaling units (KB → MB → GB → TB etc.)
- dashboard snapshots for any timeframe/resolution
- detection of TCP overflows and dropped connections
- detection of redis background save failures
- export dashboards to PDF
- flock.com and kavenegar.com alarm notifications
- highlighting timeframes on all charts by holding Ctrl or Alt
- monitoring of couchdb, powerdns, beanstalkd and dnsdist
- monitoring of libvirt VMs
- new alarms:
1. too many tcp orphan sockets
2. tcp memory that detects that the tcp stack is under memory pressure
or close to giving memory errors
3. too many tcp connections (for kernels that do not support dynamic
allocation of connections)
- timezone support
- unit conversion (temperatures, time, etc.)
* fixes/improvements:
- better/faster dashboard scrolling
- python.d.plugin rewritten
+ supports option autodetection_retry: SECONDS
+ the new URL service disables certificates checks by default,
to allow self-signed certificates to work without configuration.
- streaming:
+ netdata proxies with more than 100 slaves had a timing issue that
caused them to crash randomly on slave reconnects. Parts of the code
have been rewritten to get rid of the timing issue.
+ netdata slaves and proxies now have a protection that ensures they will
never use 100% CPU, even if the master is misbehaving.
+ expired orphaned hosts are now removed from the my-netdata menu of the
dashboard.
+ streaming functions can now be monitored via access.log
+ streaming now transfers alarm variables too
- web_log plugin
+ custom regex now supports parsing hostnames and IPs
+ now parses lines with error 408 (request timeout - these are a special
case, since the request has not received by the web server, so the log
line is incomplete)
+ now properly parses resp_length with value
- disks under Linux are renamed using /dev/disk/by-label. An option has been
added at netdata.conf to also allow renaming based on /dev/disk/by-id.
- chrony is now disabled by default, because there have been reports that
chronyc enters an infinite loop in CentOS and RHEL.
- tomcat improvements to support flavors of the tomcat server
- chart system.io (the total system Disk I/O) is now calculated by
aggregating the reads and writes of all physical disks. The previous
system.io chart (that is based on pgpgin and pgpgout from /proc/vmstat)
is now named system.pgpgio. The key difference is that the new system.io
now sees ZFS I/O, and it also correctly and accurately sums the real disk
bandwidth of RAID arrays.
- chart system.net (the total system network bandwidth) is now calculated
by aggregating the bandwidth of all physical network interfaces and is
common for both IPv4 and IPv6.
- tc (QoS) charts now sort the dimensions on the legends, the same way tc
reports them.
- mysql (and mariadb) got new charts for galera replication
- postgres versions <= 10 the WAL directory was named pg_xlog' and from 10
upwards has been renamed to pg_wal
- freeipmi.plugin got a command line option (can be given in netdata.conf)
to ignore certain sensor IDs that are faulty.
- minor openvpn_log, smartd, smartd_log, varnish, mdstat, dns_query_time,
isc_dhcpd, freeradius and node.d.plugin fixes/improvements
- netdata now supports multiple plugin directories. The setting is the same
in netdata.conf, plugins directory = "DIRECTORY1" "DIRECTORY2" ...,
up to 20 directories.
- netdata now supports alarms variables. Each plugin can now define host
global and chart local variables with static values that can be used in
alarms' expressions.
- dashboard:
+ hover selection of charts is now faster on all browsers
+ the dashboard is now fixed when a modal is open, preventing scrolling
+ the dashboard now uses fontawesome 5.0.1 for icons
+ chart names can now be searched with browser control-F
+ netdata now detects libvirt VM network interfaces and moves them to the
VM section of the dashboard
+ now shows the context, resolution and plugin/module of each chart in
tooltips
+ should now put a lot less CPU pressure on the browser when the page
does not have focus
- statsd:
+ metrics can now be added to statsd synthetic charts using patterns
+ dimensions added to statsd synthetic charts can automatically be renamed
using a dictionary
+ timers and histograms now report zeros when nothing is collected
- badges:
+ fixed a bug in netdata badges that was incorrectly matching zero values
with the null color condition
+ added API option display_absolute to allow badges use the signed value
for color evaluation, but present the absolute value
- alarms/notifications:
+ warning emails sent by netdata are now a little bit more orange
(they were a bit greenish)
+ fixed a bug in email notifications that was triggering a corrupted MIME
match by anti-spam solutions
+ pushbullet notifications now track the devices, so that per-device
filtering at pushbullet is possible. Also improved the formatting a bit.
+ correct priority of warnings for pushover notifications
+ alarms can now use variables like this:
${variable with spaces or +, -, *, / in it}
- other:
+ access.log has been refactored to support monitoring all netdata
operations
+ inodes monitoring is now by default disabled for mount points based on
filesystems that do not have a maximum inode threshold (such as cephfs)
+ rabbitmq has been added to apps_groups.conf so that apps.plugin now
monitors (cpu, memory, disk I/O, sockets, etc) rabbitmq instances
+ several email and log management apps have been added to email and logs
targets of apps_groups.conf
+ ceph target added to apps_groups.conf to allow netdata to monitor Ceph
+ refactored several internal data collection plugins to eliminate a few
hundreds of index lookups per second
+ netdata.conf settings that are loaded from disk, but were the same with
the default ones, were generated commented when the server was asked to
return its config. Now all loaded settings are generated uncommented.
+ netdata simple patterns can now extract the the wildcarded part of the
string they match (used in statsd synthetic charts)
+ netdata simple patterns can allow escaping spaces by prefixing them
with a backslash
- Update netdata-smartd-log-path.patch
- Remove netdata-main-return.patch (fixed upstream)
- Fix rpmlint env-script-interpreter warnings
- Add return statement to main function (-Wreturn-type)
* netdata-main-return.patch
- Update to v1.8.0
* fixed bugs:
- streaming slaves consuming 100% CPU
- missing alarm notifications on netdata masters
- API generating corrupted JSON
- crashing when starting on systems without writable disks
- python.d.plugin URLService did not support HTTP keep-alive
- streamed charts with duplicate names
* netdata enhancements:
- netdata can now listen on UNIX domain sockets
- systemd netdata.service now allows setting negative netdata
OOM score and restarts netdata if it crashes
* new plugins:
- CPU thermal throttling charts
- chrony plugin
- Stiebel Eltron plugin to collect metrics from heat pumps and
hot water installations from Stiebel Eltron ISG
* improved plugins:
- web_log bugfixes, enhancements and optimizations
(including squid logs)
- web_log now enables parsing HTTP/2 logs in custom_log_format
- redis bugfixes
- haproxy bugfixes
- elasticsearch bugfixes and optimizations
- rabbitmq bugfixes and optimizations
- mdstat bugfixes
- tomcat improvements
- mysql improvements
- dovecot improvements
- postgres improvements
- cpufreq fixed a bug that prevented accurate reporting of
CPU frequencies.
- cpuidle performance improvements (faster under load)
- fail2ban bugfixes
- SNMP plugin new uses latest net-snmp and the corrupted 64-bit
counters encountered under certain node.js versions is fixed
* dashboard improvements:
- easypiecharts and gauges can now render arbitrary ranges and
animate clockwise or counterclockwise
- container network interfaces are now moved to the container
section
- containers and VMs now have summary gauges on the dashboard
- traditionally netdata was using 1024 bits = 1 kilobit.
It is fixed: 1000 bits = 1 kilobit.
- netdata charts should now work on wordpress pages
* alarms and notifications:
- alarm-notify.sh now supports debug mode, showing the exact
commands it runs to send notifications,
when export NETDATA_ALARM_NOTIFY_DEBUG=1
- alarm-notify.sh now supports setting the sender email address
of the emails it sends
- emails sent by alarm-notify.sh now include headers to reduce
the possibility of them being scored as spam
- network related alarms got new thresholds and improved badges
- netdata now detects if the system has been suspended and
pauses all alarms for 60 seconds on resume, to prevent false
alarms
- netdata alarms now support filtering based on hostname and OS
- slack notifications now show the host that sent the alarm
* statsd:
- the number of fractional points supported by statsd is now
configurable (1 to 7)
- fixed: 95th percentile calculation on statsd histograms and
timers was incorrectly averaging the values
- fixed: statsd metrics with non-ASCII text were processed by
the statsd server, but were breaking JSON data generated by
netdata
- Add download_files _service
- update source tarball to match upstream copy
- Update to v1.7.0
* netdata is now a fully featured statsd server
* improved metrics streaming and replication
* backend enhancements, including:
- host tags and metrics filtering on the netdata side
- rewritten prometheus support to utilize more prometheus
features and provide more flexibility and integration options
* now monitors ZFS, ElasticSearch, RabbitMQ, Go applications
(via expvar), samba, squid logs (with web_log plugin).
* improved dashboard loading times
* custom hook support for alarms
- Update netdata-smartd-log-path.patch
- mention patches added in last commit
* add netdata-automake-no-dist-xz.patch
* add netdata-logrotate-su.patch
* add netdata-smartd-log-path.patch
- Adjust default smartd log path to openSUSE default
- Update dependencies
* Enable more features
* Add soft dependencies
- Build on all enabled SUSE flavors
- Improve package description
- Use upstream .service/init.d/logrotate files (virtually identical)
- Fix permissions
- Clean up %install section (more readable)
- spec-cleaner
- Do not suppress errors from user/group creation.
- Trim sensationalist wording from description.
Replace -exec rm by just -delete.
- Add netdata-rpmlintrc for spec file.
- Fix url and source link.
- Update to version 1.6.0
- Add systemd unit, init script and logrotate config from netdata git repo
- Update to version 1.5.0 (see included ChangeLog)
- Added netdata-rpmlintrc
- Spec cleanup
- Upgraded to 1.2.0
- Initial package, version: 1.0.1
OBS-URL: https://build.opensuse.org/request/show/705267
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=10
|
|||
|
|
6ab2c1f795 | OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=33 | ||
|
|
c7a7da2866 | OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=32 | ||
|
|
f9fa996cb4 |
- Update to v1.15.0
Bug Fixes:
* Prowl notifications were not being sent, unless another
notification method was also active
* Fix exception handling in the python.d plugin
* The node applications group did not include all node processes.
* The nvidia_smi collector displayed incorrect power usage
* The python.d plugin would sometimes hang, because it lacked
a connect timeout
* The mongodb collector raised errors due to various KeyErrors
* The smartd_log collector would show incorrect temperature
values
Improvements:
* Support for aggregate node view
* Database engine
* New collector modules:
+ Go.d collectors for OpenVPN, the Tengine web server and
ScaleIO (VxFlex OS) instances
+ Monitor disk access latency like ioping does
* Energi Core daemon monitoring, suits other Bitcoin forks
* Collector improvements:
+ Add docker swarm manager metrics to the go.d docker_engine
collector
+ Implement unified cgroup cpu limit
+ python.d.plugin: Allow monitoring of HTTP(S) endpoints which
require POST data and make the UrlService more flexible
* Support the AWS Kinesis backend for long-term storage
* Add a new "text-only" chart renderer
* API Improvements:
+ Smarter caching of API calls. Do not cache alarms and info
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=31
|
||
| 09cc84f0db |
Accepting request 700414 from network
- Update to v1.14.0
The release introduces major additions to Kubernetes monitoring,
with tens of new charts for Kubelet, kube-proxy and coredns
metrics, as well as significant improvements to the netdata
helm chart.
Two new collectors were added, to monitor Docker hub and Docker
engine metrics.
Finally, v1.14 adds support for version 2 cgroups, OpenLDAP
over TLS, NVIDIA SMI free and per process memory and configurable
syslog facilities.
Bug Fixes:
* Fixed problem autodetecting failed jobs in python.d plugin.
It now properly restarts jobs that are being rechecked,
as soon as they are able to run.
* CouchdDB monitoring would stop sometimes with an exception.
Fixed the unhandled exception causing the issue.
* The netdata api deliberately returned http error 400 when
netdata ran in memory mode none. Modified the behavior to
return responses, regardless of the memory mode
* The python.d plugin sometimes does not receive SIGTERM when
netdata exits, resulting in zombie processes. Added a heartbeat
so that the process can exit on SIGPIPE.
* The new SMS Server Tools notifications did not handle errors
well, resulting in cryptic error messages.
Improved error handling.
* Fix segmentation fault in streaming, when two dimensions had
similar names.
* Kubernetes Helm Chart: Fixed incorrect use of namespaces in
ServiceAccount and ClusterRoleBinding RBAC fixes.
* Elastic search: The option to enable HTTPS was not included in
the config file, giving the erroneous impression that HTTPS was
not supported. The option was added.
* RocketChat notifications were not being sent properly.
Added default recipients for roles in the health alarm
notification configuration.
Improvements:
* go.d.plugin v0.4.0 : Docker Hub and k8s coredns collectors,
springboot2 URI filters support.
* go.d.plugin v0.3.1 : Add default job to run k8s_kubelet.conf,
k8s_kubeproxy, activemq modules
* go.d.plugin v0.3.0 : Docker engine, kubelet and kub-proxy
collectors. x509check module reading certs from file support
* Added unified cgroup support that includes v2 cgroups
* Disk stats: Added preferred disk id pattern, so that users can
see the id they prefer, when multiple ids appear for the same
device
* NVIDIA SMI: Added memory free and per process memory usage
charts to the collector
* OpenLDAP: Added TLS support, to allow monitoring of LDAPS.
* PHP-FPM: Add health check to raise alarms when the phpfm server
is unreachable
* PostgreSQL: Our configuration options to connect to a DB did
not support all possible option. Added option to connect to a
PostreSQL instance by defining a connection string (URI).
* python.d.plugin: There was no way to delete obsolete dimensions
in charts created by the python.d plugin. The plugin can now
delete dimension at runtime.
* netdata supports sending its logs to Syslog, but the facility
was hard-coded. We now support configurable Syslog facilities
in netdata.conf.
* Kubernetes Helm Chart improvements:
+ Added serviceName in statefulset spec to align with the k8s
documentation
+ Added preStart command to persist slave machine GUIDs,
so that pod deletion/addition during upgrades doesn't lose
the slave history.
+ Disabled non-essential master netdata collector plugins to
avoid duplicate data
+ Added preStop command to wait for netdata to exit gracefully
before removing the container
+ Extended configuration file support to provide more control
from the helm command line
+ Added option to disable Role-based access control
+ Added liveness and readiness probes.
OBS-URL: https://build.opensuse.org/request/show/700414
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=9
|
|||
|
|
02e763ac43 |
- Update to v1.14.0
The release introduces major additions to Kubernetes monitoring,
with tens of new charts for Kubelet, kube-proxy and coredns
metrics, as well as significant improvements to the netdata
helm chart.
Two new collectors were added, to monitor Docker hub and Docker
engine metrics.
Finally, v1.14 adds support for version 2 cgroups, OpenLDAP
over TLS, NVIDIA SMI free and per process memory and configurable
syslog facilities.
Bug Fixes:
* Fixed problem autodetecting failed jobs in python.d plugin.
It now properly restarts jobs that are being rechecked,
as soon as they are able to run.
* CouchdDB monitoring would stop sometimes with an exception.
Fixed the unhandled exception causing the issue.
* The netdata api deliberately returned http error 400 when
netdata ran in memory mode none. Modified the behavior to
return responses, regardless of the memory mode
* The python.d plugin sometimes does not receive SIGTERM when
netdata exits, resulting in zombie processes. Added a heartbeat
so that the process can exit on SIGPIPE.
* The new SMS Server Tools notifications did not handle errors
well, resulting in cryptic error messages.
Improved error handling.
* Fix segmentation fault in streaming, when two dimensions had
similar names.
* Kubernetes Helm Chart: Fixed incorrect use of namespaces in
ServiceAccount and ClusterRoleBinding RBAC fixes.
* Elastic search: The option to enable HTTPS was not included in
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=29
|
||
| 031d6f14ed |
Accepting request 685493 from network
- Update to v1.13.0
netdata has taken the first step into the world of Kubernetes,
with a beta version of a Helm chart for deployment to a k8s
cluster and proper naming of the cgroup containers.
We have big plans for Kubernetes, so stay tuned!
A major refactoring of the python.d plugin has resulted in a
dramatic decrease of the required memory, making netdata even
more resource efficient.
We also added charts for IPC shared memory segments and total
memory used.
Improvements:
* Kubernetes: Helm chart and proper cgroup naming
* python.d.plugin: Reduce memory usage with separate process
for initial module checking and loaders cleanup
* IPC shared memory charts
* mysql module add ssl connection support
* FreeIPMI: Have the debug option apply the internal freeipmi
debug flags
* Prometheus backend: Support legacy metric names for source=avg
* Registry: Allow deleting the host we are looking at
* SpigotMC: Use regexes for parsing.
Bug Fixes:
* Postgres: fix connection issues
* Proxmox container: Fix cgroup naming and use total_* memory
counters for cgroups
* proc.plugin and plugins.d: Fix memory leaks
* SpigotMC: Fix UnicodeDecodeError and py2 compatibility fix
* Fix non-obsolete dimension deletion
* UI: Fix incorrect icon for the streaming master
* Docker container names: Retry renaming when a name is not found
* apps.plugin: Don't send zeroes for empty process groups
* go.d.plugin: Correct sha256sum check
* Unbound module: Documentation corrected with troubleshooting
section.
* Streaming: Prevent UI issues upon GUID duplication between
master and slave netdata instances
* Linux power supply module: Fix missing zero dimensions
* Minor fixes around plugin_directories initialization
- Update to v1.12.2
Bug Fixes:
* Zombie processes exist after restart netdata - add heartbeat
to python.d plugin
* RocketChat notifications not working
* SIGSEGV crash during shutdown of tc plugin
* CMake warning for nfacct plugin
Improvements:
* Oracledb python module
* Show streamed servers even for users that are not signed in
OBS-URL: https://build.opensuse.org/request/show/685493
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=8
|
|||
|
|
5dbe092709 |
- Update to v1.13.0
netdata has taken the first step into the world of Kubernetes,
with a beta version of a Helm chart for deployment to a k8s
cluster and proper naming of the cgroup containers.
We have big plans for Kubernetes, so stay tuned!
A major refactoring of the python.d plugin has resulted in a
dramatic decrease of the required memory, making netdata even
more resource efficient.
We also added charts for IPC shared memory segments and total
memory used.
Improvements:
* Kubernetes: Helm chart and proper cgroup naming
* python.d.plugin: Reduce memory usage with separate process
for initial module checking and loaders cleanup
* IPC shared memory charts
* mysql module add ssl connection support
* FreeIPMI: Have the debug option apply the internal freeipmi
debug flags
* Prometheus backend: Support legacy metric names for source=avg
* Registry: Allow deleting the host we are looking at
* SpigotMC: Use regexes for parsing.
Bug Fixes:
* Postgres: fix connection issues
* Proxmox container: Fix cgroup naming and use total_* memory
counters for cgroups
* proc.plugin and plugins.d: Fix memory leaks
* SpigotMC: Fix UnicodeDecodeError and py2 compatibility fix
* Fix non-obsolete dimension deletion
* UI: Fix incorrect icon for the streaming master
* Docker container names: Retry renaming when a name is not found
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=27
|
||
| db053a5da4 |
Accepting request 678173 from network
- Drop GPG signature (no longer used)
- Drop spec compatibility with old distro versions
- Drop netdata-automake-no-dist-xz.patch
- Refresh netdata-smartd-log-path.patch
- Update to v1.12.1
Fixes:
* Fix SIGSEGV at startup: Don't free vars of charts that do not
exist #5455
* Prevent invalid Linux power supply alarms during startup #5447
* Correct duplicate flag enum in health.h #5441
* Remove extra 'v' for netdata version from Server response
header #5440 and spec URL #5427
* apcupsd - Treat ONBATT status the same as ONLINE #5435
* Fix #5430 - LogService._get_raw_data under python3 fails on
undecodable data #5431
* Correct version check in UI #5429
* Fix ERROR 405: Cannot download charts index from server -
cpuidle handle newlines in names #5425
* Fix clock_gettime() failures with the CLOCK_BOOTTIME argument
#5415
* Use netnsid for detecting cgroup networks; #5413
* Python module sensors fix #5406 (ilyam8)
* Fix ceph.chart.py for Python3 #5396 (GaetanF)
* Fix warning condition for mem.available #5353
* cups.plugin: Support older versions #5350
Improvements:
* Add driver-type option to the freeipmi plugin #5384
* Add support of tera-byte size for Linux bcache. #5373
* Split nfacct plugin into separate process #5361
* Add cgroup cpu and memory limits and alarms #5172
* Add message queue statistics #5115
- Update to v1.12.0
Key improvements:
* Introducing netdata.cloud, the free netdata service for all
netdata users
* High performance plugins with go.d.plugin (data collection
orchestrator written in Go)
* 7 new data collectors and 11 rewrites of existing data
collectors for improved performance
* A new management API for all netdata servers
* Bind different functions of the netdata APIs to different ports
Management API:
Netdata now has a management API. We plan to provide a full set
of configuration commands using this API.
In this release, the management API supports disabling or
silencing alarms during maintenance periods.
For more information about the management API, check
https://docs.netdata.cloud/web/api/health/#health-management-api
Anonymous statistics:
Anonymous usage information can be collected and sent to Google
Analytics. This functionality is disabled by default in openSUSE.
Remove /etc/netdata/.opt-out-from-anonymous-statistics to enable.
The statistics calculated from this information will be used for:
1. Quality assurance, to help us understand if netdata behaves as
expected and help us identify repeating issues for certain
distributions or environment.
2. Usage statistics, to help us focus on the parts of netdata
that are used the most, or help us identify the extend our
development decisions influence the community.
Information is sent to Netdata via two different channels:
* Google Tag Manager is used when an agent's dashboard is
accessed.
* The script anonymous-statistics.sh is executed by the Netdata
daemon, when Netdata starts, stops cleanly, or fails.
Both methods are controlled via the same opt-out mechanism.
For more information, check
https://docs.netdata.cloud/docs/anonymous-statistics/
Data collection:
This release introduces a new Go plugin orchestrator.
This plugin has its own github repo
(https://github.com/netdata/go-orchestrator). It is open-source,
using the same license and we welcome contributions.
The orchestrator can also be used to build custom data collection
plugins written in Go. We have used the orchestrator to write
many new Go plugins in our go.d plugin github repo.
For more information, check
https://github.com/netdata/go-orchestrator#go-orchestrator-wip
New data collectors:
* Activemq (Go)
* Consul (Go)
* Lighttpd2 (Go)
* Solr (Go)
* Springboot2 (Go)
* mdstat - nonredundant arrays (C)
* CUPS printing system (C)
High performance versions of older data collectors:
* apache (Go)
* dns_query (Go)
* Freeradius (Go)
* Httpcheck (Go)
* Lighttpd (Go)
* Portcheck (Go)
* Nginx (Go)
* cpufreq (C)
* cpuidle (C)
* mdstat (C)
* power supply (C)
Other improved data collectors:
* Fix the python plugin clock (collectors falling behind).
* adaptec_raid: add to python.d.conf.
* apcupsd: Detect if UPS is online.
* apps: Fix process statistics collection for FreeBSD.
* apps: Properly lookup docker container name when running in ECS
* fail2ban: Add 'Restore Ban' action.
* go_expavar: Don't check for duplicate expvars.
* hddtemp: Don't use disk model as dim name.
* megacli: add to python.d.conf.
* nvidia_smi: handle N/A values.
* postgres: Fix integer out of range error on Postgres 11,
fix locks count.
* proc: Don't show zero charts for ZFS filesystem.
* proc; Fix cached memory calculation.
* sensors: Don't ignore 0 RPM fans on start.
* smartd_log: check() unhandled exception: list index out of
range.
* SNMP: Gracefully ignore the offset if the value is not a number
Health Monitoring:
* Add Prowl notifications for iOS users.
* Show count of active alarms per state in email notifications.
* Show evaluated expression and expression variable values in
email notifications.
* Improve support for slack recipients (channels/users).
* Custom notifications: Fix bug with alarm role recipients.
Dashboards:
* Server filtering in my-netdata menu when signed in to
netdata.cloud
* All units are now IEC-compliant abbreviations (KiB, MiB etc.).
* GUI: Make entire row clickable in the registry menu showing
the list of servers.
Backends:
* Do not report stale metrics to prometheus.
Other:
* Treat DT_UNKNOWN files as regular files.
* API: Stricter rules for URL separators.
- Update to v1.11.1
Improved internal database:
Overflown incremental values (counters) do not show a zero point
at the charts. Netdata detects the width (8bit, 16bit, 32bit,
64bit) of each counter and properly calculates the delta when
the counter overflows.
The internal database format has been extended to support values
above 64bit.
New data collection plugins:
* openldap, to collect performance statistics from OpenLDAP
servers.
* tor, to collect traffic statistics from Tor.
* nvidia_smi to monitor NVIDIA GPUs.
Improved data collection plugins:
* BUG FIX: network interface names with colon (:) in them were
incorrectly parsed and resulted in faulty data collection
values.
* BUG FIX: smartd_log has been refactored, has better python v2
compatibility, and now supports SCSI smart attributes
* cpufreq has been re-written in C - since this module if common,
we decided to convert to an internal plugin to lower the
pressure on the python ones. There are a few more that will be
transitioned to C in the next release.
* BUG FIX: sensors got some compatibility fixes and improved
handling for lm-sensors errors.
Health monitoring:
* BUG FIX: max network interface speed data collection was
faulty, which resulted in false-positive alarms on systems with
multiple interfaces using different speeds (the speed of the
first network interface was used for all network interfaces).
Now the interface speed is shown as a badge.
* alerta.io notifications got a few improvements
* BUG FIX: conntrack_max alarm has been restored (was not working
due to an invalid variable name referenced)
Registry (my-netdata menu):
* It has been refactored a bit to reveal the URLs known for each
node and now it supports deleting individual URLs.
- Update to 1.11.0
* Stock config files are now in /usr/lib/netdata; use the
/etc/netdata/edit-config script to copy and edit them.
* The query engine of netdata has been re-written to support
query plugins. We have already added the following algorithms
that are available for alarm, charts and badges:
+ stddev, for calculating the standard deviation on any
time-frame.
+ ses or ema or ewma, for calculating the exponential weighted
moving average, or single/simple exponential smoothing on
any time-frame.
+ des, for calculating the double exponential smoothing on any
time-frame.
+ cv or rsd, for calculating the coefficient of variation for
any time-frame.
Fixed security issues:
* CVE-2018-18836 Fixed JSON Header Injection (an attacker could
send \n encoded in the request to inject a JSON fragment into
the response).
* CVE-2018-18837 Fixed HTTP Header Injection (an attacker could
send \n encoded in the request to inject an HTTP header into
the response).
* CVE-2018-18838 Fixed LOG Injection (an attacker could send \n
encoded in the request to inject a log line at access.log).
* CVE-2018-18839 Not fixed Full Path Disclosure, since these are
intended (netdata reports the absolute filename of web files,
alarm config files and alarm handlers).
* Fixed Privilege Escalation by manipulating apps.plugin or
cgroup-network error handling.
* Fixed LOG injection (by sending URLs with \n in them).
New data collection modules:
* rethinkdbs for monitoring RethinkDB performance
* proxysql for monitoring ProxySQL performance
* litespeed for monitoring LiteSpeed web server performance.
* uwsgi for monitoring uWSGI performance
* unbound for monitoring the performance of Unbound DNS servers.
* powerdns for monitoring the performance of PowerDNS servers.
* dockerd for monitoring the health of dockerd
* puppet for monitoring Puppet Server and Puppet DB.
* logind for monitoring the number of active users.
* adaptec_raid and megacli for monitoring the relevant raid
controller
* spigotmc for monitoring minecraft server statistics
* boinc for monitoring Berkeley Open Infrastructure Network
Computing clients.
* w1sensor for monitoring multiple 1-Wire temperature sensors.
* monit for collecting process, host, filesystem, etc checks
from monit.
* linux_power_supplies for monitoring Linux Power Supplies
attributes
Data collection orchestrators changes:
* node.d.plugin does not use the js command any more.
* python.d.plugin now uses monotonic clocks. There was a
discrepancy in clocks used in netdata that resulted in a shift
in time of python module after some time (it was missing 1 sec
per day).
* added MySQLService for quickly adding plugins using mysql
queries.
* URLService now supports self-signed certificates and supports
custom client certificates.
* all python.d.plugin modules that require sudo to collect
metrics, are now disabled by default, to avoid security alarms
on installations that do not need them.
Improved data collection modules:
* apps.plugin now detects changes in process file descriptors,
also fixed a couple of memory leaks. Its default configuration
has been enriched significantly, especially for IoT.
* freeipmi.plugin now supports option ignore-status to ignore
the status reported by given sensors.
* statsd.plugin (for collecting custom APM metrics)
+ The charting thread has been optimized for lowering its CPU
consumption when several millions of metrics are collected.
+ sets now report zeros instead of gaps when no data are
collected
+ histograms and timers have been optimized for lowering their
CPU consumption to support several thousands of such metrics
are collected.
+ histograms had wrong sampling rate calculations.
+ gauges now ignore sampling rate when no sign is included in
the value.
+ the minimum sampling rate supported is now 0.001.
+ netdata statsd is now drop-in replacement for datadog statsd
(although statsd tags are currently ignored by netdata).
* proc.plugin (Linux, system monitoring)
+ Unused interrupts and softirqs are not used in charts
(this saves quite some processing power and memory on systems
with dozens of CPU cores).
+ fixed /proc/net/snmp parsing of IcmpMsg lines that failed on
a few systems.
+ Veritas Volume Manager disks are now recognized and named
accordingly.
+ Now netdata collects TcpExtTCPReqQFullDrop and re-organizes
metrics in charts to properly monitor the TCP SYN queue and
the TCP Accept queue of the kernel.
+ Many charts that were previously reported as IPv4, were
actually reflecting metrics for both IPv4 and IPv6.
They have been renamed to ip.*.
+ netdata now monitors SCTP.
+ Fixed BTRFS over BCACHE sector size detection.
+ BCACHE data collection is now faster.
+ /proc/interrupts and /proc/softirqs parsing fixes.
* diskspace.plugin (Linux, disk space usage monitoring)
+ It does not stat() excluded mount points any more (it was
interfering with kerberos authenticated mount points).
+ several filesystems are now by default excluded from
disk-space monitoring, to avoid breaking suspend on
workstations.
* python.d.plugin PYTHON modules (applications monitoring)
+ web_log module now supports virtual hosts, reports http/https
metrics, support squid logs
+ nginx_plus module now handles non-continuous peer IDs
+ ipfs module is optimized, the use of its Pin API is now
disabled by default and can enabled with a netdata module
option (using the IPFS Pin API increases the load on the
IPFS server).
+ fail2ban module now supports IPv6 too.
+ ceph module now checks permissions and properly reports
issues
+ elasticsearch module got better error handling
+ nginx_plus module now uses upstream ip:port instead of
transient id to identify dimensions.
+ redis, now it supports Pika, collects evited keys, fixes
authentication issues reported and improves exception
handling.
+ beanstalk, bug fix for yaml config loading.
+ mysql, the % of active connections is now monitored,
query types are also charted.
+ varnish, now it supports versions above 5.0.0
+ couchdb
+ phpfpm, now supports IPv6 too.
+ apache, now supports IPv6 too.
+ icecast
+ mongodb, added support for connect URIs
+ postgress
+ elasticsearch, now it supports versions above 6.3.0,
fixed JSON parse errors
+ mdstat , now collects mismatch_cnt
+ openvpn_log
* node.d.plugin NODE.JS modules
+ snmp was incorrectly parsing a new OID names as float.
* charts.d.plugin BASH modules
+ nut now supports naming UPSes.
Health monitoring:
* Added variable $system.cpu.processors.
* Added alarms for detecting abnormally high load average.
* TCP SYN and TCP accept queue alarms, replacing the old softnet
dropped alarm that was too generic and reported many false
positives.
* system alarms are now enabled on FreeBSD.
* netdata now reads NIC speed and sets alarms on each interface
to detect congestion.
* Network alarms are now relaxed to avoid false positives.
* New bcache alarms.
* New mdstat alarms.
* New apcupsd alarms.
* New mysql alarms.
* New notification methods:
+ rocket.chat
+ Microsoft Teams
+ syslog
+ fleep.io
+ Amazon SNS
Backends:
* Host tags are now sent to Graphite
* Host variables are now sent to Prometheus
Streaming:
* Each netdata slave and proxy now filter the charts that are
streamed. This allows exposing netdata masters to third parties
by limiting the number of charts available at the master.
* Fixed a bug in streaming slaves that randomly prevented them
to resume streaming after network errors.
* Fixed a bug that on slaves that sent duplicated chart names
under certain conditions.
* Fixed a bug that caused slaves to consume 100% CPU (due to a
misplaced lock) when multiple threads were adding dimensions
on the same chart.
* The receiving nodes of streaming (netdata masters and proxies)
can now rate-limit the rate of inbound streaming requests
received.
* Re-worked time synchronization between netdata slaves and
masters.
API:
* Badges that report time, now show "undefined" instead of
"never".
Dashboard:
* Added UTC timezone to the list of available time-zones.
* The dashboard was sending some non-HTTP compliant characters
at the URLs that made netdata dashboards break when used under
certain proxies.
OBS-URL: https://build.opensuse.org/request/show/678173
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=7
|
|||
|
|
435056940f | OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=25 | ||
|
|
1b559a5ad2 |
- Drop GPG signature (no longer used)
- Drop spec compatibility with old distro versions
- Drop netdata-automake-no-dist-xz.patch
- Refresh netdata-smartd-log-path.patch
- Update to v1.12.1
Fixes:
* Fix SIGSEGV at startup: Don't free vars of charts that do not
exist #5455
* Prevent invalid Linux power supply alarms during startup #5447
* Correct duplicate flag enum in health.h #5441
* Remove extra 'v' for netdata version from Server response
header #5440 and spec URL #5427
* apcupsd - Treat ONBATT status the same as ONLINE #5435
* Fix #5430 - LogService._get_raw_data under python3 fails on
undecodable data #5431
* Correct version check in UI #5429
* Fix ERROR 405: Cannot download charts index from server -
cpuidle handle newlines in names #5425
* Fix clock_gettime() failures with the CLOCK_BOOTTIME argument
#5415
* Use netnsid for detecting cgroup networks; #5413
* Python module sensors fix #5406 (ilyam8)
* Fix ceph.chart.py for Python3 #5396 (GaetanF)
* Fix warning condition for mem.available #5353
* cups.plugin: Support older versions #5350
Improvements:
* Add driver-type option to the freeipmi plugin #5384
* Add support of tera-byte size for Linux bcache. #5373
* Split nfacct plugin into separate process #5361
* Add cgroup cpu and memory limits and alarms #5172
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=24
|
||
| d5caae0ccd |
Accepting request 591762 from network
- Remove _service
- Add GPG signature
- Use %license macro
- Update to v1.10.0
* new plugins:
- BTRFS - monitor the allocations of BTRFS filesystems (netdata can now
properly detect when btrfs is going out of space)
- BCACHE - monitor the caching block layer that allows building hybrid
disks using normal HDDs and SSDs
- Ceph - monitor ceph distributed storage
- nginx plus - monitor the nginx+ web servers
- libreswan - monitor IPSEC tunnels
- Traefik - monitor traefik reverse proxies
- icecast - monitor icecast streaming servers
- ntpd - monitor NTP servers
- httpcheck - monitor any remote web server
- portcheck - monitor any remote TCP port
- spring-boot - monitor java spring boot applications
- dnsdist - monitor dnsdist name servers
- hugepages - monitor the allocation of Linux hugepages
* enhanced/improved plugins:
- statsd
+ statsd dimensions now support the options the external plugin
dimensions support (currently the only usable option is hidden to add
the dimension, but make it hidden on the dashboard - a hidden
dimension can participate in various calculations, including alarms).
+ statsd now reports the CPU usage of its threads at the netdata
section.
+ statsd metrics are logged to access.log the first time they are
encountered.
+ statsd metrics now accept the special value zinit to allow them get
initialized without altering their values (this is useful if you have
rare metrics that you need to initialize when netdata starts).
+ statsd over TCP is now a lot faster - netdata can process up to 3.5mil
statsd metrics / second using just one core. Added options to control
the timeouts of TCP statsd connections.
+ fixed the title and context of statsd private charts
+ statsd private charts can now be hidden from the dashboard
- web_log
+ Added web server response timings histogram
- containers monitoring
+ netdata now monitors systemd-nspawn containers.
+ netdata now renames charts of kubernetes containers.
+ virsh is now called with -r to avoid prompting for password
+ cgroup-network is now a lot more strict, preventing unauthorized
privilege escalation
+ cgroup-network now searches for container processes in sub-cgroups
too - this improves the mapping of network interfaces to containers
+ cgroup-network now works even when there are no veth interfaces
in the system
- system memory
+ treat slab memory as cached
+ added a new chart for monitoring the memory available for use,
before hitting swap
+ netdata now monitors Linux hugepages and (transparent) hugepages
- diskspace
+ support huge amounts of mountpoints - netdata was crashing with stack
overflow due to recursion - now it is a loop, so any number of mount
points is supported
- network interfaces
+ moved tcp passive and active opens to a separate chart, to allow the
TCP issues dimensions scale better by default
+ updated the information presented on TCP charts to match the latest
v4.15 kernel source
- postgres
+ new charts:
checkpointer
bgwriter
autovacuum
replication delta
WAL archive
WAL
temporary files
+ The postgres plugin now also works when postgres is in recovery mode.
- rabbitmq
+ added Erlang run queue chart. This is useful in conjunction with the
existing Erlang processes chart to get a better overall idea of what's
going on in the Erlang VM.
+ added rabbitmq information on the dashboard to complement the charts.
- apps.plugin
netdata prior to this version was detecting the user and group of
processes by examining the ownership of /proc/PID/stat. Unfortunately it
seems that the owneship of files in /proc do not change when the process
switches user. So, netdata could not detect the user and group of
processes that started as root and then switched to another user.
Now netdata reads /proc/PID/status:
+ process ownship information is now accurate
+ eliminated the need to read /proc/PID/statm (all the information of
/proc/PID/statm is available in /proc/PID/status)
+ allowed netdata to read VmSwap, so a new chart has been added to
monitor the swap memory usage per process, user and group.
+ The new plugin is 20% more expensive in terms of CPU. We tried hard
to optimize it, but this is as good as it can get.
- haproxy
+ hrsp_1xx, hrsp_2xx, hrsp_3xx, hrsp_4xx, hrsp_5xx, hrsp_other,
hrsp_total for backands and frontends
+ qtime, ctime, rtime, ttime metrics for backend servers
+ backend servers In UP state
- uptime
+ netdata now uses /proc/uptime when CLOCK_BOOTTIME does not report the
same uptime. In containers CLOCK_BOOTTIME reports the uptime of the
host, while /proc/uptime reports the uptime of the container, so now
netdata correctly reports the uptime of the container.
- ksm
+ removed to_scan dimension
+ the savings % reported by netdata was less than the actual - fixed it.
- mdstat
+ various fixes to better monitor rebuild time and rate
- elasticsearch
+ Added several charts for translog / indices segments statistics and
JVM buffer pool utilization, which are often helpful when evaluating
an elasticsearch node health
- apcupsd
+ netdata now supports monitoring multiple APC UPSes.
- isc-dhcpd
+ netdata now also supports monitoring IPv6 leases
- fronius
+ added a new dimension solar_consumption
+ added alarms
- stiebeleltron
+ added alarms
- python.d
+ python.d.plugin can now start even if /etc/netdata/python.d.conf
is missing
+ python.d.plugin now has an internal run counter
+ the unicode decoding of the plugin has been fixed
+ the plugin now does not validate self-signed certificates
+ the plugin can not revive obsolete charts
- charts.d
+ charts.d.plugin BASH modules can now have custom number of retries in
case of data collection failures
* new alarm notifications plugins:
- alerta
- IRC
* web server:
- netdata now has a new internal web server that supports a fixed number
of threads - we call it static web server. This web server allows
netdata to work around memory fragmentation (since the treads are fixed,
the underlying memory allocators reuse the same memory arenas) and cpu
utilization (we can control the number of threads that will be used by
netdata). This is the default now.
- now the static threads web server reports the CPU usage of each of its
threads.
- the HTTP response headers now include the netdata version
* dashboard:
- the print button now respects the URL path netdata is hosted.
- dygraphs updated to the latest version - this fixes an issue that
prevented netdata charts from being interactive under certain conditions
- added dygraph theme logscale
- fontawesome updated to version 5
- d3 updated to the latest version (this broke c3 charts that require an
older version)
- added d3pie charts
- custom dashboards can now have alarms for specific roles (all, none,
one or more).
- allow stacked charts to zoom vertically when dimensions are selected
- netdata now has a global XSS protection
- netdata now uses intersectionObserver when available - this improves the
scrolling performance of the dashboard.
- prevent date, time and units from wrapping at the charts legends
- various units scaling improvements
- added data-common-colors="NAME" chart option for custom dashboards
- added wiki page for creating custom dashboards on Atlassian's Confluence
- prevented a double click on the charts' toolbox to select the text of
the buttons.
- fixed the alignment of dashboard icons
- added a simple js, called refresh-badges.js, to update badges on a
custom web page
* badges:
- netdata badges can now be scaled
* API:
- added gtime parameter, for group time. This is used to request from
netdata to return values in a different rate (i.e. gtime=60 on a X/sec
dimension, will return X/min).
- fixed a rounding bug in JSON generation
- the dimensions= parameter now supports simple patterns and added option
values match-ids and match-names to control which matches are executed
for dimensions.
* alarms:
- "system.swap" alarms now send notifications with a 30 seconds delay,
to work-around a kernel bug that incorrectly reports all swap as
instantly used under containers
- added alarm to predict the time a mount point will run out of inodes
- added alerta.io notifications
- added available memory alarm
- removed unsupported html tags from hipchat notifications.
- pagerduty notifications have been modified to avoid incident duplication
- alarm definitions can now use both chart IDs and chart names (prior to
this version only chart IDs were allowed).
- curl options (eg for disabling SSL certificates verification) for
alarm-notify.sh can now be defined in health_alarm_notify.conf.
- netdata can now send notifications to IRC channels
* backends:
- on netdata masters, allow filtering the hosts that will be sent to
backends with send hosts matching = * pattern.
- improved connection error handling and added retries to allow netdata
connect to certain backends that failed with EALREADY or EINPROGRESS.
- json backends now receive host tags (the tags have to be formatted in a
json friendly way)
- re-worked the alarm that triggers when backend data are lost,
to avoid flip-flops.
- prometheus backends:
+ added URL option timestamps=yes|no to /api/v1/allmetrics to support
prometheus Pushgateway
+ added netdata_info variable with the version of netdata
+ renamed netdata_host_tags to netdata_host_tags_info (the old exists
but is deprecated and will be removed eventually)
+ when prometheus uses average metrics, netdata remembers the last
access time the prometheus collected metrics, on a per host basis.
* metrics streaming between netdata:
- netdata masters and proxies now expose the version of the netdata
collecting the metrics, not their own. So, now a netdata master shows
on the dashboard and sends to backends the version of the netdata
collecting the metrics
- added stream.conf option multiple "connections = accept | deny" to allow
or deny multiple connection for the same netdata host. The default
remains "accept", but it is likely to be changed to no on future
versions.
* other:
- added global option gap when lost iterations to control the number of
iterations that should be lost to show a gap on the charts.
- various fixes/improvements related to netdata logs - the main change is
that now netdata logs the thread name that logged the message, providing
helpful insights about the thread that complained.
- re-worked the exit procedure of netdata to allow it cleanup properly -
sometimes netdata was deadlocked during exit, waiting forever - now
netdata always exits promptly
- fixed compilation on ancient gcc versions
- netdata was always setting itself to the idle process scheduling
priority, even when it was configured to do otherwise. Fixed it.
OBS-URL: https://build.opensuse.org/request/show/591762
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=6
|
|||
|
|
4e82777425 |
Fix SLE build
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=22 |
||
|
|
e05a2d8acb |
- Remove _service
- Add GPG signature
- Use %license macro
- Update to v1.10.0
* new plugins:
- BTRFS - monitor the allocations of BTRFS filesystems (netdata can now
properly detect when btrfs is going out of space)
- BCACHE - monitor the caching block layer that allows building hybrid
disks using normal HDDs and SSDs
- Ceph - monitor ceph distributed storage
- nginx plus - monitor the nginx+ web servers
- libreswan - monitor IPSEC tunnels
- Traefik - monitor traefik reverse proxies
- icecast - monitor icecast streaming servers
- ntpd - monitor NTP servers
- httpcheck - monitor any remote web server
- portcheck - monitor any remote TCP port
- spring-boot - monitor java spring boot applications
- dnsdist - monitor dnsdist name servers
- hugepages - monitor the allocation of Linux hugepages
* enhanced/improved plugins:
- statsd
+ statsd dimensions now support the options the external plugin
dimensions support (currently the only usable option is hidden to add
the dimension, but make it hidden on the dashboard - a hidden
dimension can participate in various calculations, including alarms).
+ statsd now reports the CPU usage of its threads at the netdata
section.
+ statsd metrics are logged to access.log the first time they are
encountered.
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=21
|
||
| 4910fe0e14 |
Accepting request 557884 from network
- Update to v1.9.0
* new features:
- IP address ACLs
- auto-scaling units (KB → MB → GB → TB etc.)
- dashboard snapshots for any timeframe/resolution
- detection of TCP overflows and dropped connections
- detection of redis background save failures
- export dashboards to PDF
- flock.com and kavenegar.com alarm notifications
- highlighting timeframes on all charts by holding Ctrl or Alt
- monitoring of couchdb, powerdns, beanstalkd and dnsdist
- monitoring of libvirt VMs
- new alarms:
1. too many tcp orphan sockets
2. tcp memory that detects that the tcp stack is under memory pressure
or close to giving memory errors
3. too many tcp connections (for kernels that do not support dynamic
allocation of connections)
- timezone support
- unit conversion (temperatures, time, etc.)
* fixes/improvements:
- better/faster dashboard scrolling
- python.d.plugin rewritten
+ supports option autodetection_retry: SECONDS
+ the new URL service disables certificates checks by default,
to allow self-signed certificates to work without configuration.
- streaming:
+ netdata proxies with more than 100 slaves had a timing issue that
caused them to crash randomly on slave reconnects. Parts of the code
have been rewritten to get rid of the timing issue.
+ netdata slaves and proxies now have a protection that ensures they will
never use 100% CPU, even if the master is misbehaving.
+ expired orphaned hosts are now removed from the my-netdata menu of the
dashboard.
+ streaming functions can now be monitored via access.log
+ streaming now transfers alarm variables too
- web_log plugin
+ custom regex now supports parsing hostnames and IPs
+ now parses lines with error 408 (request timeout - these are a special
case, since the request has not received by the web server, so the log
line is incomplete)
+ now properly parses resp_length with value
- disks under Linux are renamed using /dev/disk/by-label. An option has been
added at netdata.conf to also allow renaming based on /dev/disk/by-id.
- chrony is now disabled by default, because there have been reports that
chronyc enters an infinite loop in CentOS and RHEL.
- tomcat improvements to support flavors of the tomcat server
- chart system.io (the total system Disk I/O) is now calculated by
aggregating the reads and writes of all physical disks. The previous
system.io chart (that is based on pgpgin and pgpgout from /proc/vmstat)
is now named system.pgpgio. The key difference is that the new system.io
now sees ZFS I/O, and it also correctly and accurately sums the real disk
bandwidth of RAID arrays.
- chart system.net (the total system network bandwidth) is now calculated
by aggregating the bandwidth of all physical network interfaces and is
common for both IPv4 and IPv6.
- tc (QoS) charts now sort the dimensions on the legends, the same way tc
reports them.
- mysql (and mariadb) got new charts for galera replication
- postgres versions <= 10 the WAL directory was named pg_xlog' and from 10
upwards has been renamed to pg_wal
- freeipmi.plugin got a command line option (can be given in netdata.conf)
to ignore certain sensor IDs that are faulty.
- minor openvpn_log, smartd, smartd_log, varnish, mdstat, dns_query_time,
isc_dhcpd, freeradius and node.d.plugin fixes/improvements
- netdata now supports multiple plugin directories. The setting is the same
in netdata.conf, plugins directory = "DIRECTORY1" "DIRECTORY2" ...,
up to 20 directories.
- netdata now supports alarms variables. Each plugin can now define host
global and chart local variables with static values that can be used in
alarms' expressions.
- dashboard:
+ hover selection of charts is now faster on all browsers
+ the dashboard is now fixed when a modal is open, preventing scrolling
+ the dashboard now uses fontawesome 5.0.1 for icons
+ chart names can now be searched with browser control-F
+ netdata now detects libvirt VM network interfaces and moves them to the
VM section of the dashboard
+ now shows the context, resolution and plugin/module of each chart in
tooltips
+ should now put a lot less CPU pressure on the browser when the page
does not have focus
- statsd:
+ metrics can now be added to statsd synthetic charts using patterns
+ dimensions added to statsd synthetic charts can automatically be renamed
using a dictionary
+ timers and histograms now report zeros when nothing is collected
- badges:
+ fixed a bug in netdata badges that was incorrectly matching zero values
with the null color condition
+ added API option display_absolute to allow badges use the signed value
for color evaluation, but present the absolute value
- alarms/notifications:
+ warning emails sent by netdata are now a little bit more orange
(they were a bit greenish)
+ fixed a bug in email notifications that was triggering a corrupted MIME
match by anti-spam solutions
+ pushbullet notifications now track the devices, so that per-device
filtering at pushbullet is possible. Also improved the formatting a bit.
+ correct priority of warnings for pushover notifications
+ alarms can now use variables like this:
${variable with spaces or +, -, *, / in it}
- other:
+ access.log has been refactored to support monitoring all netdata
operations
+ inodes monitoring is now by default disabled for mount points based on
filesystems that do not have a maximum inode threshold (such as cephfs)
+ rabbitmq has been added to apps_groups.conf so that apps.plugin now
monitors (cpu, memory, disk I/O, sockets, etc) rabbitmq instances
+ several email and log management apps have been added to email and logs
targets of apps_groups.conf
+ ceph target added to apps_groups.conf to allow netdata to monitor Ceph
+ refactored several internal data collection plugins to eliminate a few
hundreds of index lookups per second
+ netdata.conf settings that are loaded from disk, but were the same with
the default ones, were generated commented when the server was asked to
return its config. Now all loaded settings are generated uncommented.
+ netdata simple patterns can now extract the the wildcarded part of the
string they match (used in statsd synthetic charts)
+ netdata simple patterns can allow escaping spaces by prefixing them
with a backslash
- Update netdata-smartd-log-path.patch
- Remove netdata-main-return.patch (fixed upstream)
- Fix rpmlint env-script-interpreter warnings
OBS-URL: https://build.opensuse.org/request/show/557884
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=5
|
|||
|
|
29ef9192b4 |
- Update to v1.9.0
* new features:
- IP address ACLs
- auto-scaling units (KB → MB → GB → TB etc.)
- dashboard snapshots for any timeframe/resolution
- detection of TCP overflows and dropped connections
- detection of redis background save failures
- export dashboards to PDF
- flock.com and kavenegar.com alarm notifications
- highlighting timeframes on all charts by holding Ctrl or Alt
- monitoring of couchdb, powerdns, beanstalkd and dnsdist
- monitoring of libvirt VMs
- new alarms:
1. too many tcp orphan sockets
2. tcp memory that detects that the tcp stack is under memory pressure
or close to giving memory errors
3. too many tcp connections (for kernels that do not support dynamic
allocation of connections)
- timezone support
- unit conversion (temperatures, time, etc.)
* fixes/improvements:
- better/faster dashboard scrolling
- python.d.plugin rewritten
+ supports option autodetection_retry: SECONDS
+ the new URL service disables certificates checks by default,
to allow self-signed certificates to work without configuration.
- streaming:
+ netdata proxies with more than 100 slaves had a timing issue that
caused them to crash randomly on slave reconnects. Parts of the code
have been rewritten to get rid of the timing issue.
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=19
|
||
| c10deb8732 |
Accepting request 526806 from network
- Add return statement to main function (-Wreturn-type)
* netdata-main-return.patch
- Update to v1.8.0
* fixed bugs:
- streaming slaves consuming 100% CPU
- missing alarm notifications on netdata masters
- API generating corrupted JSON
- crashing when starting on systems without writable disks
- python.d.plugin URLService did not support HTTP keep-alive
- streamed charts with duplicate names
* netdata enhancements:
- netdata can now listen on UNIX domain sockets
- systemd netdata.service now allows setting negative netdata
OOM score and restarts netdata if it crashes
* new plugins:
- CPU thermal throttling charts
- chrony plugin
- Stiebel Eltron plugin to collect metrics from heat pumps and
hot water installations from Stiebel Eltron ISG
* improved plugins:
- web_log bugfixes, enhancements and optimizations
(including squid logs)
- web_log now enables parsing HTTP/2 logs in custom_log_format
- redis bugfixes
- haproxy bugfixes
- elasticsearch bugfixes and optimizations
- rabbitmq bugfixes and optimizations
- mdstat bugfixes
- tomcat improvements
OBS-URL: https://build.opensuse.org/request/show/526806
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=4
|
|||
|
|
be90374ced |
Mention the patch in a way the SR check scripts understand
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=17 |
||
|
|
cd49c0b46d |
Forgot the patch
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=16 |
||
|
|
eb0ce1bc30 |
- Add return statement to main function (-Wreturn-type)
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=15 |
||
|
|
9db8dd0ae5 | OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=14 | ||
|
|
f0922098c8 |
- Update to v1.8.0
* fixed bugs:
- streaming slaves consuming 100% CPU
- missing alarm notifications on netdata masters
- API generating corrupted JSON
- crashing when starting on systems without writable disks
- python.d.plugin URLService did not support HTTP keep-alive
- streamed charts with duplicate names
* netdata enhancements:
- netdata can now listen on UNIX domain sockets
- systemd netdata.service now allows setting negative netdata
OOM score and restarts netdata if it crashes
* new plugins:
- CPU thermal throttling charts
- chrony plugin
- Stiebel Eltron plugin to collect metrics from heat pumps and
hot water installations from Stiebel Eltron ISG
* improved plugins:
- web_log bugfixes, enhancements and optimizations
(including squid logs)
- web_log now enables parsing HTTP/2 logs in custom_log_format
- redis bugfixes
- haproxy bugfixes
- elasticsearch bugfixes and optimizations
- rabbitmq bugfixes and optimizations
- mdstat bugfixes
- tomcat improvements
- mysql improvements
- dovecot improvements
- postgres improvements
OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=13
|
||
| 3414b30d52 |
Accepting request 512713 from network
- Add download_files _service
- update source tarball to match upstream copy
- Update to v1.7.0
* netdata is now a fully featured statsd server
* improved metrics streaming and replication
* backend enhancements, including:
- host tags and metrics filtering on the netdata side
- rewritten prometheus support to utilize more prometheus
features and provide more flexibility and integration options
* now monitors ZFS, ElasticSearch, RabbitMQ, Go applications
(via expvar), samba, squid logs (with web_log plugin).
* improved dashboard loading times
* custom hook support for alarms
- Update netdata-smartd-log-path.patch
OBS-URL: https://build.opensuse.org/request/show/512713
OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=3
|
|||
|
|
42adb0417e |
- Add download_files _service
- update source tarball to match upstream copy OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=11 |
||
| fa4fe24755 |
Accepting request 510524 from network
1 OBS-URL: https://build.opensuse.org/request/show/510524 OBS-URL: https://build.opensuse.org/package/show/openSUSE:Factory/netdata?expand=0&rev=2 |
|||
|
|
deaa7db77e |
Accepting request 510616 from home:lachs0r:branches:network
OBS-URL: https://build.opensuse.org/request/show/510616 OBS-URL: https://build.opensuse.org/package/show/network/netdata?expand=0&rev=9 |