Commit graph

13 commits

Author SHA1 Message Date
Ben Kochie
0d6e113f90
Enable Prometheus native histograms (#6524)
Add a NativeHistogramBucketFactor parameter to the use of
`NewHistogramVec` in order to enable use of Prometheus Native
Histograms.

This will store automatically computed sparse buckets in CoreDNS.
If a compatible Prometeus requests native histograms this data will
returned instead of the static buckets.

The default factor of 1.05 should provide high quality resolution data.

Signed-off-by: SuperQ <superq@gmail.com>
2024-03-11 16:09:09 -04:00
W. Trevor King
0063d7a80c
plugin/health: Poll localhost by default (#5934)
defaulting to localhost makes things explicit in CoreDNS code, and will give us valid URIs in
the logs

Signed-off-by: W. Trevor King <wking@tremily.us>
2023-03-29 09:57:54 -04:00
Chris O'Haver
037e4920c2
plugin/health: Bypass proxy in self health check (#5401)
* add detail to docs; bypass proxy in self health check

Signed-off-by: Chris O'Haver <cohaver@infoblox.com>
2022-06-17 15:49:53 -04:00
Ondřej Benkovský
a929b0b1ec
plugin/health : rework overloaded goroutine to support graceful shutdown (#5244)
Signed-off-by: Ondřej Benkovský <ondrej.benkovsky@jamf.com>
2022-04-13 13:09:03 -04:00
xuweiwei
1029fea906
Fix a typo in plugin/health (#4982)
Signed-off-by: xuweiwei <xuweiwei_yewu@cmss.chinamobile.com>
2021-11-15 07:29:52 -05:00
Ben Kochie
9edfaed631
Reduce the cardinality of health endpoint metrics (#4650)
The health endpoint histogram has a large amount of cardinality for a
simple endpoint. Introduce a new "Slim" set of buckets for `/health` to
reduce the metrics load on large deployments. Especially those that have
per-node DNS caching services.

Add a metric to count internal health check failures rather than use the
timeout value as side effect monitor of the check error. This avoids
incorrectly recording the timeout value if there is an error that is not
a timeout (ex. refused)

Signed-off-by: SuperQ <superq@gmail.com>
2021-05-27 15:16:38 +02:00
Miek Gieben
634e3fe8f5
plugin/health: add logging for local health request (#4533) 2021-03-19 03:40:38 -07:00
Serge
6f2281ed40
Fix health check endpoint (#4231)
Signed-off-by: Serge Logvinov <serge.logvinov@gmail.com>
2020-10-27 09:15:42 +01:00
Zou Nengren
4166dcc2fe
using promauto package to ensure all created metrics are properly registered (#4025)
Signed-off-by: zounengren <zounengren@cmss.chinamobile.com>
2020-07-25 08:06:28 -07:00
yeya24
88d25cdc20 remove an unused variable (#3278)
Signed-off-by: yeya24 <yb532204897@gmail.com>
2019-09-16 07:28:42 +01:00
Miek Gieben
890cdb5cab plugin/health: cleanups (#2811)
Small, trivial cleanup: got triggered because I saw a comment on how
health plugins polls other plugins which isn't true.

* Remove useless newHealth function
* healthParse -> parse
* Remove useless constants

Net deletion of code.

Signed-off-by: Miek Gieben <miek@miek.nl>
2019-05-04 16:06:04 -04:00
Miek Gieben
2338120f5b
plugin/metrics: add MustRegister function (#1648)
This registers the Collectors iff the metrics plugin has been loaded.
Safes a bunch of code in each and every plugin's setup code.
2018-04-01 13:58:13 +01:00
Miek Gieben
48059a6c3e
Overloaded (#1364)
* plugin/health: add 'overloaded metrics'

Query our on health endpoint and record (and export as a metric) the
time it takes. The Get has a 5s timeout, that, when reached, will set
the metric duration to 5s. The actually call "I'm I overloaded" is left
to an external entity.

* README

* golint and govet

* and the tests
2018-01-10 11:41:22 +00:00