coredns

Author	SHA1	Message	Date
Chris O'Haver	d903a963ee	dont lameduck when reloading (#5472 ) Signed-off-by: Chris O'Haver <cohaver@infoblox.com>	2022-07-06 13:52:18 -04:00
Chris O'Haver	037e4920c2	plugin/health: Bypass proxy in self health check (#5401 ) * add detail to docs; bypass proxy in self health check Signed-off-by: Chris O'Haver <cohaver@infoblox.com>	2022-06-17 15:49:53 -04:00
Ondřej Benkovský	a929b0b1ec	plugin/health : rework overloaded goroutine to support graceful shutdown (#5244 ) Signed-off-by: Ondřej Benkovský <ondrej.benkovsky@jamf.com>	2022-04-13 13:09:03 -04:00
xuweiwei	1029fea906	Fix a typo in plugin/health (#4982 ) Signed-off-by: xuweiwei <xuweiwei_yewu@cmss.chinamobile.com>	2021-11-15 07:29:52 -05:00
Zou Nengren	5191959bd7	cleanup deprecated package io/ioutil (#4920 ) Signed-off-by: zounengren <zouyee1989@gmail.com>	2021-10-13 09:30:31 +02:00
Ben Kochie	9edfaed631	Reduce the cardinality of health endpoint metrics (#4650 ) The health endpoint histogram has a large amount of cardinality for a simple endpoint. Introduce a new "Slim" set of buckets for `/health` to reduce the metrics load on large deployments. Especially those that have per-node DNS caching services. Add a metric to count internal health check failures rather than use the timeout value as side effect monitor of the check error. This avoids incorrectly recording the timeout value if there is an error that is not a timeout (ex. refused) Signed-off-by: SuperQ <superq@gmail.com>	2021-05-27 15:16:38 +02:00
Miek Gieben	634e3fe8f5	plugin/health: add logging for local health request (#4533 )	2021-03-19 03:40:38 -07:00
Serge	6f2281ed40	Fix health check endpoint (#4231 ) Signed-off-by: Serge Logvinov <serge.logvinov@gmail.com>	2020-10-27 09:15:42 +01:00
Chris O'Haver	042e57a177	fix lameduck docs (#4169 ) Signed-off-by: Chris O'Haver <cohaver@infoblox.com>	2020-10-01 08:03:34 -07:00
Miek Gieben	b003d06003	For caddy v1 in our org (#4018 ) * For caddy v1 in our org This RP changes all imports for caddyserver/caddy to coredns/caddy. This is the v1 code of caddy. For the coredns/caddy repo the following changes have been made: * anything not needed by us is deleted * all `telemetry` stuff is deleted * all its import paths are also changed to point to coredns/caddy * the v1 branch has been moved to the master branch * a v1.1.0 tag has been added to signal the latest release Signed-off-by: Miek Gieben <miek@miek.nl> * Fix imports Signed-off-by: Miek Gieben <miek@miek.nl> * Group coredns/caddy with out plugins Signed-off-by: Miek Gieben <miek@miek.nl> * remove this file Signed-off-by: Miek Gieben <miek@miek.nl> * Relax import ordering github.com/coredns is now also a coredns dep, this makes github.com/coredns/caddy fit more natural in the list. Signed-off-by: Miek Gieben <miek@miek.nl> * Fix final import Signed-off-by: Miek Gieben <miek@miek.nl>	2020-09-24 18:14:41 +02:00
Zou Nengren	4166dcc2fe	using promauto package to ensure all created metrics are properly registered (#4025 ) Signed-off-by: zounengren <zounengren@cmss.chinamobile.com>	2020-07-25 08:06:28 -07:00
Miek Gieben	fc546cf129	doc: fix generated manual pages (#3571 ) Went over all generated manual pages and fixed some markdown issues, mostly escaping "_" to avoid underlining entire paragraphs. Some textual fixes in route53 and other cloud DNS plugins. Regenerated the markdown with mmark. Signed-off-by: Miek Gieben <miek@miek.nl>	2019-12-29 13:35:17 +01:00
Miek Gieben	24176a97e6	Move to CODEOWNERS (#3489 ) * Move to CODEOWNERS No change in who own what; just a move to CODEOWNERS. This allows dreck cleanups. Added .dreck.yaml for alias and exec. Fixes: #3486 Signed-off-by: Miek Gieben <miek@miek.nl> * stickler bot Signed-off-by: Miek Gieben <miek@miek.nl> * sort the file Signed-off-by: Miek Gieben <miek@miek.nl>	2019-11-29 13:17:05 +00:00
Zou Nengren	768ca99c57	Fix reloading in health and ready (#3473 ) Signed-off-by: zouyee <zounengren@cmss.chinamobile.com>	2019-11-20 12:14:37 +00:00
Miek Gieben	65458b2de2	Directive -> plugin (#3363 ) Caught my eye, we name things directive still, esp when talking about the prometheus plugin. Rename everything that needs to be plugin to 'plugin'. Also make sure Metrics is a H2 section (not H1). Signed-off-by: Miek Gieben <miek@miek.nl>	2019-10-08 10:20:48 +01:00
Guangming Wang	081e45afa3	cleanup: remove redundant return statement (#3297 ) Signed-off-by: Guangming Wang <guangming.wang@daocloud.io>	2019-09-23 14:40:14 +01:00
Miek Gieben	004c5fca9d	all: simply registering plugins (#3287 ) Abstract the caddy call and make it simpler. See #3261 for some part of the discussion. Go from: ~~~ go func init() { caddy.RegisterPlugin("any", caddy.Plugin{ ServerType: "dns", Action: setup, }) } ~~~ To: ~~~ go func init() { plugin.Register("any", setup) } ~~~ This requires some external documents in coredns.io to be updated as well; the old way still works, so it's backwards compatible. Signed-off-by: Miek Gieben <miek@miek.nl>	2019-09-20 08:02:30 +01:00
yeya24	88d25cdc20	remove an unused variable (#3278 ) Signed-off-by: yeya24 <yb532204897@gmail.com>	2019-09-16 07:28:42 +01:00
xieyanker	9fe7fb95c6	return standardized text for ready and health endpoint (#3195 )	2019-08-26 10:31:24 +00:00
Chris O'Haver	3f47fc8ba4	typo fixes (#3169 ) * spelling fixes * its/it's	2019-08-21 16:08:55 -04:00
Yong Tang	f8bba51f84	Update Caddy to 1.0.1, and update import path (#2961 ) * Update Caddy to 1.0.1, and update import path This fix updates caddy to 1.0.1 and also updates the import path to github.com/caddyserver/caddy This fix fixes 2959 Signed-off-by: Yong Tang <yong.tang.github@outlook.com> * Also update plugin.cfg Signed-off-by: Yong Tang <yong.tang.github@outlook.com> * Update and bump zplugin.go Signed-off-by: Yong Tang <yong.tang.github@outlook.com>	2019-07-03 09:04:47 +08:00
Miek Gieben	076b8d4fba	plugin/health: add OnRestartFailed (#2812 ) Add OnReStartFailed which makes the health plugin stay up if the Corefile is corrupt and we revert to the previous version. Also needs a fix for the channel handling See #2659 Testing it will log the following when restarting with a corrupted Corefile ~~~ 2019-05-04T18:01:59.431Z [INFO] linux/amd64, go1.12.4, CoreDNS-1.5.0 linux/amd64, go1.12.4, [INFO] SIGUSR1: Reloading [INFO] Reloading [ERROR] Restart failed: Corefile:5 - Error during parsing: Unknown directive 'bdhfhdhj' [ERROR] SIGUSR1: starting with listener file descriptors: Corefile:5 - Error during parsing: Unknown directive 'bdhfhdhj' ~~~ After which the curl still works. This also needed a change to reset the channel used for the metrics go-routine which gets closed on shutdown, otherwise you'll see: ~~~ ^C[INFO] SIGINT: Shutting down panic: close of closed channel goroutine 90 [running]: github.com/coredns/coredns/plugin/health.(*health).OnFinalShutdown(0xc000089bc0, 0xc000063d88, 0x4afe6d) ~~~ Signed-off-by: Miek Gieben <miek@miek.nl>	2019-05-04 16:06:25 -04:00
Miek Gieben	890cdb5cab	plugin/health: cleanups (#2811 ) Small, trivial cleanup: got triggered because I saw a comment on how health plugins polls other plugins which isn't true. * Remove useless newHealth function * healthParse -> parse * Remove useless constants Net deletion of code. Signed-off-by: Miek Gieben <miek@miek.nl>	2019-05-04 16:06:04 -04:00
Miek Gieben	98c7a6effb	plugin/health: clarify use a bit (#2791 ) Make clearer how health works and that is it process wide. Signed-off-by: Miek Gieben <miek@miek.nl>	2019-04-18 09:21:02 -07:00
Miek Gieben	c778b3a67c	plugin/health: remove ability to poll other plugins (#2547 ) * plugin/health: remove ability to poll other plugins This mechanism defeats the purpose any plugin (mostly) caching can still be alive, we can probably forward queries still. Don't poll plugins, just tell the world we're up and running. It was only actually used in kubernetes; and there specifically would mean any network hiccup would NACK the entire server health. Fixes: #2534 Signed-off-by: Miek Gieben <miek@miek.nl> * update docs based on feedback Signed-off-by: Miek Gieben <miek@miek.nl>	2019-03-07 22:13:47 +00:00
Jiacheng Xu	ae7fbf31d6	Fix a typo in health plugin readme (#2274 )	2018-11-06 08:54:53 -08:00
Miek Gieben	aea2e9f62e	plugin/health: close codeblock Codeblock wasn't properly closed in the README. Signed-off-by: Miek Gieben <miek@miek.nl>	2018-09-19 10:36:41 -04:00
Francois Tur	f9bdd382dd	Ensure Re-register of metrics variables after a reload (#2080 ) * - ensure plugins that use prometheus.MustRegister, re-register after reload - removing once.Do on the startup function was simplest way to do it. * - fix underscored names (advice of bot) * - tune existing UT for reload, and add a test verifying failing reload does not prevent correct registering for metrics * - ensure different ports for tests that can run in same time ..	2018-09-19 02:11:24 -07:00
Miek Gieben	f3134da45e	Clean up tests logging (#1979 ) * Clean up tests logging This cleans up the travis logs so you can see the failures better. Older tests in tests/ would call log.SetOutput(ioutil.Discard) in a haphazard way. This add log.Discard and put an `init` function in each package's dir (no way to do this globally). The cleanup in tests/ is clear. All plugins also got this init function to have some uniformity and kill any (future) logging there in the tests as well. There is a one-off in pkg/healthcheck because that does log. Signed-off-by: Miek Gieben <miek@miek.nl> * bring back original log_test.go Signed-off-by: Miek Gieben <miek@miek.nl> * suppress logging here as well Signed-off-by: Miek Gieben <miek@miek.nl>	2018-07-19 16:23:06 +01:00
Miek Gieben	4083852b70	Remove trailing whitespace (#1955 ) Prevent future; "remove trailing whitespace" PR, but adding a simple presubmit that checks for this. This presubmit flagged quite some offenders, remove all trailing whitespace from. Apart from that there aren't any other changes. Signed-off-by: Miek Gieben <miek@miek.nl>	2018-07-09 08:08:02 -04:00
Francois Tur	30309861c5	- review BUG related doc for Health and Premotheus after change of behavior to be compatible with reload feature. (#1790 )	2018-05-09 15:09:06 +01:00
Miek Gieben	4c7ae4ea95	plugin/health: update README (#1739 ) * plugin/health: update README Make more clear in the readme that health is limited to 1 server. Fixes #1722 * rephrase and remove ~~~ corefile because it will fail	2018-04-26 08:44:33 +01:00
Miek Gieben	12b2ff9740	Use logging (#1718 ) * update docs * plugins: use plugin specific logging Hooking up pkg/log also changed NewWithPlugin to just take a string instead of a plugin.Handler as that is more flexible and for instance the Root "plugin" doesn't implement it fully. Same logging from the reload plugin: .:1043 2018/04/22 08:56:37 [INFO] CoreDNS-1.1.1 2018/04/22 08:56:37 [INFO] linux/amd64, go1.10.1, CoreDNS-1.1.1 linux/amd64, go1.10.1, 2018/04/22 08:56:37 [INFO] plugin/reload: Running configuration MD5 = ec4c9c55cd19759ea1c46b8c45742b06 2018/04/22 08:56:54 [INFO] Reloading 2018/04/22 08:56:54 [INFO] plugin/reload: Running configuration MD5 = 9e2bfdd85bdc9cceb740ba9c80f34c1a 2018/04/22 08:56:54 [INFO] Reloading complete * update docs * better doc	2018-04-22 21:40:33 +01:00
Miek Gieben	acbcad7b4e	reload: use OnRestart (#1709 ) * reload: use OnRestart Close the listener on OnRestart for health and metrics so the default setup function can setup the listener when the plugin is "starting up". Lightly test with some SIGUSR1-ing. Also checked the reload plugin with this, seems fine: .com.:1043 .:1043 2018/04/20 15:01:25 [INFO] CoreDNS-1.1.1 2018/04/20 15:01:25 [INFO] linux/amd64, go1.10, CoreDNS-1.1.1 linux/amd64, go1.10, 2018/04/20 15:01:25 [INFO] Running configuration MD5 = aa8b3f03946fb60546ca1f725d482714 2018/04/20 15:02:01 [INFO] Reloading 2018/04/20 15:02:01 [INFO] Running configuration MD5 = b34a96d99e01db4015a892212560155f 2018/04/20 15:02:01 [INFO] Reloading complete ^C2018/04/20 15:02:06 [INFO] SIGINT: Shutting down With this corefile: .com { proxy . 127.0.0.1:53 prometheus :9054 whoami reload } . { proxy . 127.0.0.1:53 prometheus :9054 whoami reload } The prometheus port was 9053, changed that to 54 so reload would pick it up. From a cursory look it seems this also fixes: Fixes #1604 #1618 #1686 #1492 * At least make it test * Use onfinalshutdown * reload: add reload test This test #1604 adn right now fails. * Address review comments * Add bug section explaining things a bit * compile tests * Fix tests * fixes * slightly less crazy * try to make prometheus setup less confusing * Use ephermal port for test * Don't use the listener * These are shared between goroutines, just use the boolean in the main structure. * Fix text in the reload README, * Set addr to TODO once stopping it * Morph fturb's comment into test, to test reload and scrape health and metric endpoint	2018-04-21 17:43:02 +01:00
Miek Gieben	ad13d88346	plugin/health: clarify server label (#1707 ) Health overloaded metrics does not carry the server label. Explain why.	2018-04-20 15:03:59 +01:00
Miek Gieben	26d1432ae6	Update all plugins to use plugin/pkg/log (#1694 ) * Update all plugins to use plugin/pkg/log I wish this could have been done with sed. Alas manually changed all callers to use the new plugin/pkg/log package. * Error -> Info * Add docs to debug plugin as well	2018-04-19 07:41:56 +01:00
Miek Gieben	2338120f5b	plugin/metrics: add MustRegister function (#1648 ) This registers the Collectors iff the metrics plugin has been loaded. Safes a bunch of code in each and every plugin's setup code.	2018-04-01 13:58:13 +01:00
Miek Gieben	804f745951	plugin/health: make reload work (#1585 ) * plugin/health: make reload work Remove the once.Do from the startup, so we can re-bind the HTTP listener. Also clarify the usage of health in multiple server blocks (this is not the best approach - but there isn't a generic solution at this point). Manual tested as we lack testing infra, i.e kill -SIGUSR1 and some CURLing of the health endpoint. * Readme test fix * update * dont need this	2018-03-02 21:40:14 -08:00
Miek Gieben	a131c22d24	plugin/health: doc updates (#1582 ) Fixes #1564	2018-03-01 18:32:15 -08:00
Miek Gieben	fd7abd9849	Add OWNERS file (#1486 ) This should have everyone, but the process was quite manual. The rename from middleware -> plugin also meant I had to do some extra digging on who actually submitted the PR. I also double checked the current list of people with commit access. Every plugin now has an OWNERS, except reverse. I'll file a bug for that.	2018-02-08 10:55:51 +00:00
Miek Gieben	c39e5cd014	plugin/health: add lameduck mode (#1379 ) * plugin/health: add lameduck mode Add a way to configure lameduck more, i.e. set health to false, stop polling plugins. Then wait for a duration before shutting down. As the health middleware is configured early on in the plugin list, it will hold up all other shutdown, meaning we still answer queries. * Add New * More tests * golint * remove confusing text	2018-01-18 10:40:09 +00:00
Miek Gieben	48059a6c3e	Overloaded (#1364 ) * plugin/health: add 'overloaded metrics' Query our on health endpoint and record (and export as a metric) the time it takes. The Get has a 5s timeout, that, when reached, will set the metric duration to 5s. The actually call "I'm I overloaded" is left to an external entity. * README * golint and govet * and the tests	2018-01-10 11:41:22 +00:00
Miek Gieben	58221f55db	Manual pages (#1346 ) * Add manual pages Generate manual pages from the README and extend README with Name and Description sections. The generation requires 'ronn' which may not be available. Just check in all generated manual pages.	2018-01-04 12:53:07 +00:00
James Hartig	a469a17cdf	Instead of hardcoding plugin lists in autopath/health, use interfaces. (#1306 ) Switched health and autopath plugin to allow any plugins to be used instead of a hardcoded list. I did not switch federation over since it wasn't obvious that anything other than kubernetes could be used with it. Fixes #1291	2017-12-12 20:40:30 +00:00
Miek Gieben	52b49f4838	plugin/health: implement dyn health checks (#1214 ) Implement health.Healther in erratic and kubernetes plugin. The kubernetes' healtcheck is only performed on startup - i.e. turn healthy after the initial loading. Erratic follow the drop count: every query%drop turns the healthcheck unhealthy. Fixes: #985	2017-11-13 09:52:40 +00:00
Miek Gieben	c1f67493de	docs: less CoreDNS in docs (#1154 ) Various other changes.	2017-10-20 09:47:43 +01:00
Miek Gieben	427aed6f5b	doc update (#1140 ) * doc update Go through all README and fix mistakes, extend example and let more corefile snippets be test for validity. * Cant use spefic addr in test	2017-10-10 09:39:35 +02:00
Damian Myerscough	aecf916377	Fixing a small typo (#1097 )	2017-09-21 07:22:13 +01:00
Miek Gieben	be47709270	More Middleware -> Plugin conversions (#1088 ) Forgot about these.	2017-09-16 14:13:28 +01:00
Miek Gieben	2388e36c2c	plugin: README.md updates (#1084 ) updates so the look better on coredns.io	2017-09-15 22:27:55 +01:00

51 commits