morph: Fail if there is no events #1015
No reviewers
Labels
No labels
P0
P1
P2
P3
badger
frostfs-adm
frostfs-cli
frostfs-ir
frostfs-lens
frostfs-node
good first issue
triage
Infrastructure
blocked
bug
config
discussion
documentation
duplicate
enhancement
go
help wanted
internal
invalid
kludge
observability
perfomance
question
refactoring
wontfix
No milestone
No project
No assignees
2 participants
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: TrueCloudLab/frostfs-node#1015
Loading…
Reference in a new issue
No description provided.
Delete branch "dstepanov-yadro/frostfs-node:fix/morph_reconnect"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
There was the following situation:
Only logs from node 2 and metrics left.
I haven't found any code issues. Also I can't reproduce this bug on virtual or hardware environment. So I decided to do such fix: if node (or IR) doesn't get eny neo-go events for 20 min, it looks like connection fail, so node fails.
b114542e9f
tobb78b96830
We can't just stop, no events is an expected situation in case of split brain.
WebSocket support some form of healthchecks https://www.rfc-editor.org/rfc/rfc6455#section-5.5.2, can we try using sth like this here?
wrong fix
Pull request closed