AIO fails client connections after some time #32
Labels
No labels
good first issue
Infrastructure
blocked
bug
config
discussion
documentation
duplicate
enhancement
go
help wanted
internal
invalid
kludge
observability
perfomance
question
refactoring
wontfix
No milestone
No project
No assignees
2 participants
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference: TrueCloudLab/frostfs-aio#32
Loading…
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
There are reports that AIO services are unavailable after it runs for some time (days, weeks). AIO contains 14k containers with small objects. Clients can't connect to the storage node and receive
no healthy client
errors while using SDK Pool.Try to observe it by running longevity AIO stand and adding debug settings (enable pprof, etc.).
AIO connection after some timeto AIO fails client connections after some timeCould the problem be somehow related to the connection handling in SDK pool, though?
Didn't reproduce. AIO image was running for 3 months preloaded with thousand of containers and objects.
All new connections with frostfs-cli and SDK Pool test app are handled just fine.
There was an issue with environment restart that was found in this longrun, it was described and fixed in #34.
Closed.