Verify data integrity in parallel with response streaming #1499
Reference: TrueCloudLab/frostfs-node#1499
Is your feature request related to a problem? Please describe.
Currently frostfs-node does not perform any data integrity verification when it streams objects from disk to the client. As far as I know, we had this feature previously and disabled it for performance reasons (iirc, memory usage and latency considerations). By streaming data directly from disk to the network, we avoid loading large objects into memory and minimize the time to first byte (TTFB).
Responsibility for bitrot protection is currently delegated to the client (app developer): discussion in private chat.
Describe the solution you'd like
Let's discuss whether it's possible to add integrity verification back:
This approach would introduce some CPU overhead for hashing but would be pretty cheap in terms of memory and latency. TTFB would not be affected at all.
This suggestion is inspired in part by structured replies in the NBD protocol, which were introduced to solve a similar problem there.
Describe alternatives you've considered
Keep things as they are: no bitrot protection on the node side, with data integrity verification delegated to the client.