xk6 load balancing - limit number of connection to one endpoint #170

Open
opened 2024-09-24 08:52:14 +00:00 by a-baranov · 2 comments

When we have "slow" / "bad" node in the list of endpoints (e.g. with broaken disk) this dramatically impact performance of the whole run. In theory round-robin would evenly distribute threads among endpoints but finished thereads again evenly distributed among all nodes (including slow nodes which has "hanging" connections.

In worst case in 2-3 minutes after beginning all threads are running for bad node. other nodes has 0 thereads.

Describe the solution you'd like

  • Use differennt load balansing algorythm - count number of active connections to each endpoint and distribute connections evenly.
  • Provide statistics for each endpoint to uncover slow nodes.
## Is your feature request related to a problem? Please describe. When we have "slow" / "bad" node in the list of endpoints (e.g. with broaken disk) this dramatically impact performance of the whole run. In theory round-robin would evenly distribute threads among endpoints but finished thereads again evenly distributed among all nodes (including slow nodes which has "hanging" connections. In worst case in 2-3 minutes after beginning all threads are running for bad node. other nodes has 0 thereads. ## Describe the solution you'd like - Use differennt load balansing algorythm - count number of active connections to each endpoint and distribute connections evenly. - Provide statistics for each endpoint to uncover slow nodes.
Owner

This is a part of scenario and should be implemented by a perfomance team cc @mmalygina @a.bogatyrev
Currently, we assign a network address to a worker during initialization, so it should already be fixed in default scenarios. Those slow thread skew results, however.

This is a part of scenario and should be implemented by a perfomance team cc @mmalygina @a.bogatyrev Currently, we assign a network address to a worker during [initialization](https://git.frostfs.info/TrueCloudLab/xk6-frostfs/src/commit/a47bf149d824aede7303b290f756267ca0ed49c7/scenarios/grpc.js#L30), so it should already be fixed in default scenarios. Those slow thread skew results, however.
Owner

Regarding statistics: let's research, we can attach tags and metadata to metric samples, need to investigate how will it look in report.

Regarding statistics: let's research, we can attach tags and metadata to metric samples, need to investigate how will it look in report.
Sign in to join this conversation.
No milestone
No project
No assignees
2 participants
Notifications
Due date
The due date is invalid or out of range. Please use the format "yyyy-mm-dd".

No due date set.

Dependencies

No dependencies set.

Reference: TrueCloudLab/xk6-frostfs#170
No description provided.