Skip to content

Conversation

@ShaneHarvey
Copy link
Member

One of the goals of new client backpressure overload retry behavior is to withstand temporary load spikes on the server. The operation burst workload (PERF-7190) shows that 3 retries is insufficient at reducing the error rate during brief overload periods. This can be attributed to the short effective retry time window of 3 retries and initial delay 100ms which is only 100ms + 200ms + 400ms = 700ms. With 5 retries the time window increases to 100ms + 200ms + 400ms + 800ms + 1600ms = 3100ms.

Without any retries the workload encounters ~8000 overload errors. With 3 max retries the workload encounters ~500 errors. With 5 max retries the workload encounters ~5 errors.

The design has been updated to propose increasing the retry limit to 5 and initial delay from 50ms to 100ms.

@ShaneHarvey ShaneHarvey requested a review from a team as a code owner October 23, 2025 15:39
@ShaneHarvey ShaneHarvey requested review from blink1073 and caseyclements and removed request for a team October 23, 2025 15:39
Copy link
Member

@blink1073 blink1073 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM!

@ShaneHarvey ShaneHarvey merged commit 27785ae into mongodb:backpressure Oct 23, 2025
74 of 79 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants