DRIVERS-2868 Adjust getMore maxTimeMS Calculation for tailable awaitData Cursors #1749

prestonvasquez · 2025-01-29T22:50:36Z

DRIVERS-2868

Please complete the following before merging:

Update changelog.
Test changes in at least one language driver. (POC) GODRIVER-3444 Adjust getMore maxTimeMS Calculation for tailable awaitData Cursors mongo-go-driver#1925
Test these changes against all server versions and topologies (including standalone, replica set, sharded
clusters, and serverless).

source/client-side-operations-timeout/client-side-operations-timeout.md

vbabanin · 2025-02-08T00:36:43Z

source/client-side-operations-timeout/tests/tailable-awaitData.json

+          "name": "iterateOnce",
+          "object": "tailableCursor",
+          "arguments": {
+            "timeoutMS": 50


In timeoutMS is refreshed for getMore if maxAwaitTimeMS is set test case, timeoutMS is set when the cursor is created. I suggest we follow the same approach here and the other test since not all drivers allow setting a timeout for each next() call as an argument - it gets refreshed under the hood and i believe is supposed to behave the same in the drivers which additionally allow to override timeouMS per next() call.

For example, we could set timeoutMS to 150ms and maxAwaitTimeMS to 100ms like this:

{ "name": "createFindCursor", "object": "collection", "arguments": { "filter": {}, "cursorType": "tailableAwait", "batchSize": 1, "timeoutMS": 150 "maxAwaitTimeMS": 100 }, "saveResultAsEntity": "tailableCursor" },

Also, we configure a failpoint to block the find command for 60ms and call iterateOnce(). At this point, maxTimeMS on the command should be ≤ 90ms, which is ≤ maxAwaitTimeMS."

@vbabanin It's not idiomatic for the Go Driver to use the timeoutMS applied to constructing a cursor to each next() call. This would violate user expectations of context.Context. For example,

findCtx, findCancel := context.WithTimeout(context.Background(), 1*time.Second) defer findCancel() cur, _ := coll.Find(findCtx, bson.D{}) // User is applying a 1s timeout // User does not apply a timeoutMS to the context passed to Next, internally // applying a timeout here would violate expectations. for cursor.Next(context.Background()) { // ... }

Tests such as timeoutMS is refreshed for getMore if maxAwaitTimeMS is set are skipped in the Go Driver specifically because timeoutMS is not being applied at either the client or operation level. More than likely we will create a DRIVERS ticket to address this, see GODRIVER-3480 for more info. IIUC, we are likely to suggest putting the same timeout on both constructor and next. Would this work for Java?

operations: - name: createFindCursor object: *collection arguments: filter: {} cursorType: tailableAwait batchSize: 1 maxAwaitTimeMS: 50 timeoutMS: 100 saveResultAsEntity: &tailableCursor tailableCursor # Iterate twice to force a getMore. - name: iterateOnce object: *tailableCursor arguments: timeoutMS: 100 - name: iterateOnce object: *tailableCursor arguments: timeoutMS: 100

summary of discussion with @prestonvasquez:

Currently, the spec mandates that drivers reuse the same timeoutMS specified during cursor creation:

Change stream: Drivers MUST also apply the original timeoutMS value to each next call on the change stream."
Cursors: After the operation has executed, the original timeoutMS value MUST also be applied to each next call on the created cursor."

However, since Python and Go allow timeoutMS overrides per call, we should update the spec to clarify that overriding timeoutMS per next call is permitted, as it is idiomatic to their language. We should create a separate DRIVERS ticket to address that and update other unified tests for consistency to cover a use-case with override per getMore.

Some drivers might not currently expect timeoutMS: 100 argument in iterateOnce, that could lead to failed tests - this is the case for Java.

@vbabanin For now, I think we should update the unified spec tests to pass per the specifications (i.e. not applying a timeoutMS iteratively). Can you verify that the updated tests pass for Java?

I agree. I ran the tests and one of them applying the remaining timeoutMS if it's less than maxAwaitTimeMS fails with state should be: maxAwaitTimeMS must be less than timeoutMS during cursor creation as expected by the spec. I’ve posted a suggestion for handling this in a separate comment: #1749 (comment).

…imeout.md Co-authored-by: Viacheslav Babanin <[email protected]>

…ns into DRIVERS-2868

vbabanin · 2025-02-28T20:28:02Z

source/client-side-operations-timeout/tests/tailable-awaitData.json

+            "maxAwaitTimeMS": 100,
+            "timeoutMS": 50


By spec, maxAwaitTimeMS must be less than timeoutMS when passed to createFindCursor.

Suggested change

"maxAwaitTimeMS": 100,

"timeoutMS": 50

"maxAwaitTimeMS": 50,

"timeoutMS": 51

If we use this config, timeoutMS will likely be less than maxAwaitTimeMS by the time getMore executes— since there is just a 1ms difference.

Alternatively, we could configure a failpoint to delay the find command execution to lower exhaust timeout so that it becomes less then maxAwaitTimeMS, making the test more robust. This could prevent potential edge cases where fast-executing drivers issue a getMore at 50ms.

Example with the failpoint:

{ "description": "apply remaining timeoutMS if less than maxAwaitTimeMS", "operations": [ { "name": "failPoint", "object": "testRunner", "arguments": { "client": "failPointClient", "failPoint": { "configureFailPoint": "failCommand", "mode": { "times": 1 }, "data": { "failCommands": [ "find" ], "blockConnection": true, "blockTimeMS": 60 } } } }, { "name": "createFindCursor", "object": "collection", "arguments": { "filter": {}, "cursorType": "tailableAwait", "batchSize": 1, "maxAwaitTimeMS": 50, "timeoutMS": 100 }, "saveResultAsEntity": "tailableCursor" }, { "name": "iterateOnce", "object": "tailableCursor" }, { "name": "iterateOnce", "object": "tailableCursor" } ], "expectEvents": [ { "client": "client", "events": [ { "commandStartedEvent": { "commandName": "find", "databaseName": "test" } }, { "commandStartedEvent": { "commandName": "getMore", "databaseName": "test", "command": { "maxTimeMS": { "$$lte": 50 } } } } ] } ] }

@vbabanin It's unclear to me how blocking find will reduce the remaining timeout on a call to Next.

# Each next call starts a fresh timeout_ms. def next(cursor): timeout_ms = cursor['timeout_ms'] # ... while get_more(remaining_timeout_ms): pass # ...

Here is a gist with the updated JSON: https://gist.github.com/prestonvasquez/498ccc00c59a72b367de88272f0a777b

Does this pass in the Java driver?

@vbabanin I've updated the unified spec tests with the changes discussed offline.

vbabanin

LGTM!

ShaneHarvey · 2025-03-07T23:26:14Z

@vbabanin can you confirm this test passes in .net on both linux and windows?

prestonvasquez added 3 commits January 29, 2025 15:32

DRIVERS-2869 Adjust getMore maxTimeMS Calculation for Tailable Cursors

2299f03

DRIVESRS-2868 Remove extra tick from maxTimeMS

343abf4

DRIVERS-2868 Update changelog

eb3396b

prestonvasquez requested a review from a team as a code owner January 29, 2025 22:50

prestonvasquez requested a review from vbabanin January 29, 2025 22:50

prestonvasquez changed the title ~~DRIVERS-2869 Adjust getMore maxTimeMS Calculation for tailable awaitData Cursors~~ DRIVERS-2868 Adjust getMore maxTimeMS Calculation for tailable awaitData Cursors Jan 29, 2025

prestonvasquez added 5 commits January 30, 2025 09:43

DRIVERS-2868 Include min rtt in adjustment

5d4ea30

DRIVERS-2868 Add maxAwaitTimeMS lt remaining timeout test

b7083a7

DRIVERS-2868 Forbid serverless for tailable awaitData tests

3615285

DRIVERS-2868 Move tests to tailable awaitdata cursor

855fdf4

DRIVERS-2868 Include updates to change stream

4687176

vbabanin reviewed Feb 8, 2025

View reviewed changes

prestonvasquez and others added 2 commits February 11, 2025 09:08

DRIVERS-2868 Remove reference to DRIVERS-2884

b76d91b

Update source/client-side-operations-timeout/client-side-operations-t…

3cefdb4

…imeout.md Co-authored-by: Viacheslav Babanin <[email protected]>

prestonvasquez requested a review from vbabanin February 11, 2025 16:09

prestonvasquez added 5 commits February 26, 2025 15:07

Merge branch 'master' into DRIVERS-2868

95edf04

DRIVERS-2868 Update tests with no iterative timeout

7c167e2

Merge branch 'DRIVERS-2868' of github.com:prestonvasquez/specificatio…

46a19ef

…ns into DRIVERS-2868

DRIVERS-2868 Revert changes to wrong test

58f9a6d

DRIVERS-2868 Remove arguments

1546921

vbabanin reviewed Feb 28, 2025

View reviewed changes

DRIVERS-2868 Update unified spec tests

0940bfb

prestonvasquez requested a review from vbabanin March 7, 2025 02:31

DRIVERS-2868 Sync tests

b4a80a5

vbabanin approved these changes Mar 7, 2025

View reviewed changes

prestonvasquez requested a review from ShaneHarvey March 7, 2025 19:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

DRIVERS-2868 Adjust getMore maxTimeMS Calculation for tailable awaitData Cursors #1749

DRIVERS-2868 Adjust getMore maxTimeMS Calculation for tailable awaitData Cursors #1749

prestonvasquez commented Jan 29, 2025 •

edited

Loading

vbabanin Feb 8, 2025 •

edited

Loading

prestonvasquez Feb 10, 2025

vbabanin Feb 22, 2025 •

edited

Loading

prestonvasquez Feb 26, 2025

vbabanin Feb 28, 2025 •

edited

Loading

vbabanin Feb 28, 2025 •

edited

Loading

prestonvasquez Feb 28, 2025 •

edited

Loading

prestonvasquez Mar 7, 2025

vbabanin left a comment

ShaneHarvey commented Mar 7, 2025

DRIVERS-2868 Adjust getMore maxTimeMS Calculation for tailable awaitData Cursors #1749

Are you sure you want to change the base?

DRIVERS-2868 Adjust getMore maxTimeMS Calculation for tailable awaitData Cursors #1749

Conversation

prestonvasquez commented Jan 29, 2025 • edited Loading

vbabanin Feb 8, 2025 • edited Loading

Choose a reason for hiding this comment

prestonvasquez Feb 10, 2025

Choose a reason for hiding this comment

vbabanin Feb 22, 2025 • edited Loading

Choose a reason for hiding this comment

prestonvasquez Feb 26, 2025

Choose a reason for hiding this comment

vbabanin Feb 28, 2025 • edited Loading

Choose a reason for hiding this comment

vbabanin Feb 28, 2025 • edited Loading

Choose a reason for hiding this comment

prestonvasquez Feb 28, 2025 • edited Loading

Choose a reason for hiding this comment

prestonvasquez Mar 7, 2025

Choose a reason for hiding this comment

vbabanin left a comment

Choose a reason for hiding this comment

ShaneHarvey commented Mar 7, 2025

prestonvasquez commented Jan 29, 2025 •

edited

Loading

vbabanin Feb 8, 2025 •

edited

Loading

vbabanin Feb 22, 2025 •

edited

Loading

vbabanin Feb 28, 2025 •

edited

Loading

vbabanin Feb 28, 2025 •

edited

Loading

prestonvasquez Feb 28, 2025 •

edited

Loading