Update protocol to match automerge-repo JS 1.0 #67
Conversation
Context: the automerge-repo JS implementation supports a request workflow for syncing a document we don't have. In this workflow the requesting peer sends a "request" message which is identical to the current sync message except tagged with a different message type. Responding peers can then reply with either a sync message or an "unavailable" message, which allows the receiver to tell the difference between a document the other end doesn't have and a document the other end has but which is empty.

Problem: in the repo loop we have no way of telling the difference between a request for a new document and announcing a document we have, because both situations are expressed using the `RepoEvent::NewDoc` event.

Solution: split `NewDoc` into a `RequestDoc` and a `NewDoc` event. This allows us to send request messages for `RequestDoc` and sync messages for `NewDoc`.
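A minimal sketch of that split, with illustrative placeholder types (these are not the crate's exact definitions), might look like this:

```rust
// Hedged sketch, not the crate's actual definitions: splitting the single
// "new document" event lets the repo loop pick the right wire message.
type DocumentId = String; // placeholder for the real DocumentId type

#[allow(dead_code)]
enum RepoEvent {
    /// A document we already have locally: announce it to peers with sync messages.
    NewDoc(DocumentId),
    /// A document we don't have: send "request" messages, which peers answer
    /// with either a sync message or "unavailable".
    RequestDoc(DocumentId),
}

fn main() {
    let event = RepoEvent::RequestDoc("doc-123".to_string());
    match event {
        RepoEvent::NewDoc(id) => println!("announce {id} with a sync message"),
        RepoEvent::RequestDoc(id) => println!("send a request message for {id}"),
    }
}
```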
Force-pushed from 77d267a to 782b2c3.
Force-pushed from 782b2c3 to c012dfb.
Context: when a peer requests a document from us we often don't want to just check our own storage; we also want to ask all the other peers we are connected to if they have the document. This is the case for e.g. a sync server which is relaying documents between two peers.

Problem: the current implementation just checks local storage.

Solution: when we receive a request, first send a request out to all the peers we are connected to, and only once they have all responded send a response to the requestor.
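A rough sketch of the bookkeeping this implies (illustrative names, not this crate's actual types):

```rust
// Hedged sketch: track which peers a forwarded request is still waiting on,
// and only answer the original requestor once every peer has replied
// (or one of them turned out to have the document).
use std::collections::HashSet;

type RepoId = String;

struct PendingRequest {
    /// Peers we forwarded the request to and have not heard back from yet.
    awaiting: HashSet<RepoId>,
    /// Peers that asked us for the document and still need an answer.
    requestors: HashSet<RepoId>,
}

impl PendingRequest {
    /// Record a reply from a forwarded peer. Returns true once every peer has
    /// answered, at which point we can tell the requestors "unavailable"
    /// (or start syncing, if anyone had the document).
    fn peer_replied(&mut self, peer: &RepoId) -> bool {
        self.awaiting.remove(peer);
        self.awaiting.is_empty()
    }
}

fn main() {
    let mut req = PendingRequest {
        awaiting: ["peer-a".to_string(), "peer-b".to_string()].into_iter().collect(),
        requestors: ["peer-c".to_string()].into_iter().collect(),
    };
    assert!(!req.peer_replied(&"peer-a".to_string()));
    assert!(req.peer_replied(&"peer-b".to_string()));
    println!("all peers replied; respond to {:?}", req.requestors);
}
```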
Force-pushed from c012dfb to 6aa2fd6.
LGTM, although I didn't have enough time yet to review in full detail...
Good to see many existing TODOs taken care of...
I think I have finally wrapped my head around the logic.
At a very high level I'm wondering: why separate the new request concept from the existing peer connection and document info? I find the back and forth between these hard to read, and I'm hoping the logic can be consolidated by updating the existing concepts, as opposed to introducing a new and separate one (which is not logically separate anyway, because of the back and forth).
A couple of things:
- The equivalent of `Self::fail_request` could just be called in the main loop on all requests that are complete. The benefit is that that logic is in one place, as opposed to being called at different points.
- I'm wondering if `Request` could not be replaced by a combination of an `awaiting_response_from: HashSet<RepoId>` on `DocumentInfo`, while `awaiting_our_response` could be inferred from the existence of `PeerConnection::PendingAuth` (perhaps with additional data to differentiate a peer who requested the document, like a boolean) when the doc state transitions out of the bootstrap state? (A rough sketch of what I mean is below, after this comment.)
- `Request::initiate_local` seems to me the same as having a document in the bootstrap state without any peers awaiting our response?
In all cases, the goal is to not have to read all the code to understand the logic; just looking at the enums and structs should be enough. Currently this is not possible because the request is separate from the document info, and the "state" of the algorithm appears to be shared between the state of the request and the state of the document info.
It would also be beneficial to layer the logic, as opposed to having what seem like special conditions, such as in the handling of `NetworkEvent::Request`, which skips the share auth phase if the document is not syncing.
Layering would look like:
- Always first get the share decisions
- If sharing is accepted, start syncing if the document is in `DocState::Sync`, otherwise request it from other peers.
- As sync messages or `Unavailable` events are handled, update the document info and/or peer connection and send outgoing messages.
In other words, always follow the same steps, but with different data.
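To make the suggestion concrete, here is a hedged sketch of what that consolidation might look like (field and variant names are illustrative, not the crate's real types):

```rust
// Hedged sketch of the suggested consolidation: keep the request bookkeeping
// on the document info itself so the algorithm's state is readable from the
// structs alone, rather than in a separate `Request` state machine.
use std::collections::HashSet;

type RepoId = String;

#[allow(dead_code)]
enum DocState {
    /// We don't have the document yet; these are the peers we asked and have
    /// not heard back from.
    Bootstrap { awaiting_response_from: HashSet<RepoId> },
    /// Normal symmetric sync with peers.
    Sync,
}

struct DocumentInfo {
    state: DocState,
    /// Peers who requested the document from us and still need an answer.
    awaiting_our_response: HashSet<RepoId>,
}

fn main() {
    let info = DocumentInfo {
        state: DocState::Bootstrap { awaiting_response_from: HashSet::new() },
        awaiting_our_response: HashSet::new(),
    };
    // Once nothing is awaited in either direction, the request is complete.
    let complete = matches!(
        &info.state,
        DocState::Bootstrap { awaiting_response_from } if awaiting_response_from.is_empty()
    ) && info.awaiting_our_response.is_empty();
    println!("request complete: {complete}");
}
```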
```rust
        request::Request::new(document_id.clone())
    });

    if info.state.is_bootstrapping() {
```
Perhaps move this to the `DocState::Bootstrap` match arm above?
| "responding to request with sync as we have the doc" | ||
| ); | ||
| // if we have this document then just start syncing | ||
| Self::enqueue_share_decisions( | 
Why don't we do this first in all cases?
This is a fairly chunky change, the primary reason for which is that the automerge-repo JS protocol introduces a request workflow for requesting documents you don't have, in contrast to the symmetric sync protocol we have been using so far. This requires the addition of two new message types: a "request" and an "unavailable" type. The requesting peer sends a request and the responding peer then replies with either a "sync" or an "unavailable" message. Some further complexity is introduced by the fact that each peer is expected to only return unavailable once it has queried both its storage and all the other peers it knows about.
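Roughly, the message shapes this describes might look like the following (a hedged sketch; variant and field names are assumptions, not the actual wire format):

```rust
// Hedged sketch of the three message kinds described above.
type DocumentId = String;
type SyncPayload = Vec<u8>; // placeholder for an encoded automerge sync message

#[allow(dead_code)]
enum Message {
    /// Ordinary symmetric sync traffic for a document both peers know about.
    Sync { document_id: DocumentId, data: SyncPayload },
    /// Same payload as `Sync`, but tagged so the receiver knows the sender
    /// does not have the document and is asking for it.
    Request { document_id: DocumentId, data: SyncPayload },
    /// Sent only after the responder has checked its own storage and asked
    /// all of its other connected peers.
    Unavailable { document_id: DocumentId },
}

fn main() {
    let msg = Message::Unavailable { document_id: "doc-123".to_string() };
    if let Message::Unavailable { document_id } = msg {
        println!("no one has {document_id}");
    }
}
```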
To implement this, there are the following major changes:

- `request_document` now returns an `Option` (a toy sketch of the calling convention is below). This is the primary advantage of the new workflow: you no longer need a timeout in the common case when no one has the document; instead everyone will return `Unavailable` and we can return `Ok(None)` from `request_document`.
- `RepoEvent::NewDoc` is split into `RepoEvent::NewDoc` and `RepoEvent::RequestDoc`, so that we can distinguish between the two cases and decide whether or not to send request messages to our peers.
- `Repo::new_document` now returns a future, which allows us to move the `DocumentInfo` creation into the repo loop.
- A `Request` state machine is tracked by the `Repo` to figure out when to send requests to other peers on behalf of incoming requests and when a request is complete.

This brings us up to interop with the current JS implementation, with the exception of ephemeral messages, which I have another patch for that I will submit once this is merged.
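A toy sketch of the `Ok(None)` calling convention mentioned above (the stub below is a stand-in, not the crate's real `request_document` signature):

```rust
// Hedged sketch: callers can now tell "nobody has this document" apart from
// an error or a timeout. The stub below is illustrative only.
type DocumentId = String;

#[derive(Debug)]
struct DocHandle(DocumentId);

#[derive(Debug)]
#[allow(dead_code)]
enum RepoError {
    Shutdown,
}

// Toy stand-in for requesting a document from the repo after this change.
fn request_document(id: DocumentId, some_peer_has_it: bool) -> Result<Option<DocHandle>, RepoError> {
    if some_peer_has_it {
        Ok(Some(DocHandle(id)))
    } else {
        Ok(None) // every peer answered "unavailable"
    }
}

fn main() {
    match request_document("doc-123".to_string(), false) {
        Ok(Some(doc)) => println!("found: {doc:?}"),
        Ok(None) => println!("all peers returned unavailable; no timeout needed"),
        Err(e) => println!("repo error: {e:?}"),
    }
}
```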
This will be a breaking change to the network protocol and would thus require a network-wide upgrade. It is possible that we could write a compatibility layer which we could maintain for a while. @issackelly is that something you would need, or is a big bang upgrade okay for you?