local echo (5/n): Create outbox messages on send #1472

PIG208 · 2025-04-10T23:19:07Z

This is stacked atop #1463.

No UI/user-facing change in this PR.

chrisbobbe

Thanks! Comments below.

chrisbobbe · 2025-04-16T04:03:46Z

lib/model/store.dart

-  /// Always equal to `connection.zulipFeatureLevel`
-  /// and `account.zulipFeatureLevel`.
-  int get zulipFeatureLevel => connection.zulipFeatureLevel!;
-
  String get zulipVersion => account.zulipVersion;


How about also moving zulipVersion along with zulipFeatureLevel, so they stay together?

chrisbobbe · 2025-04-16T05:48:26Z

lib/model/message_list.dart

@@ -626,6 +627,17 @@ class MessageListView with ChangeNotifier, _MessageSequence {
    }
  }

+  void handleOutboxMessage(OutboxMessage outboxMessage) {


How about a name like _addOutboxMessage? There are multiple tasks that could accurately be described as a message list "handling" an outbox message.

chrisbobbe · 2025-04-16T05:49:05Z

lib/model/message_list.dart

+  /// Remove the [outboxMessage] from the view.
+  ///
+  /// This is a no-op if the message is not found.
+  void removeOutboxMessageIfExists(OutboxMessage outboxMessage) {


How about just removeOutboxMessage? I think we can leave the "if exists" part as implied.

chrisbobbe · 2025-04-16T05:57:14Z

lib/model/message.dart

+/// ```
+///
+/// During its lifecycle, it is guaranteed that the outbox message is deleted
+/// as soon an message event with a matching [MessageEvent.localMessageId]


nit: "a message event"

chrisbobbe · 2025-04-16T06:18:14Z

lib/model/message.dart

+/// ```
+///                              ┌─────────────────────────────────────┐
+///                              │                  Event received,    │
+///              Send            │                  or we abandoned    │
+///           immediately.       │      200.        the queue.         ▼
+/// (create) ──────────────► sending ────────► sent ──────────────► (delete)
+///                              │4xx,          │                      ▲
+///                              │other error,  │Reached       User    │
+///                              │or reached    │time limit.   cancels.│
+///                              │time limit.   ▼                      │
+///                              └───────────► failed ─────────────────┘
+/// ```


Can we leave out the "reached time limit" parts in this first version? The "failed" state doesn't feel accurate for when that happens (especially when coming there from "sent"), and it's not part of the spec for #133, the more complicated outbox design, so we'd have to remove it when implementing that. It's also not necessary for the main idea of #1441:

The point is to better handle the case where you type out a message and hit send, and it fails because you don't have working network at that moment.

It looks like "reached time limit" comes from a parenthetical in the #1441 spec, with "perhaps":

Then if a message fails to send, we show on the local-echo placeholder an option that lets you recover it. (Similarly perhaps if it's been a while, like 10s, since trying to send and the request hasn't completed one way or another.)

If we want to keep "reached time limit" in this PR, how about adding a new node for it in the diagram, separate from "failed"? Then I think it'll be easier to reason clearly about some things later: what should the UI say for this state (not "failed" because we haven't been told that it failed); what should happen if the send-message event arrives when in this state.

It looks like the "User cancels" label comes from #133. In that context, it means the user decided not to press a "Retry" button to retry the message-send request, and they want us to just forget about the message-send attempts and their failures.

That label doesn't feel accurate in this context, where a retry button isn't part of the picture. I think the word from the #1441 spec is "recover"; how about we say "User recovers the draft" or similar:

Then if a message fails to send, we show on the local-echo placeholder an option that lets you recover it. […]

You might want to retry sending, or just copy the text to save elsewhere. To cover both options, it can take the text and just put it back in the compose box. […] The placeholder in the message list then disappears.

Is the "sent" state needed? What would happen if we just ignored a 200 response and removed "sent" from the diagram, and didn't consider the message to be sent until we got its event?

I think the "Send immediately" label is implied and can be removed.

Combining all those points, what do you think of this as an updated diagram:

/// We abandoned the queue. /// ┌──────────────────────────────────────┐ /// │ │ /// │ Event received. ▼ /// (create) ─► sending ──────────────────────────────► (delete) /// │ ▲ /// │ 4xx or other User restores │ /// │ error. the draft. │ /// └──────────────► failed ───────────────┘

Regarding time limit, started a discussion here: #mobile > handle failed send @ 💬

For the diagram update, started a discussion here: #mobile > #F1441 Handle retry state machine @ 💬

chrisbobbe · 2025-04-16T08:39:49Z

test/model/message_test.dart

+      check(connection.lastRequest).isA<http.Request>()
+        ..bodyFields['queue_id'].equals(store.queueId)
+        ..bodyFields['local_id'].equals('${outboxMessage.localMessageId}');


Is this connection.lastRequest needed, and if so, should the other similar tests have a check like it too?

We could move these checks to the helper, but I moved them from there to this test in a previous revision since just checking it once seems fine to me; this de-duplicates some helper code and most tests focus on other things.

chrisbobbe · 2025-04-16T08:42:31Z

test/model/message_test.dart

+        ..hidden.isTrue();
+    }));
+
+    test('while message is being sent, message event arrives, then the send fails', () => awaitFakeAsync((async) async {


Perhaps worth a comment (if there isn't one in the implementation) that this can actually happen: the message-send can succeed, but then the message-send request has a network issue that doesn't affect the event-poll request.

chrisbobbe · 2025-04-16T08:51:38Z

test/model/message_test.dart

+
+      // Handle the event after the message is sent but before the debounce
+      // timeout.  The outbox message should remain hidden since the send
+      // request was sucessful.


nit: "successful"

chrisbobbe · 2025-04-16T09:12:09Z

test/model/message_test.dart

+      check(store.outboxMessages).isEmpty();
+      check(outboxMessage)
+        ..state.equals(OutboxMessageLifecycle.sent)
+        ..hidden.isTrue();


A lot of these tests have checks on outboxMessage after checking that store.outboxMessages is empty. Do they all need checks like that? Anyway, this test and later ones do it without a comment explaining why:

// […] The outbox message should no // longer get updated because it is not in the store any more.

The second sentence of the comment before store.handleEvent is meant to address this check:

// Handle the event after the message is sent but before the debounce // timeout. The outbox message should remain hidden since the send // request was successful.

Perhaps it will be clearer to move this right before the check.

chrisbobbe · 2025-04-16T09:17:13Z

test/model/message_test.dart

+        ..hidden.isFalse();
+    });
+
+    test('send request pending until after kSendMessageTimeLimit, completes successfully, then message event arrives', () => awaitFakeAsync((async) async {


(I skipped reading the tests with kSendMessageTimeLimit in their names, pending an earlier comment that might lead to removing them)

PIG208 · 2025-04-16T21:14:19Z

Updated the PR! Thanks for the review. There are some pending questions mainly with regards to the state diagram and the send time limit. I introduced the new waitPeriodExpired state in this revision, with some changes to the states.

PIG208 · 2025-04-17T19:29:39Z

Will be working on a new revision to reorganize some of the implementation code with further state machine changes.

PIG208 · 2025-04-17T21:29:56Z

The PR has been updated to implement the state diagram discussed in chat: #mobile > #F1441 Handle retry state machine @ 💬

PIG208 · 2025-04-22T19:53:56Z

(pushed another update to make timestamps used in sendMessage testable)

The point of this helper is to replicate what a topic sent from the client will become, after being processed by the server. This important when trying to create a local copy of a stream message, whose topic can get translated when it's delivered by the server.

This will be the same as `DateTime.timestamp()` in live code (therefore the NFC). For testing, utcNow uses a clock instance that can be controlled by FakeAsync. We could have made call sites of `DateTime.now()` use it too, but those for now don't need it for testing.

While we do create outbox messages, there are in no way user-visible changes since the outbox messages don't end up in message list views. We create skeletons for helpers needed from message list view, but don't implement them yet, to make the diff smaller. For testing, similar to TypingNotifier.debugEnable, we add MessageStoreImpl.debugOutboxEnable for tests that do not intend to cover outbox messages. Some of the delays to fake responses added in tests are not necessary because the future of sendMessage is not completed immediately, but we still add them to keep the tests realistic.

chrisbobbe

Thanks! Here's a review of the first four commits:

0035700 api [nfc]: Add TopicName.interpretAsServer
d7cb9ec store [nfc]: Move zulip{FeatureLevel,Version} to PerAccountStoreBase
d09a52d test [nfc]: Generate timestamps
44af0d3 binding [nfc]: Add utcNow

and a partial review of the fifth, the main commit:

d61e252 message: Create an outbox message on send; manage its states

chrisbobbe · 2025-04-25T22:34:27Z

lib/api/model/model.dart

+  /// Convert this topic to match how it would appear on a message object from
+  /// the server, assuming the topic is originally for a send-message request.
+  ///
+  /// For a client that does not support empty topics,
+  /// a modern server (FL>=334) would convert "(no topic)" and empty topics to
+  /// `store.realmEmptyTopicDisplayName`.
+  ///
+  /// See also: https://zulip.com/api/send-message#parameter-topic
+  TopicName interpretAsServer({
+    required int zulipFeatureLevel,
+    required String? realmEmptyTopicDisplayName,
+  }) {
+    if (zulipFeatureLevel < 334) {
+      assert(_value.isNotEmpty);
+      return this;
+    }
+    if (_value == kNoTopicTopic || _value.isEmpty) {
+      // TODO(#1250): this assumes that the 'support_empty_topics'
+      //   client_capability is false; update this when we set it to true
+      return TopicName(realmEmptyTopicDisplayName!);
+    }
+    return TopicName(_value);
+  }


Let's add a // TODO(server-10) for…removing this method, I guess? Or at least simplifying.

Is whitespace-trimming also a step in any servers' interpretation of topics? Not that this method has to mirror that logic necessarily. But we'd probably want to explicitly expect (and assert) that that step has already been done when this method is called, otherwise the summary line wouldn't be quite accurate:

/// Convert this topic to match how it would appear on a message object from /// the server, assuming the topic is originally for a send-message request.

chrisbobbe · 2025-04-25T22:47:33Z

lib/model/message.dart

+const kLocalEchoDebounceDuration = Duration(milliseconds: 300);  // TODO(#1441) find the right values for this
+const kSendMessageRetryWaitPeriod = Duration(seconds: 10);  // TODO(#1441) find the right values for this


nit: "find the right value for this"

Also, kSendMessageRetryWaitPeriod doesn't sound like the right name to me. There's no automatic retry, and, as I mentioned at #1472 (comment), no "Retry" button. This wait-period logic will need to be cleanly removed as part of #133 (see my comment just before that one) which has both kinds of retry, and that should be easier if this thing doesn't also have the "retry" label :)

Maybe kSendMessageOfferRestoreWaitPeriod?

chrisbobbe · 2025-04-25T23:59:37Z

lib/model/message.dart

+            // Because either of the values can get updated, the actual topic
+            // can change, for example, between "(no topic)" and "general chat",
+            // or between different names of "general chat".  This should be
+            // uncommon during the lifespan of an outbox message.
+            //
+            // There's also an unavoidable race that has the same effect:
+            // an admin could change the name of "general chat"
+            // (i.e. the value of realmEmptyTopicDisplayName) concurrently with
+            // the user making the send request, so that the setting in effect
+            // by the time the request arrives is different from the setting the
+            // client last heard about.  The realm update events do not have
+            // information about this race for us to update the prediction
+            // correctly.
+            zulipFeatureLevel: zulipFeatureLevel,
+            realmEmptyTopicDisplayName: realmEmptyTopicDisplayName),


Here's a draft where I try to explain the bug a bit more (especially with the first line):

// Doing this interpretation just once on creating the outbox message // allows an uncommon bug, because either of these values can change. // During the outbox message's life, a predicted "(no topic)" topic // could become stale/wrong when zulipFeatureLevel changes, // or a predicted "general chat" topic could become stale/wrong // when realmEmptyTopicDisplayName changes. // // Shrug. The same effect is caused by an unavoidable race: // an admin could change the name of "general chat" // (i.e. the value of realmEmptyTopicDisplayName) // concurrently with the user making the send request, // so that the setting in effect by the time the request arrives // is different from the setting the client last heard about. zulipFeatureLevel: zulipFeatureLevel, realmEmptyTopicDisplayName: realmEmptyTopicDisplayName),

chrisbobbe · 2025-04-26T00:08:41Z

lib/model/message.dart

+    _outboxMessageDebounceTimers[localMessageId] = Timer(kLocalEchoDebounceDuration, () {
+      assert(outboxMessages.containsKey(localMessageId));
+      _outboxMessageDebounceTimers.remove(localMessageId);
+      _updateOutboxMessage(localMessageId, newState: OutboxMessageState.waiting);
+    });
+
+    _outboxMessageWaitPeriodTimers[localMessageId] = Timer(kSendMessageRetryWaitPeriod, () {
+      assert(outboxMessages.containsKey(localMessageId));
+      _outboxMessageWaitPeriodTimers.remove(localMessageId);
+      _updateOutboxMessage(localMessageId, newState: OutboxMessageState.waitPeriodExpired);
+    });


The asserts don't look right to me; wouldn't they throw if an outbox message was removed before the timer finished?, which can happen in this part of the state diagram:

/// Event received. /// Or we abandoned the queue. /// (any state) ────────────────────────────► (delete)

chrisbobbe · 2025-04-26T00:11:21Z

lib/model/message.dart

+      // `localMessageId` is not necessarily in the store. This is because
+      // message event can still arrive before the send request fails to
+      // networking issues.


nit: maybe "fails with networking issues" or "fails because of networking issues"

chrisbobbe · 2025-04-26T00:19:03Z

lib/model/message.dart

+    required OutboxMessageState newState,
+  }) {
+    final outboxMessage = outboxMessages[localMessageId];
+    if (outboxMessage == null || outboxMessage.state == newState) {


Is there a legitimate need for callers to pass a new state that's the same as the old state? Reading this, I'm wondering if it might be accidentally covering up some confusion in how the state machine is implemented.

PIG208 force-pushed the pr-echo-5 branch 15 times, most recently from eeb6ef2 to ac35860 Compare April 16, 2025 01:35

PIG208 requested a review from chrisbobbe April 16, 2025 01:35

PIG208 assigned chrisbobbe Apr 16, 2025

PIG208 added the maintainer review PR ready for review by Zulip maintainers label Apr 16, 2025

chrisbobbe reviewed Apr 16, 2025

View reviewed changes

PIG208 force-pushed the pr-echo-5 branch from ac35860 to 05ea106 Compare April 16, 2025 21:06

PIG208 removed the maintainer review PR ready for review by Zulip maintainers label Apr 17, 2025

PIG208 force-pushed the pr-echo-5 branch from 05ea106 to eed5bf0 Compare April 17, 2025 21:29

PIG208 force-pushed the pr-echo-5 branch 2 times, most recently from b96c5e9 to 13edb83 Compare April 17, 2025 22:56

PIG208 added the maintainer review PR ready for review by Zulip maintainers label Apr 18, 2025

PIG208 requested a review from chrisbobbe April 18, 2025 00:44

PIG208 force-pushed the pr-echo-5 branch from 13edb83 to ca25c0f Compare April 22, 2025 02:58

PIG208 force-pushed the pr-echo-5 branch from ca25c0f to c10d060 Compare April 22, 2025 19:53

PIG208 removed the maintainer review PR ready for review by Zulip maintainers label Apr 22, 2025

PIG208 force-pushed the pr-echo-5 branch from c10d060 to 1343d1a Compare April 22, 2025 20:58

PIG208 added the maintainer review PR ready for review by Zulip maintainers label Apr 22, 2025

PIG208 force-pushed the pr-echo-5 branch 3 times, most recently from 975bc28 to e6c9192 Compare April 23, 2025 22:39

PIG208 added 5 commits April 25, 2025 19:40

store [nfc]: Move zulip{FeatureLevel,Version} to PerAccountStoreBase

d7cb9ec

test [nfc]: Generate timestamps

d09a52d

PIG208 force-pushed the pr-echo-5 branch from e6c9192 to d61e252 Compare April 25, 2025 23:43

chrisbobbe reviewed Apr 26, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

local echo (5/n): Create outbox messages on send #1472

local echo (5/n): Create outbox messages on send #1472

PIG208 commented Apr 10, 2025 •

edited

Loading

chrisbobbe left a comment

chrisbobbe Apr 16, 2025

chrisbobbe Apr 16, 2025

chrisbobbe Apr 16, 2025

chrisbobbe Apr 16, 2025

chrisbobbe Apr 16, 2025

chrisbobbe Apr 16, 2025

chrisbobbe Apr 16, 2025

chrisbobbe Apr 16, 2025

chrisbobbe Apr 16, 2025

PIG208 Apr 16, 2025

PIG208 Apr 16, 2025

chrisbobbe Apr 16, 2025

PIG208 Apr 16, 2025

chrisbobbe Apr 16, 2025

chrisbobbe Apr 16, 2025

chrisbobbe Apr 16, 2025

PIG208 Apr 16, 2025

chrisbobbe Apr 16, 2025

PIG208 commented Apr 16, 2025 •

edited

Loading

PIG208 commented Apr 17, 2025

PIG208 commented Apr 17, 2025

PIG208 commented Apr 22, 2025 •

edited

Loading

chrisbobbe left a comment

chrisbobbe Apr 25, 2025

chrisbobbe Apr 25, 2025

chrisbobbe Apr 25, 2025

chrisbobbe Apr 25, 2025

chrisbobbe Apr 25, 2025

chrisbobbe Apr 26, 2025

chrisbobbe Apr 26, 2025

chrisbobbe Apr 26, 2025

		const kLocalEchoDebounceDuration = Duration(milliseconds: 300); // TODO(#1441) find the right values for this
		const kSendMessageRetryWaitPeriod = Duration(seconds: 10); // TODO(#1441) find the right values for this

local echo (5/n): Create outbox messages on send #1472

Are you sure you want to change the base?

local echo (5/n): Create outbox messages on send #1472

Conversation

PIG208 commented Apr 10, 2025 • edited Loading

chrisbobbe left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

PIG208 commented Apr 16, 2025 • edited Loading

PIG208 commented Apr 17, 2025

PIG208 commented Apr 17, 2025

PIG208 commented Apr 22, 2025 • edited Loading

chrisbobbe left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

PIG208 commented Apr 10, 2025 •

edited

Loading

PIG208 commented Apr 16, 2025 •

edited

Loading

PIG208 commented Apr 22, 2025 •

edited

Loading