Mike/ait token streaming OpenAI sdk #3074
Conversation
coderabbitai commented:

Important: Review skipped. Auto reviews are disabled on this repository. Please check the settings in the CodeRabbit UI to enable them.
Force-pushed from a06f5d1 to 52b2e41
Force-pushed from 52b2e41 to 3411525
Force-pushed from 3411525 to b7116ee
Uses numbered steps in a tutorial-like format. Simplifies the code and descriptions of the event streaming model, restricting to the relevant context for this tutorial. Improves copy and code for readability.
Indicates use of the message-per-token pattern.
paddybyers
left a comment
lgtm
> </Code>
>
> <Aside data-type="note">
> This guide uses version 4.x of the OpenAI SDK. Some details of interacting with the OpenAI SDK may diverge from those given here if using a different major version.
Suggested change:
- This guide uses version 4.x of the OpenAI SDK. Some details of interacting with the OpenAI SDK may diverge from those given here if using a different major version.
+ This guide uses version 4.x of the OpenAI SDK. Some details of interacting with the OpenAI SDK may differ from those given here if using a different major version.
> </Code>
>
> <Aside data-type="note">
> This is only a representative example for a simple "text in, text out" use case and may not reflect the exact sequence of events that you observe from the OpenAI API. It also does not describe response generation errors or refusals. For complete details on all event types and their properties, see [OpenAI Streaming events](https://platform.openai.com/docs/api-reference/responses-streaming/response).
Suggested change:
- This is only a representative example for a simple "text in, text out" use case and may not reflect the exact sequence of events that you observe from the OpenAI API. It also does not describe response generation errors or refusals. For complete details on all event types and their properties, see [OpenAI Streaming events](https://platform.openai.com/docs/api-reference/responses-streaming/response).
+ This is only an illustrative example for a simple "text in, text out" use case and may not reflect the exact sequence of events that you observe from the OpenAI API. It also does not describe response generation errors or refusals. For complete details on all event types and their properties, see [OpenAI Streaming events](https://platform.openai.com/docs/api-reference/responses-streaming/response).
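For context, here's a minimal sketch of what consuming that event stream looks like with the 4.x SDK; the model name and prompt are placeholders, and the exact event sequence you observe may differ:

```typescript
import OpenAI from "openai";

const openai = new OpenAI(); // reads OPENAI_API_KEY from the environment

const stream = await openai.responses.create({
  model: "gpt-4o-mini", // illustrative model choice
  input: "Say hello",
  stream: true,
});

for await (const event of stream) {
  switch (event.type) {
    case "response.created":
      // The response has started; event.response.id identifies it.
      console.log("started:", event.response.id);
      break;
    case "response.output_text.delta":
      // One chunk of output text.
      process.stdout.write(event.delta);
      break;
    case "response.completed":
      // The response finished successfully.
      console.log("\ncompleted");
      break;
  }
}
```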
> - Publishes a `stop` event when the response completes
>
> <Aside data-type="note">
> Ably messages are published without `await` to maximize throughput. Ably maintains message ordering even without awaiting each publish. For more information, see [Publishing tokens](/docs/ai-transport/features/token-streaming/message-per-token#publishing).
How do we recommend checking for error responses from the publish?
There's a separate ticket to address this: https://ably.atlassian.net/browse/AIT-238
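Until that lands, one pattern that would work (a sketch with hypothetical names; it also assumes `responseId` travels under `extras.headers`) is to attach a `.catch()` to each fire-and-forget publish so failures still surface:

```typescript
import * as Ably from "ably";

const ably = new Ably.Realtime({ key: process.env.ABLY_API_KEY! });
const channel = ably.channels.get("ai:response"); // hypothetical channel name

// Publish without awaiting to keep token throughput high; Ably preserves
// per-connection publish order regardless. The .catch() turns a failed
// publish into a log line (or a retry/abort signal) instead of an
// unhandled promise rejection.
function publishToken(token: string, responseId: string): void {
  channel
    .publish({ name: "token", data: token, extras: { headers: { responseId } } })
    .catch((err) => {
      console.error(`publish failed for response ${responseId}:`, err);
    });
}
```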
> ### Publishing concurrent responses <a id="multiple-publishers"/>
>
> The implementation uses `responseId` in message extras to correlate tokens with their originating response. This enables multiple publishers to stream different responses concurrently on the same channel, with each subscriber correctly tracking all responses independently.
Suggested change:
- The implementation uses `responseId` in message extras to correlate tokens with their originating response. This enables multiple publishers to stream different responses concurrently on the same channel, with each subscriber correctly tracking all responses independently.
+ The implementation uses `responseId` in message `extras` to correlate tokens with their originating response. This enables multiple publishers to stream different responses concurrently on the same channel, with each subscriber correctly tracking all responses independently.

(Should we be quoting `extras`, `name` etc. with backticks?)
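To make the correlation concrete, here's a subscriber sketch keyed on `responseId` (assuming it sits under `extras.headers`; the channel name is hypothetical). Each in-flight response accumulates independently, so concurrent streams on one channel never interleave into each other:

```typescript
import * as Ably from "ably";

const ably = new Ably.Realtime({ key: process.env.ABLY_API_KEY! });
const channel = ably.channels.get("ai:response"); // hypothetical channel name

// Buffer each in-flight response separately, keyed by responseId.
const inFlight = new Map<string, string>();

await channel.subscribe((message) => {
  const responseId = message.extras?.headers?.responseId;
  if (typeof responseId !== "string") return;

  switch (message.name) {
    case "start":
      inFlight.set(responseId, "");
      break;
    case "token":
      inFlight.set(responseId, (inFlight.get(responseId) ?? "") + message.data);
      break;
    case "stop":
      console.log(`response ${responseId}:`, inFlight.get(responseId));
      inFlight.delete(responseId);
      break;
  }
});
```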
> This guide shows you how to stream AI responses from OpenAI's [Responses API](https://platform.openai.com/docs/api-reference/responses) over Ably using the [message-per-token pattern](/docs/ai-transport/features/token-streaming/message-per-token). Specifically, it implements the [explicit start/stop events approach](/docs/ai-transport/features/token-streaming/message-per-token#explicit-events), which publishes each response token as an individual message, along with explicit lifecycle events to signal when responses begin and end.
>
> Using Ably to distribute tokens from the OpenAI SDK enables you to broadcast AI responses to thousands of concurrent subscribers with reliable message delivery and ordering guarantees. This approach decouples your AI inference from client connections, enabling you to scale independently and handle reconnections gracefully while ensuring each client receives the complete response stream with all tokens delivered in order.
Suggested change:
- Using Ably to distribute tokens from the OpenAI SDK enables you to broadcast AI responses to thousands of concurrent subscribers with reliable message delivery and ordering guarantees. This approach decouples your AI inference from client connections, enabling you to scale independently and handle reconnections gracefully while ensuring each client receives the complete response stream with all tokens delivered in order.
+ Using Ably to distribute tokens from the OpenAI SDK enables you to broadcast AI responses to thousands of concurrent subscribers with reliable message delivery and ordering guarantees, ensuring that each client receives the complete response stream with all tokens delivered in order. This approach decouples your AI inference from client connections, enabling you to scale each independently and handle reconnections gracefully.
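To make the flow concrete, here's a rough end-to-end sketch of the explicit start/stop approach described above (channel name, model, and the `extras.headers` placement are assumptions, not necessarily the guide's actual code):

```typescript
import OpenAI from "openai";
import * as Ably from "ably";

const openai = new OpenAI();
const ably = new Ably.Realtime({ key: process.env.ABLY_API_KEY! });
const channel = ably.channels.get("ai:response"); // hypothetical channel name

// Relay one model response over Ably using explicit start/stop events
// around the per-token messages. Publishes are deliberately not awaited;
// see the note on throughput above.
async function relayResponse(prompt: string): Promise<void> {
  const stream = await openai.responses.create({
    model: "gpt-4o-mini", // illustrative model choice
    input: prompt,
    stream: true,
  });

  let responseId = "";
  for await (const event of stream) {
    switch (event.type) {
      case "response.created":
        responseId = event.response.id;
        void channel.publish({ name: "start", extras: { headers: { responseId } } });
        break;
      case "response.output_text.delta":
        void channel.publish({ name: "token", data: event.delta, extras: { headers: { responseId } } });
        break;
      case "response.completed":
        void channel.publish({ name: "stop", extras: { headers: { responseId } } });
        break;
    }
  }
}
```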
> - [`response.created`](https://platform.openai.com/docs/api-reference/responses-streaming/response/created): Signals the start of a response. Contains `response.id` to correlate subsequent events.
>
> - [`response.output_item.added`](https://platform.openai.com/docs/api-reference/responses-streaming/response/output_item/added): Indicates a new output item. If `item.type === "message"` the item contains model response text; other types may be specified, such as `"reasoning"` for internal reasoning tokens. The `item.id` can be used to filter which tokens to stream. The `output_index` indicates the position of this item in the response's output array.
> The `item.id` can be used to filter which tokens to stream.

I don't quite get this line. I think you're saying "The `item.id` will be present on all events relating to this item, so you can use it to filter which tokens are streamed to the clients", but there's probably a better way of saying it.
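Something like this is how I read it (a sketch; the event fields follow the OpenAI streaming reference, everything else, including the helper name and the event union type, is my assumption):

```typescript
import OpenAI from "openai";

// Remember the id of the first "message" output item; subsequent text
// delta events carry item_id, so deltas from other item types (e.g.
// "reasoning") can be filtered out before streaming to clients.
let messageItemId: string | null = null;

function textDeltaFor(event: OpenAI.Responses.ResponseStreamEvent): string | null {
  if (event.type === "response.output_item.added" && event.item.type === "message") {
    messageItemId = event.item.id;
  } else if (event.type === "response.output_text.delta" && event.item_id === messageItemId) {
    return event.delta; // a token belonging to the message item
  }
  return null; // not user-visible message text
}
```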
Description
This PR refactors the OpenAI SDK guide for the message-per-token streaming pattern.
Checklist