[Feature] Realtime API from OpenAI working #545

franpb14 · 2024-11-04T20:34:28Z

This is a very basic PR that we can use to iterate. Right now I'm using it with ActionCable in a rails app like this:

class OpenAiChannel < ApplicationCable::Channel
  def subscribed
    stream_from "open_ai_channel"
    @client = OpenAI::Client.new(access_token: ENV['OPENAI_API_KEY'])
    @client.real_time.on_message do |event|
      ActionCable.server.broadcast 'open_ai_channel', { message: event.data.force_encoding('UTF-8') }
    end
    @client.real_time.connect
  end

  def send_message(data)
    @client.real_time.send_event(data['event'])
  end
end

In the example data['event'] can be something like this:

{
  type: "response.create",
  response: {
    modalities: ["text", "audio"],
    instructions: "Please assist the user.",
  }
}

Maybe we could add more functions in order to facilitate event management. Something like I said in this comment.

Dependencies

faye-websocket
eventmachine

All Submissions:

Have you followed the guidelines in our Contributing document?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?
Have you added an explanation of what your changes do and why you'd like us to include them?

Closes #524

franpb14 · 2024-11-04T20:39:26Z

lib/openai/real_time.rb

+    end
+
+    def connect(model: "gpt-4o-realtime-preview-2024-10-01")
+      uri = "#{File.join(@client.websocket_uri_base, @client.api_version, 'realtime')}?model=#{model}"


probably this uri shouldn't be here but I was not sure where to put it

drnic · 2024-11-07T03:06:07Z

Is it out of scope to add a sample sinatra app into the repo with some stimulusjs that demos the client-side setup of the websocket/connection to feed/receive messages to the backend?

franpb14 · 2024-12-09T21:06:27Z

lib/openai/real_time.rb

+      EM.run do
+        @websocket = Faye::WebSocket::Client.new(uri, nil, headers: openai_realtime_headers)
+        @websocket.on :message, @on_message
+      end


Perhaps we should replace Eventmachine since it last release was 6 yeas ago. I think we could use async. I have tried and it works fine with something like this:

Async do endpoint = Async::HTTP::Endpoint.parse(uri, alpn_protocols: Async::HTTP::Protocol::HTTP11.names) Async::WebSocket::Client.connect(endpoint, headers: @headers) do |connection| @websocket = connection while (message = connection.read) @on_message end end end

franpb14 · 2024-12-09T21:10:21Z

@drnic Sorry the delay in the answer, I think it'd be a good thing to have, for me it was the hardest part. If we finally merge this I could do that app easily.

ngelx · 2025-03-03T05:13:20Z

What is the status of this PR? looks likes just what it is need to ease RealTime implementation. I can jump in and help to move this forward in case it needs some more work force.

In addition, I'm trying to understand the failure on CircleCI, but the logs seem to be private. Would it be possible to share the details of the failing job? Thank you!

alexrudall · 2025-03-03T07:43:15Z

Thanks @ngelx - Realtime is my top priority for v8.1 - would you be able to test this PR and see how useful you find it?

franpb14 · 2025-03-03T08:05:33Z

@alexrudall @ngelx since I did this PR, openAI has introduced the possibility of doing it with webRTC https://platform.openai.com/docs/guides/realtime-webrtc and it works pretty well, by using it you don't need to set up Faye or other dependency, I'm not sure if this PR makes sense and we should only include the endpoint to get the ephemeral key or if we should have both possibilities. What do you think?

ngelx · 2025-03-04T03:05:33Z

@alexrudall @ngelx since I did this PR, openAI has introduced the possibility of doing it with webRTC https://platform.openai.com/docs/guides/realtime-webrtc and it works pretty well, by using it you don't need to set up Faye or other dependency, I'm not sure if this PR makes sense and we should only include the endpoint to get the ephemeral key or if we should have both possibilities. What do you think?

Why not both?

As you mentioned, the WebRTC implementation only requires exposing the ephemeral key endpoint, leaving the rest to the client. Since it's part of the API, it makes sense to support it.

On the other hand, the WebSocket implementation requires more backend work but simplifies the client-side integration. It's also officially beta supported by other SDKs (e.g., openai-python realtime api).

To sum up, @franpb14 raises a valid point by suggesting WebRTC. Maybe @alexrudall already has plans to support both options?

alexrudall · 2025-03-04T07:18:34Z

@alexrudall @ngelx since I did this PR, openAI has introduced the possibility of doing it with webRTC https://platform.openai.com/docs/guides/realtime-webrtc and it works pretty well, by using it you don't need to set up Faye or other dependency, I'm not sure if this PR makes sense and we should only include the endpoint to get the ephemeral key or if we should have both possibilities. What do you think?

Why not both?

As you mentioned, the WebRTC implementation only requires exposing the ephemeral key endpoint, leaving the rest to the client. Since it's part of the API, it makes sense to support it.

On the other hand, the WebSocket implementation requires more backend work but simplifies the client-side integration. It's also officially beta supported by other SDKs (e.g., openai-python realtime api).

To sum up, @franpb14 raises a valid point by suggesting WebRTC. Maybe @alexrudall already has plans to support both options?

Agree I think. Although I need to understand it better. If there's one way that makes it easier and simpler for the user I prefer to support that and only that, even if it's more work in the gem, but in this case maybe both make sense. Have you seen this thread?

ngelx · 2025-03-20T11:51:39Z

Sorry for the delay on the reply. I did try this fork and did work, but in particular for my application, was too laggy. So i went for the WebRTC implementation.

I did a separate PR #582 but following the structure of this one so they can be easily merged. The code is quite simple. The heavy part is in the web client, but i guess that is what WebRTC is all about.

alexrudall · 2025-08-10T16:37:47Z

Just released @ngelx 's WebRTC PR, #582, in v8.2.0. Still open to the websockets side also, if there's demand. This PR would need to rebased and rewritten a bit to work with the existing Realtime class now.

franpb14 added 4 commits November 4, 2024 01:07

a first step connect and send event

59d973c

more reusable

cc5bdb4

defining on_message on its own function

5412f1a

header with the other headers

39060be

franpb14 commented Nov 4, 2024

View reviewed changes

franpb14 commented Dec 9, 2024

View reviewed changes

ngelx mentioned this pull request Mar 20, 2025

RealTime session create to retrieve Ephemeral Token #582

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

[Feature] Realtime API from OpenAI working #545

[Feature] Realtime API from OpenAI working #545

Uh oh!

franpb14 commented Nov 4, 2024 •

edited

Loading

Uh oh!

franpb14 Nov 4, 2024

Uh oh!

drnic commented Nov 7, 2024 •

edited

Loading

Uh oh!

franpb14 Dec 9, 2024

Uh oh!

franpb14 commented Dec 9, 2024

Uh oh!

ngelx commented Mar 3, 2025 •

edited

Loading

Uh oh!

alexrudall commented Mar 3, 2025

Uh oh!

franpb14 commented Mar 3, 2025 •

edited

Loading

Uh oh!

ngelx commented Mar 4, 2025

Uh oh!

alexrudall commented Mar 4, 2025

Uh oh!

ngelx commented Mar 20, 2025

Uh oh!

alexrudall commented Aug 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Uh oh!

[Feature] Realtime API from OpenAI working #545

Are you sure you want to change the base?

[Feature] Realtime API from OpenAI working #545

Uh oh!

Conversation

franpb14 commented Nov 4, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Dependencies

All Submissions:

Uh oh!

franpb14 Nov 4, 2024

Choose a reason for hiding this comment

Uh oh!

drnic commented Nov 7, 2024 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

franpb14 Dec 9, 2024

Choose a reason for hiding this comment

Uh oh!

franpb14 commented Dec 9, 2024

Uh oh!

ngelx commented Mar 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

alexrudall commented Mar 3, 2025

Uh oh!

franpb14 commented Mar 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ngelx commented Mar 4, 2025

Uh oh!

alexrudall commented Mar 4, 2025

Uh oh!

ngelx commented Mar 20, 2025

Uh oh!

alexrudall commented Aug 10, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

franpb14 commented Nov 4, 2024 •

edited

Loading

drnic commented Nov 7, 2024 •

edited

Loading

ngelx commented Mar 3, 2025 •

edited

Loading

franpb14 commented Mar 3, 2025 •

edited

Loading