Request: Enhance audio-text synchronization for RESPONSE_AUDIO_TRANSCRIPT_DELTA and RESPONSE_AUDIO_DELTA events #25

Open
@opchronatron

Objective:
Improve the ability to align text and audio deltas for smoother playback and interruption handling.
Proposed solutions (in order of preference):

  • Add matching event_ids that link each text delta event to its corresponding audio delta event.
  • Alternatively, provide approximate audio frame numbers for sentence pauses or completions.
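
To illustrate the first option, here is a minimal sketch of how a client might pair the two streams if each delta carried a shared identifier. The `sync_id` field, the dict-shaped events, and the raw-bytes audio payload are all assumptions made for illustration; none of this is available in the current events, which is what this request is asking for.

```python
from collections import defaultdict

# Event type names taken from the issue title.
TRANSCRIPT_DELTA = "RESPONSE_AUDIO_TRANSCRIPT_DELTA"
AUDIO_DELTA = "RESPONSE_AUDIO_DELTA"


def pair_deltas(events):
    """Group transcript and audio deltas that share a (hypothetical) sync_id.

    Each event is assumed to be a dict with keys "type", "sync_id" and "delta".
    Returns {sync_id: {"text": str, "audio": bytes}}.
    """
    paired = defaultdict(lambda: {"text": "", "audio": b""})
    for event in events:
        slot = paired[event["sync_id"]]
        if event["type"] == TRANSCRIPT_DELTA:
            slot["text"] += event["delta"]    # accumulate text for this segment
        elif event["type"] == AUDIO_DELTA:
            slot["audio"] += event["delta"]   # accumulate audio for this segment
    return dict(paired)


events = [
    {"type": TRANSCRIPT_DELTA, "sync_id": "a", "delta": "Hello "},
    {"type": AUDIO_DELTA, "sync_id": "a", "delta": b"\x00\x01"},
    {"type": AUDIO_DELTA, "sync_id": "a", "delta": b"\x02\x03"},
    {"type": TRANSCRIPT_DELTA, "sync_id": "b", "delta": "world."},
    {"type": AUDIO_DELTA, "sync_id": "b", "delta": b"\x04\x05"},
]
segments = pair_deltas(events)
```

With a mapping like `segments`, the client would know which audio bytes belong to which piece of text and could stop playback at the end of a segment rather than mid-word.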

Benefits:
Lets the client finish the current sentence gracefully before cutting off buffered audio from the previous turn.
Improves overall user experience with more natural speech flow and interruptions.
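
As a rough sketch of the second option, this is how a client could pick a cut-off point on interruption if sentence-end frame numbers were provided. The `boundaries` list is a hypothetical field, and the function name and frame counts are made up for the example.

```python
import bisect


def frames_to_keep(boundaries, played_frames, total_frames):
    """Return the frame at which playback should stop after an interruption.

    boundaries:    sorted frame numbers where sentences end (hypothetical data).
    played_frames: frames already played when the interruption arrives.
    total_frames:  frames currently buffered for the turn.
    """
    # First sentence boundary at or after the current playback position.
    i = bisect.bisect_left(boundaries, played_frames)
    if i < len(boundaries) and boundaries[i] <= total_frames:
        return boundaries[i]   # finish the current sentence, then stop
    return played_frames       # no buffered boundary ahead: cut immediately


stop_at = frames_to_keep([4800, 12000, 20000], played_frames=9000, total_frames=24000)
```

Here playback would continue from frame 9000 to the sentence end at frame 12000 and drop the rest of the buffer. A real client would probably also cap how long it is willing to keep playing before cutting off.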

This would make life much easier :)

Labels: enhancement