Skip to content

Keep speaker text together #106

Closed Answered by SandraRodgers
randwvb asked this question in General help
Discussion options

You must be logged in to vote

Hi @randwvb,

There are probably different ways to do this. The current format is building each line off of each utterance. If you look at the response object, you can see that there is an utterances array with objects for each utterance:

  "utterances": [
     {
       "start": 10.345,
       "end": 13.785,
       "confidence": 0.89388895,
       "channel": 0,
       "transcript": "Well, can't remembered I logged in. So I'm already in the room.",
       "words": [ ...
       ],
       "speaker": 0,
       "id": "ff42479c-decb-49fd-b0d2-9a05d519cba8"
     },

Each utterance is created based on natural pauses in the flow of the speaker's speech. They aren't determined by the speaker/diarizat…

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer selected by jpvajda
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
2 participants