Streaming - Automatic Language Detection (Speech-to-Text) #1039

peterkrueck · 2025-01-07T14:09:43Z

peterkrueck
Jan 7, 2025

Hello,

I want to implement a speech-to-text feature to fill out a search bar (like in the ChatGPT app). We are currently using Deepgram with streaming, and it works fine for English. Can we enable automatic language detection?

The docs here say it's only available for pre-recorded audio. Isn't that too slow for an app like mine?
https://developers.deepgram.com/docs/language-detection

Looking forward to hearing from you.


2025-01-07T14:09:45Z

deepgram-community[bot]
bot Jan 7, 2025

Thanks for asking your question. Please be sure to reply with as much detail as possible so the community can assist you efficiently.
Consider joining our Discord community for more opportunity to engage with your fellow Deepgram users. You can earn points which can be redeemed for cool stuff by being active in our communities!


2025-01-07T14:09:55Z

deepgram-community[bot]
bot Jan 7, 2025

Hey there! It looks like you haven't connected your GitHub account to your Deepgram account. You can do this at https://community.deepgram.com - being verified through this process will allow our team to help you in a much more streamlined fashion.


2025-01-07T14:09:57Z

deepgram-community[bot]
bot Jan 7, 2025

It looks like we're missing some important information to help debug your issue. Would you mind providing us with the following details in a reply?

A request ID that triggered your error or issue.


jkroll-deepgram · 2025-01-07T17:40:08Z

jkroll-deepgram
Jan 7, 2025
Collaborator

Hi @peterkrueck, you're correct that our language detection is only supported for pre-recorded audio.

One option is to record the user's speech, and then make a pre-recorded API request with language detection.

Another option is to require the user to set their language, so you know from the start which language to use in transcribing their streaming audio.

It sounds like your audio inputs will be brief. Deepgram can transcribe audio in far less than real time. Say your user speaks for 10 seconds, and you send that as a pre-recorded request - you'll receive a transcription back in a couple seconds or less. If that's unacceptably long for your application, you'll need to manage the language detection in some other way so that you can make use of streaming transcription.
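
For illustration, the two options above could be sketched roughly as follows. This is a minimal sketch, not an official client: it assumes the pre-recorded endpoint is `https://api.deepgram.com/v1/listen` with a `detect_language=true` query parameter, that the response carries `detected_language` on each channel, and that the streaming endpoint accepts a `language` query parameter. Check these against the current API reference before relying on them. The function names are hypothetical and exist only for this example.

```python
import json
import urllib.parse
import urllib.request
from typing import Optional, Tuple

# Assumed endpoints; verify against the Deepgram API reference.
PRERECORDED_URL = "https://api.deepgram.com/v1/listen"
STREAMING_URL = "wss://api.deepgram.com/v1/listen"


def build_detect_request(audio: bytes, api_key: str,
                         mimetype: str = "audio/wav") -> urllib.request.Request:
    """Option 1: build a pre-recorded request that asks for language detection."""
    query = urllib.parse.urlencode({"detect_language": "true"})
    return urllib.request.Request(
        f"{PRERECORDED_URL}?{query}",
        data=audio,
        headers={"Authorization": f"Token {api_key}", "Content-Type": mimetype},
        method="POST",
    )


def parse_detection(payload: dict) -> Tuple[Optional[str], str]:
    """Pull (detected_language, transcript) from the first channel.

    The field layout here is an assumption about the response shape.
    """
    channel = payload["results"]["channels"][0]
    transcript = channel["alternatives"][0]["transcript"]
    return channel.get("detected_language"), transcript


def transcribe_with_detection(audio: bytes, api_key: str) -> Tuple[Optional[str], str]:
    """Send the recorded clip and return (language, transcript). Needs network access."""
    request = build_detect_request(audio, api_key)
    with urllib.request.urlopen(request, timeout=30) as response:
        return parse_detection(json.load(response))


def streaming_url(language: str) -> str:
    """Option 2: streaming URL for a language the user has already chosen."""
    return f"{STREAMING_URL}?{urllib.parse.urlencode({'language': language})}"
```

One possible way to combine the two, if neither option fits on its own, is to run `transcribe_with_detection` on the user's first short utterance and then open later streaming sessions with `streaming_url(detected_language)`. Whether the added first-request latency is acceptable depends on the app.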
