Not able to get transcript when sending audio in webm format #1073
Replies: 11 comments 6 replies
-
Thanks for asking your question. Please be sure to reply with as much detail as possible so the community can assist you efficiently.
-
Hey there! It looks like you haven't connected your GitHub account to your Deepgram account. You can do this at https://community.deepgram.com - being verified through this process will allow our team to help you in a much more streamlined fashion.
-
It looks like we're missing some important information to help debug your issue. Would you mind providing us with the following details in a reply?
-
@jkroll-deepgram @jpvajda any sort of assistance would be highly appreciated!
-
One more thing: the example in this article works perfectly fine with webm audio. Only the approach I am using above breaks, for some reason.
-
@jkroll-deepgram @davidvonthenen @jpvajda any kind of assistance would be appreciated!
-
Hi @deva-gopalani, you mention "I am not able to get any transcription". Can you clarify: are you getting empty transcript responses from Deepgram, or no responses at all? Do you have a Deepgram request ID you're able to share? If so, I can look into the audio to see what Deepgram is receiving. It's possible that either the audio you're sending to Deepgram is corrupt in some way, or that your audio parameters (e.g. encoding, sample rate) need to be adjusted for the different audio format.
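To tell "empty transcript" apart from "no response", it may help to log every message that comes back on the socket. Below is a rough sketch only. It assumes the streaming results arrive as JSON text with a `type` of `"Results"` and the transcript under `channel.alternatives[0].transcript`; the helper name `extract_transcript` and the sample messages are made up for illustration, so check the field names against what your connection actually returns.

```python
import json
from typing import Optional


def extract_transcript(message: str) -> Optional[str]:
    """Return the transcript from one JSON message, or None if the
    message is not a transcription result (assumed shape, see above)."""
    data = json.loads(message)
    if data.get("type") != "Results":
        # Some other message type (metadata, speech events, ...).
        return None
    alternatives = data.get("channel", {}).get("alternatives", [])
    if not alternatives:
        # A result message with no alternatives counts as empty.
        return ""
    return alternatives[0].get("transcript", "")


# Hypothetical messages, only to show the three cases.
empty = json.dumps({"type": "Results", "channel": {"alternatives": [{"transcript": ""}]}})
spoken = json.dumps({"type": "Results", "channel": {"alternatives": [{"transcript": "hello"}]}})
other = json.dumps({"type": "Metadata"})

for msg in (empty, spoken, other):
    print(repr(extract_transcript(msg)))
```

If the loop over incoming messages never runs at all, that is the "no responses" case; if it only ever yields `''`, Deepgram is answering but not finding speech in what it receives.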
-
I have the same issue. I am also streaming WebM from the browser to a backend and on to Deepgram. I can see that Deepgram is receiving the audio chunks, but it never responds with a transcript.
-
@jkroll-deepgram any update? This seems to be a genuine issue, as @olivernaaris is facing the same problem.
-
@olivernaaris let's keep this all on one thread please. We'll try to assist both you and @deva-gopalani here on GitHub Discussions. Please keep in mind this is Community Support and we do our best to provide assistance when we can. Pinging the Deepgram Team directly and posting multiple threads on the same topic isn't ideal and actually violates our Community Code of Conduct. So if you can follow these guidelines we'd appreciate it. Thank you!
-
Hey @jkroll-deepgram, @olivernaaris, and @deva-gopalani
-
I have a web app that collects users' audio from the microphone. After collecting the audio, I extract the base64 value of the raw audio and send it over a WebSocket connection to my backend. The code responsible for doing all this is -
Once the backend receives this audio, it decodes it and sends it to Deepgram for transcription. The issue is that I am not able to get any transcription. What could be the reason? The backend code is as follows -
The URL I use to create the WebSocket connection to Deepgram is as follows -
wss://api.deepgram.com/v1/listen?model=nova-2&filler_words=true&diarize=true&language=en&vad_events=true&encoding=opus&sample_rate=48000&channels=1&interim_results=true&utterance_end_ms=1000
I have specified the encoding as opus because the browser gives me the audio in webm format, and I have set the sample rate to 48000 because the audio is sampled at 48 kHz.
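For comparison, here is a small sketch of building the same URL with and without the `encoding` and `sample_rate` parameters. It rests on an assumption worth verifying against the Deepgram docs: that those two parameters describe raw, headerless audio, and that containerized audio such as WebM is detected from its own header, so they can be left out. The function name `build_listen_url` and the `containerized` flag are hypothetical.

```python
from urllib.parse import urlencode

BASE_URL = "wss://api.deepgram.com/v1/listen"


def build_listen_url(containerized: bool) -> str:
    """Build the streaming URL from the parameters used in this post."""
    params = {
        "model": "nova-2",
        "filler_words": "true",
        "diarize": "true",
        "language": "en",
        "vad_events": "true",
        "channels": "1",
        "interim_results": "true",
        "utterance_end_ms": "1000",
    }
    if not containerized:
        # Assumption: only raw (headerless) audio needs these two.
        params["encoding"] = "opus"
        params["sample_rate"] = "48000"
    return f"{BASE_URL}?{urlencode(params)}"


print(build_listen_url(containerized=True))
print(build_listen_url(containerized=False))
```

Trying the connection once with the containerized variant is a cheap way to rule the parameters in or out as the cause.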
What could be the reason this does not work? If you can recommend a better approach for this flow, please let me know.
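One check that might narrow this down, sketched under assumptions because the backend code is not shown in this post: confirm that the base64 payload decodes to bytes that actually begin a WebM stream. A WebM file starts with the EBML magic number `1A 45 DF A3`. If the browser side produced a data URL (for example via `FileReader.readAsDataURL`), the `data:...;base64,` prefix has to be stripped before decoding. The helper names below are hypothetical.

```python
import base64

# First four bytes of any EBML document, which is how a WebM stream begins.
EBML_MAGIC = bytes([0x1A, 0x45, 0xDF, 0xA3])


def decode_chunk(payload: str) -> bytes:
    """Decode one base64 message from the browser into raw bytes.

    Assumption: the payload may be a data URL; if so, everything up to
    and including the first comma is dropped before decoding.
    """
    if payload.startswith("data:") and "," in payload:
        payload = payload.split(",", 1)[1]
    return base64.b64decode(payload)


def starts_with_webm_header(chunk: bytes) -> bool:
    """True if the chunk begins with the EBML magic number."""
    return chunk[:4] == EBML_MAGIC


# Synthetic example: a fake "first chunk" carrying only the magic bytes.
sample = base64.b64encode(EBML_MAGIC + b"\x00\x01").decode()
first = decode_chunk("data:audio/webm;codecs=opus;base64," + sample)
print(starts_with_webm_header(first))
```

Only the first chunk of a recording is expected to pass this check; later chunks continue the same stream without a header, so the first bytes forwarded to Deepgram on a new connection should come from the start of the recording.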