Replies: 2 comments 2 replies
-
|
Well over a year old and not even response. I suppose not. |
Beta Was this translation helpful? Give feedback.
-
|
Reasonable. lets refine "real time" by a the standards of a realistic use case: whisper-diarization is listening in on some multi-person audio call and generating a best effort "who said what" as the audio stream in or chunks roll in. Now, its understandable that the STT wont be instant. The "real time" request would be whisper-diarization take an audio stream, or some set audio chunk size in a loop and spits out data as soon as it can. It is very reasonable to divert the actually audio chunking, chunk queuing and feeding to be an external issue and not whisper-diarization responsibility. But, the chunks should be treated as an ongoing session. Does this help? |
Beta Was this translation helpful? Give feedback.
Uh oh!
There was an error while loading. Please reload this page.
-
Can you make the process (asr+diarization) real time? Many thanks!🙏
Beta Was this translation helpful? Give feedback.
All reactions