Can I Fine-Tune the Diarization Model to Recognize a Specific Individual's Voice? #234
Unanswered
shivamtawari
asked this question in
Q&A
Replies: 1 comment 1 reply
-
|
It's doable but not through finetuning, you will use the intermediate embeddings generated from MSDD model and compare them to reference embeddings that you generated to identify which speaker is XYZ |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
Hi @MahmoudAshraf97
I'm curious to know if it's possible to customize the diarization output. Specifically, can we assign a custom name, such as 'Mr. XYZ', to dialogues spoken by a particular person, while the rest are labeled as 'Person 0', 'Person 1', etc.?
Thanks!
Beta Was this translation helpful? Give feedback.
All reactions