Hello,

First, thank you for providing such excellent code.

I have three questions:

1. What is the exact role of the `greetings`? Is it intended to function as a prompt for zero-shot speaker adaptation?
2. The provided `greeting` file, in `jsonl` format, seems to be entirely in Chinese, and the video demo is also in Chinese. Does the model not support English conversations?
3. When decoding the `jsonl` file, there are many special tokens like `<|empty|>`. Could you clarify how the text and audio should be preprocessed to properly utilize the `greetings`?