-
Notifications
You must be signed in to change notification settings - Fork 27
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
First 3 words getting cut off #21
Comments
Some style actually might have issues. However some style is not. I think this might caused by kokoro model training dataset of certain speaker. Can u take a deep look at different speakers? |
First complete word from the sentence af: 6 history |
the first several words is muted or relatively small volume? Does all sentences like this |
I tried a hack in pub fn infer(
&self,
tokens: Vec<Vec<i64>>,
styles: Vec<Vec<f32>>,
) -> Result<ArrayBase<OwnedRepr<f32>, IxDyn>, Box<dyn std::error::Error>> {
let mut tokens = tokens;
let mut first_entry = tokens[0].clone();
for _ in 0..3 {
first_entry.insert(0, 30); // token 30 seems to be kinda silent..
}
tokens[0] = first_entry; @lucasjinreal idk if this is a viable solution? wfm, but might introduce a short pause when inferring multiple sentences.. |
@mrorigo This is a solution, however, I would suggestion append outside ort, since it is for universal inference only. The tokens can be appended outside as well. Would u consider make a PR for the feature? |
Try this sentence with any voice.
We have entered a moment in history where artificial intelligence stands poised to redefine our very conception of progress.
The text was updated successfully, but these errors were encountered: