Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Generation doesn't get finished #99

Open
ForxBase opened this issue Feb 15, 2025 · 4 comments
Open

Generation doesn't get finished #99

ForxBase opened this issue Feb 15, 2025 · 4 comments

Comments

@ForxBase
Copy link

Very often the whole generation doesn't get finished. The audio input is clear and short. The input text is clear, too:

Image

@mytait
Copy link

mytait commented Feb 16, 2025

audio generation is limited to 30 seconds. any longer and your text gets cut.

Also the motions are unstable (it says so in Gradio). If you leave the standard setting its the stablest. Changing the emotions params can produce emotions or introduce errors. dont use

@ForxBase
Copy link
Author

audio generation is limited to 30 seconds. any longer and your text gets cut.

Also the motions are unstable (it says so in Gradio). If you leave the standard setting its the stablest. Changing the emotions params can produce emotions or introduce errors. dont use

Thanks. Is getting longer generations something that will come in a later version, and more authentic cloning?

@ForxBase
Copy link
Author

audio generation is limited to 30 seconds. any longer and your text gets cut.

Also the motions are unstable (it says so in Gradio). If you leave the standard setting its the stablest. Changing the emotions params can produce emotions or introduce errors. dont use

So what happens is that even in generations under 30 seconds, the generation just has long pauses in between words, or after commas or punctuations.

@soljaragcnc
Copy link

soljaragcnc commented Feb 18, 2025

audio generation is limited to 30 seconds. any longer and your text gets cut.
Also the motions are unstable (it says so in Gradio). If you leave the standard setting its the stablest. Changing the emotions params can produce emotions or introduce errors. dont use

So what happens is that even in generations under 30 seconds, the generation just has long pauses in between words, or after commas or punctuations.

Yeah, I'm having the exact same issue. seems like the more times I try to use it, the worse it gets. at one point it would just say the first 4 words and the rest would be blank. Even with the Emotions set to unconditional, its doing this.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants