Skip to content

Unclear voice cloning instructions / documentation #9

@hybridherbst

Description

@hybridherbst

Hey, I tested Moss-TTS-Nano, especially the voice cloning part.

What I don't understand from the docs is

  • if the source audio transcription can be passed along somewhere (seems not?)
  • what the expected source audio length is – I tried 2s, 3s, 6s, 10s, 30s, and only the 3s-part had somewhat decent results, the others all produced garbage.
  • how to cache a voice profile for multiple generations.

Thanks!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions