Skip to content

haedalprogramming/customTTS

Folders and files

NameName
Last commit message
Last commit date

Latest commit

f3107fe · Mar 10, 2025

History

6 Commits
Mar 10, 2025
Mar 10, 2025

Repository files navigation

customTTS

오픈소스로 만드는 커스텀 TTS

Installation on Windows

  1. Open Source TTS 다운로드
  2. Double click install_env.bat
    • install miniconda, create virtual env, install packages
  3. If you want to enable compilation acceleration:
    1. Download and install the LLVM compiler-17.0.6
      • check the Add Path to Current User
    2. Download and install the Microsoft Visual C++ Redistributable MSVC++ 14.40.33810.0 Download
    3. Download and install Visual Studio Community Edition to get MSVC++ build tools
      • Visual Studio Download
      • After installing Visual Studio Installer, download Visual Studio Community 2022.
      • As shown below, click the Modify button and find the Desktop development with C++ option to select and download.
    4. Download and install CUDA Toolkit 12.1

Inference using Web UI

  1. Open fish-speech-main folder
  2. Download checkpoints to checkpoints folder
  3. Open terminal on fish-speech-main folder
  4. run
    fishenv\env\python.exe -m tools.run_webui  --llama-checkpoint-path "checkpoints/fish-speech-1.5-250227-lora"  --decoder-checkpoint-path "checkpoints/fish-speech-1.5/firefly-gan-vq-fsq-8x1024-21hz-generator.pth"  --decoder-config-name firefly_gan_vq --compile
  5. When the * Running on local URL: http://127.0.0.1:xxxx log appears, you're ready. Go to that address using a web browser.
  6. There are settings at the bottom of the screen.

  7. Upload or record reference audio to determine the voice you want to imitate.
  8. Enter the text you want to generate on the left and press the Generate button.
  9. Check the generated voice and if the quality is too low, press the Generate again button.

Troubleshooting

  • If you installed following step 3 but still cannot use the GPU, run the following command.
    fishenv\env\python.exe -m pip install torch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 --index-url https://download.pytorch.org/whl/cu121

References

About

오픈소스로 만드는 커스텀 TTS

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published