오픈소스로 만드는 커스텀 TTS
- Open Source TTS 다운로드
- fish-speech -> Code -> Dounload Zip
- unzip files
- fish-speech -> Code -> Dounload Zip
- Double click install_env.bat
- install miniconda, create virtual env, install packages
- If you want to enable compilation acceleration:
- Download and install the LLVM compiler-17.0.6
- Download and install the Microsoft Visual C++ Redistributable MSVC++ 14.40.33810.0 Download
- Download and install Visual Studio Community Edition to get MSVC++ build tools
- Visual Studio Download
- After installing Visual Studio Installer, download
Visual Studio Community 2022
. - As shown below, click the
Modify
button and find theDesktop development with C++
option to select and download.
- Download and install CUDA Toolkit 12.1
- Open
fish-speech-main
folder - Download checkpoints to
checkpoints
folder - Open terminal on
fish-speech-main
folder - run
fishenv\env\python.exe -m tools.run_webui --llama-checkpoint-path "checkpoints/fish-speech-1.5-250227-lora" --decoder-checkpoint-path "checkpoints/fish-speech-1.5/firefly-gan-vq-fsq-8x1024-21hz-generator.pth" --decoder-config-name firefly_gan_vq --compile
- When the
* Running on local URL: http://127.0.0.1:xxxx
log appears, you're ready. Go to that address using a web browser. - There are settings at the bottom of the screen.
- Upload or record reference audio to determine the voice you want to imitate.
- Enter the text you want to generate on the left and press the Generate button.
- Check the generated voice and if the quality is too low, press the Generate again button.
- If you installed following step 3 but still cannot use the GPU, run the following command.
fishenv\env\python.exe -m pip install torch==2.4.1 torchvision==0.19.1 torchaudio==2.4.1 --index-url https://download.pytorch.org/whl/cu121