A web-based interface for Microsoft Edge's Text-to-Speech service that allows you to convert text to speech using various neural voices.
- Support for multiple file formats:
  - Plain text (.txt)
  - FictionBook (.fb2)
  - EPUB (.epub)
  - ZIP archives containing supported formats
- Rich voice selection with multiple languages and neural voices
- Adjustable speech parameters:
  - Speed control (-50% to +100%)
  - Pitch adjustment (-50Hz to +50Hz)
  - Volume control
- Multi-threaded processing for faster conversion
- Customizable text processing modes (V1, V2, V3) for handling periods and punctuation
- Option to merge multiple MP3 files
- Progress tracking with detailed status updates
- Settings persistence across sessions
- Modern and intuitive user interface
- Open the web interface
- Select your preferred voice from the dropdown menu
- Adjust speech parameters (speed, pitch, volume) as needed
- Either paste your text directly or upload supported file formats
- Click "Save to MP3" to start the conversion
- Choose a directory to save the generated audio files
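
The speed and pitch ranges listed above can be illustrated with a small sketch. It assumes the values are passed to the speech service as signed strings such as `+25%` and `-10Hz`, which is the form the `edge-tts` Python package accepts for its rate and pitch options. Whether this web interface uses the same representation internally is an assumption, and the helper names `clamp`, `format_rate`, and `format_pitch` are hypothetical.

```python
def clamp(value: int, low: int, high: int) -> int:
    """Keep value inside the inclusive range [low, high]."""
    return max(low, min(high, value))


def format_rate(percent: int) -> str:
    """Format a speed change, limited to -50%..+100%, as a signed string."""
    return f"{clamp(percent, -50, 100):+d}%"


def format_pitch(hertz: int) -> str:
    """Format a pitch change, limited to -50Hz..+50Hz, as a signed string."""
    return f"{clamp(hertz, -50, 50):+d}Hz"


if __name__ == "__main__":
    print(format_rate(25))    # within range, kept as-is
    print(format_rate(150))   # above the range, clamped to the upper limit
    print(format_pitch(-80))  # below the range, clamped to the lower limit
```

Clamping before formatting keeps out-of-range input from a text field or saved settings from reaching the service.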
Set the merge chunk size to 1, then process the output with Applio (https://docs.applio.org/applio/getting-started/installation).
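
As a rough illustration of the merge chunk size setting, the sketch below assumes it means the number of consecutive MP3 parts combined into each output file. That interpretation, the function name `group_for_merge`, and the file names are all assumptions made for the example, not details taken from this project.

```python
from typing import List


def group_for_merge(parts: List[str], chunk_size: int) -> List[List[str]]:
    """Split an ordered list of MP3 parts into groups of chunk_size files.

    Each inner list stands for the parts that would be merged into one
    output file. The last group may be shorter than chunk_size.
    """
    if chunk_size < 1:
        raise ValueError("chunk_size must be at least 1")
    return [parts[i:i + chunk_size] for i in range(0, len(parts), chunk_size)]


if __name__ == "__main__":
    # Hypothetical file names, for illustration only.
    parts = ["part_0001.mp3", "part_0002.mp3", "part_0003.mp3"]
    print(group_for_merge(parts, 1))  # every part stays in its own group
    print(group_for_merge(parts, 2))  # parts are paired, last group is shorter
```

Under this reading, a chunk size of 1 leaves every generated part as a separate file, which suits tools that process audio one file at a time.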
This project is a fork of the original EdgeTTS Web Interface repository.