-
Notifications
You must be signed in to change notification settings - Fork 10
Dev #4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Dev #4
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
- Implement main.py with comprehensive TTS API routes - Add routes for model discovery, language support, and speech synthesis - Include error handling and streaming response for audio generation - Update requirements.txt to remove version constraints - Modify Dockerfile to install Zonos repository directly
- Add new sampling parameters to TTSService: top_p, top_k, linear, confidence, and quadratic - Update generate method to support expanded sampling configuration - Create a comprehensive sampling_params dictionary for more granular audio generation control
- Update Dockerfile to install Zonos package directly - Modify main.py route for model conditioners and add debugging logs - Enhance models.py with default values and optional parameters - Refactor tts.py service with improved logging and model initialization - Add more comprehensive error handling and logging across components
- Update TTSRequest model to include sampling configuration parameters - Modify main.py to use dynamic sampling parameters from API request - Add default values for top_p, top_k, linear, confidence, and quadratic parameters - Make seed parameter non-optional with a default value
- Consolidate pip installations in Dockerfile for better layer caching - Add --no-cache-dir flag to reduce image size - Modify COPY command for app directory - Increase TTS generation max tokens from 30 to 60 seconds
…endency management
…ncy installation with caching
…d improve dependency management
…ependencies installation
…e dependency installation with uv package manager
…improve dependency management
…UN steps and cache cleanup
…ove dependency management
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Working GPU based API, run using docker
or directly in dev mode as: