The following projects are built and maintained by the community. We appreciate all contributions! Note that these projects are not officially supported by the OmniVoice team.
If you have a project you'd like to add, please open a PR.
-
ComfyUI-OmniVoice-TTS — ComfyUI custom node for OmniVoice text-to-speech generation.
-
vLLM-Omni — A framework for efficient model inference with omni-modality model. Supports OmniVoice serving.
-
pyVideoTrans — Video translation tool with dubbing & subtitles. Supports OmniVoice as a TTS engine.
-
MLX-Audio — TTS, STT, and STS library built on Apple's MLX framework. Supports OmniVoice among other models for efficient speech processing on Apple Silicon.
-
OmniVoice-MLX — MLX inference backend and conversion/staging tools for running OmniVoice on Apple Silicon, with community model weights hosted under mlx-community.
-
RealtimeTTS — Converts text to speech in realtime. Supports OmniVoice as a TTS engine.
-
TTS-WebUI — Gradio web UI for multiple TTS models. Supports OmniVoice as one of its backends.
-
OmniVoice-Studio — Desktop application for OmniVoice voice generation.
-
omnivoice-server — OpenAI-compatible HTTP server for serving OmniVoice via
/v1/audio/speech. Supports voice profiles for persistent cloning, sentence-level streaming, and optional Bearer auth. -
omnivoice-rs — GPU-first Rust workspace for OmniVoice inference, parity validation, CLI execution, and an OpenAI-compatible HTTP server built with Candle.
-
omnivoice-trtllm — Deploy OmniVoice TTS model using TensorRT-LLM and Triton Inference Server on Modal, faster than PyTorch.
-
Auris — Offline audiobook reader for EPUB, PDF, and TXT with local OmniVoice TTS, character-aware voices, and per-book narrator control.
-
tts-audiobook-tool — Audiobook creation tool supporting a dozen different TTS models including OmniVoice, Qwen3-TTS, VibeVoice, etc., focused on high-quality output