Skip to content

Latest commit

 

History

History
54 lines (38 loc) · 2.69 KB

File metadata and controls

54 lines (38 loc) · 2.69 KB

Community Projects

The following projects are built and maintained by the community. We appreciate all contributions! Note that these projects are not officially supported by the OmniVoice team.

If you have a project you'd like to add, please open a PR.


  • ComfyUI-OmniVoice-TTS — ComfyUI custom node for OmniVoice text-to-speech generation.

  • vLLM-Omni — A framework for efficient model inference with omni-modality model. Supports OmniVoice serving.

  • pyVideoTrans — Video translation tool with dubbing & subtitles. Supports OmniVoice as a TTS engine.

  • MLX-Audio — TTS, STT, and STS library built on Apple's MLX framework. Supports OmniVoice among other models for efficient speech processing on Apple Silicon.

  • OmniVoice-MLX — MLX inference backend and conversion/staging tools for running OmniVoice on Apple Silicon, with community model weights hosted under mlx-community.

  • RealtimeTTS — Converts text to speech in realtime. Supports OmniVoice as a TTS engine.

  • TTS-WebUI — Gradio web UI for multiple TTS models. Supports OmniVoice as one of its backends.

  • OmniVoice-Studio — Desktop application for OmniVoice voice generation.

  • omnivoice-server — OpenAI-compatible HTTP server for serving OmniVoice via /v1/audio/speech. Supports voice profiles for persistent cloning, sentence-level streaming, and optional Bearer auth.

  • omnivoice-rs — GPU-first Rust workspace for OmniVoice inference, parity validation, CLI execution, and an OpenAI-compatible HTTP server built with Candle.

  • omnivoice-trtllm — Deploy OmniVoice TTS model using TensorRT-LLM and Triton Inference Server on Modal, faster than PyTorch.

  • Auris — Offline audiobook reader for EPUB, PDF, and TXT with local OmniVoice TTS, character-aware voices, and per-book narrator control.

  • tts-audiobook-tool — Audiobook creation tool supporting a dozen different TTS models including OmniVoice, Qwen3-TTS, VibeVoice, etc., focused on high-quality output