    Repositories list

    • Self-contained DramaBox voice acting pipeline: VoiceNet taxonomy, multi-GPU prompt generation, TTS synthesis, and audio refinement
      Python
      Apache License 2.0
      0000 Updated Jun 1, 2026
    • Python
      0000 Updated Jun 1, 2026
    • laion.ai

      Public
      HTML
      MIT License
      4612353 Updated May 31, 2026
    • bud-e

      Public
      A general human-AI interaction platform.
      Dart
      Apache License 2.0
      101800 Updated May 27, 2026
    • School Bud-E is an intelligent and empathetic learning assistant designed to revolutionize the educational experience.
      Dart
      Apache License 2.0
      3100 Updated May 27, 2026
    • Open-weights voice acting pipeline combining zero-shot voice cloning with natural-language direction. Provide a reference voice (or generate one) and describe h…
      HTML
      Apache License 2.0
      01400 Updated May 25, 2026
    • Dream-E

      Public
      TypeScript
      0300 Updated May 22, 2026
    • Admin Bud-E is a lightweight, privacy-first control center for AI chat, speech-to-text, and text-to-speech. Manage providers, routing, and costs with a simple A…
      Python
      Apache License 2.0
      2100 Updated May 18, 2026
    • Benchmark analysis
      Python
      MIT License
      1000 Updated May 13, 2026
    • Jupyter Notebook
      42200 Updated May 12, 2026
    • BVD

      Public
      Python
      0100 Updated May 7, 2026
    • tunes

      Public
      Python
      0000 Updated May 7, 2026
    • Python
      0000 Updated May 7, 2026
    • Collection of three complementary voice taxonomies: VoiceNet (59 speech dimensions), EmoNet (40 emotion categories), VocalBurst (82 non-speech sounds)
      0300 Updated May 2, 2026
    • Building an agentic voice assistant for mobile & desktop devices with episodic, semantic & procedural memories
      Apache License 2.0
      0200 Updated Apr 12, 2026
    • Retrieval-augmented voice cloning and emotion conditioning data generation pipeline. Combines Echo TTS, ChatterboxVC, and Empathic Insight Voice+ to generate la…
      Python
      Other
      0300 Updated Apr 3, 2026
    • Multi-node scaling benchmarks for CLAP contrastive audio-language models on HPC clusters
      Python
      0000 Updated Mar 29, 2026
    • Scaled diffusion transformer for text-to-speech synthesis (DiT + T5Gemma2 conditioning, TorchTitan & Megatron backends, tested up to 1024 GPUs)
      Python
      02400 Updated Mar 29, 2026
    • JAX/TPU training code for EchoTTS with DACVAE latent codec
      Python
      0000 Updated Mar 29, 2026
    • High-level Python library for zero-shot voice conversion using Resemble AI's Chatterbox S3Gen model
      Python
      Apache License 2.0
      2100 Updated Mar 23, 2026
    • CLIP-like model evaluation
      Python
      MIT License
      103813306 Updated Mar 19, 2026
    • Python
      1010920 Updated Feb 28, 2026
    • Apache License 2.0
      0800 Updated Feb 13, 2026
    • AIW

      Public
      Alice in Wonderland code base for experiments and raw experiments data
      Python
      Apache License 2.0
      1113121 Updated Feb 4, 2026
    • Audio Dataset for training CLAP and other models
      Python
      59740215 Updated Jan 8, 2026
    • vocolino

      Public
      Apache License 2.0
      0000 Updated Dec 10, 2025
    • OpenCLIP fork with MaMMUT support
      Python
      Other
      2511 Updated Nov 10, 2025
    • MegaTron open-sci fork
      Python
      Other
      4k700 Updated Oct 29, 2025
    • A frontend that is compatible with the school-bud-e-backend.
      TypeScript
      MIT License
      102201 Updated Oct 23, 2025
    • Official repository for the NeurIPS 2025 paper “EmoNet-Face: An Expert-Annotated Benchmark for Synthetic Emotion Recognition.” Includes a 40-category emotion ta…
      Jupyter Notebook
      MIT License
      0400 Updated Oct 20, 2025