Skip to content
Change the repository type filter

All

    Repositories list

    • Towards Open-source GPT-4o with Vision, Speech and Duplex Capabilities。
      Python
      MIT License
      1811.6k290Updated Jan 16, 2025Jan 16, 2025
    • mini-omni

      Public
      open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
      Python
      MIT License
      2723.2k324Updated Nov 5, 2024Nov 5, 2024