-
open to work
- Nantes, France
- @my_kiwi
- @[email protected]
- in/GautierRomain
Highlights
AI
The reproduced code for Google's SoundStorm
OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340
first base model for full-duplex conversational audio
Claude Memory: Long-term memory for Claude
An open-source visual programming environment for battle-testing prompts to LLMs.
LLM-powered multiagent persona simulation for imagination enhancement and business insights.
LLM plugin providing access to models running on an Ollama server
Qwen2.5-Coder is the code version of Qwen2.5, the large language model series developed by Qwen team, Alibaba Cloud.
Document to Markdown OCR library with Llama 3.2 vision
Open-source platform for extracting structured data from documents using AI.
Qodo-Cover: An AI-Powered Tool for Automated Test Generation and Code Coverage Enhancement! 💻🤖🧪🐞
Everything about the SmolLM2 and SmolVLM family of models
Readable YouTube Transcripts using Gemini 1.5 Flash 8B
Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer(RVC), zero-shot Voice Cloning (E2, F5-TTS), YouTub…
🔥 Open Source Browser API for AI Agents & Apps. Steel Browser is a batteries-included browser instance that lets you automate the web without worrying about infrastructure.
Beautiful docs that write, translate, and optimize themselves
Local Speech to Text in PHP made easy thanks to Whisper.cpp and OpenAI
🪢 Open source LLM engineering platform: LLM Observability, metrics, evals, prompt management, playground, datasets. Integrates with LlamaIndex, Langchain, OpenAI SDK, LiteLLM, and more. 🍊YC W23
🪄 Create rich visualizations with AI
RAG Logger is an open-source logging tool designed specifically for Retrieval-Augmented Generation (RAG) applications. It serves as a lightweight, open-source alternative to LangSmith, focusing on …
Replace 'hub' with 'ingest' in any github url to get a prompt-friendly extract of a codebase
User-friendly AI Interface (Supports Ollama, OpenAI API, ...)
Convert ebooks to audiobooks with chapters and metadata using dynamic AI models and voice cloning. Supports 1,107+ languages!
An open-source RAG-based tool for chatting with your documents.