🥝
$ docker buildx bake future

Organizations

@sbrk-org @SymfonyLive @SymfonyCon @the-fast-track @SymfonyWorld

Stars

AI

154 repositories

[EMNLP'23, ACL'24] To speed up LLM inference and enhance the model's perception of key information, compress the prompt and KV-Cache, achieving up to 20x compression with minimal performance loss.

Python 4,839 272 Updated Jan 26, 2025

[NeurIPS 2024] Official code for PuLID: Pure and Lightning ID Customization via Contrastive Alignment

Python 3,046 214 Updated Nov 27, 2024

Transcribe on your own!

TypeScript 1,644 94 Updated Jan 26, 2025

Interact with your SQL database, Natural Language to SQL using LLMs

Python 3,406 241 Updated Jul 24, 2024

A self-organizing file system with llama 3

Jupyter Notebook 5,087 320 Updated Oct 24, 2024

turnkey self-hosted offline transcription and diarization service with llm summary

Python 796 46 Updated Sep 25, 2024

Stateful load balancer custom-tailored for llama.cpp 🏓🦙

Rust 692 28 Updated Jan 20, 2025

LLM Inference on AWS Lambda

Python 10 Updated Jun 3, 2024

LLM training in simple, raw C/CUDA

Cuda 25,205 2,893 Updated Oct 2, 2024

Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 14,572 972 Updated Jan 23, 2025

LLM Analytics

TypeScript 638 25 Updated Oct 19, 2024

a text-based terminal client for Ollama

Python 1,266 79 Updated Jan 31, 2025

llama3 implementation one matrix multiplication at a time

Jupyter Notebook 14,084 1,155 Updated May 23, 2024

WARC + AI - Experimental Retrieval Augmented Generation Pipeline for Web Archive Collections.

Python 237 23 Updated Jan 9, 2025

AI-powered Jupyter Notebook — use local AI to generate and edit code cells, automatically fix errors, and chat with your data

JavaScript 1,083 56 Updated Jan 5, 2025

Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective…

Python 18,843 2,465 Updated Sep 19, 2024

Chat with AI large language models running natively in your browser. Enjoy private, server-free, seamless AI conversations.

TypeScript 601 89 Updated Jan 29, 2025

YaFSDP: Yet another Fully Sharded Data Parallel

Python 889 49 Updated Jan 29, 2025

Localized watermarking for AI-generated speech audio, with state-of-the-art robustness and a very fast detector

Python 516 63 Updated Oct 26, 2024

SWE-agent takes a GitHub issue and tries to automatically fix it, using GPT-4, or your LM of choice. It can also be employed for offensive cybersecurity or competitive coding challenges. [NeurIPS 2…

Python 14,315 1,457 Updated Jan 30, 2025

[ICLR 2025] From anything to mesh like human artists. Official impl. of "MeshAnything: Artist-Created Mesh Generation with Autoregressive Transformers"

Python 2,098 91 Updated Aug 5, 2024

Generate ideal question-answers for testing RAG

Python 126 3 Updated Dec 19, 2024

[ICML 2024] EvTexture: Event-driven Texture Enhancement for Video Super-Resolution

Python 1,082 72 Updated Sep 17, 2024

Bring portraits to life!

Python 13,783 1,477 Updated Feb 2, 2025

The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.

Python 65,580 7,007 Updated Feb 2, 2025

Run your own AI cluster at home with everyday devices 📱💻 🖥️⌚

Python 21,457 1,225 Updated Feb 1, 2025

PDF to Markdown with vision models

TypeScript 9,260 592 Updated Feb 3, 2025

Easy Docker setup for Stable Diffusion with user-friendly UI

Shell 6,960 1,174 Updated Aug 18, 2024

Gollama: Your offline conversational AI companion. An interactive tool for generating creative responses from various models, right in your terminal. Ideal for brainstorming, creative writing, or s…

Go 129 7 Updated Dec 23, 2024