    Repositories list

    • vllm

      Public
      A high-throughput and memory-efficient inference and serving engine for LLMs
      Python
      12k0011 Updated Jan 2, 2026
    • dynamo

      Public
      A Datacenter Scale Distributed Inference Serving Framework
      Rust
      761000 Updated Dec 25, 2025
    • Reference implementations of MLPerf™ inference benchmarks
      Python
      600007 Updated Dec 24, 2025
    • Python
      0003 Updated Dec 19, 2025
    • Typer extension to enable pydantic support
      Python
      3000 Updated Dec 18, 2025
    • The official Python library for the OpenAI API
      Python
      4.5k000 Updated Dec 9, 2025
    • codex

      Public
      A comprehensive collection of integration examples for CentML. This repository serves as a resource hub for developers looking to seamlessly incorporate CentML's capabilities into their applications. Explore a variety of use cases and implementations to accelerate your integration process.
      Python
      1508 Updated Dec 2, 2025
    • Python
      1191 Updated Nov 26, 2025
    • Go
      1001 Updated Nov 20, 2025
    • Pull from private ECR repos... anywhere
      Go
      1110 Updated Oct 27, 2025
    • LoRAFusion: Efficient LoRA Fine-Tuning for LLMs
      Python
      22111 Updated Sep 23, 2025
    • A modular, extensible LLM inference benchmarking framework that supports multiple benchmarking frameworks and paradigms.
      Python
      11290 Updated Aug 27, 2025
    • Mist

      Public
      [EuroSys'25] Mist: Efficient Distributed Training of Large Language Models via Memory-Parallelism Co-Optimization
      Python
      52102 Updated Aug 6, 2025
    • MDX
      0020 Updated Jul 24, 2025
    • An open-source, code-first Python toolkit for building, evaluating, and deploying sophisticated AI agents with flexibility and control.
      Python
      2.7k000 Updated Jun 5, 2025
    • A Kubernetes Operator to create and manage Cloudflare Tunnels and DNS records for (HTTP/TCP/UDP*) Service Resources
      Go
      59000 Updated May 3, 2025
    • aisuite

      Public
      Simple, unified interface to multiple Generative AI providers
      Python
      1.4k008 Updated Apr 29, 2025
    • Sylva

      Public
      Boost fine-tuning performance with sparse embedded adapters and hierarchical approximate second-order information.
      Python
      0202 Updated Apr 29, 2025
    • A simple, configurable Kubernetes sidecar injector.
      Go
      1000 Updated Apr 17, 2025
    • An Agent that reviews the papers published on a given day and picks the one most aligned with our mission.
      TypeScript
      1000 Updated Mar 14, 2025
    • Benchmarking suite for popular AI APIs
      Python
      16003 Updated Feb 12, 2025
    • 🏙 Interactive performance profiling and debugging tool for PyTorch neural networks.
      Python
      116400 Updated Jan 21, 2025
    • Composable building blocks to build Llama Apps
      Python
      1.2k000 Updated Jan 2, 2025
    • 🔮 Execution time predictions for deep neural network training iterations across different GPUs.
      Python
      31430 Updated Dec 16, 2024
    • 🛠 VSCode plugin that provides a visual interface for CentML Tools
      TypeScript
      21520 Updated Dec 5, 2024
    • platform_docs

      Public archive
      CentML Platform Documentation
      MDX
      11000 Updated Nov 5, 2024
    • FastChat

      Public
      An open platform for training, serving, and evaluating large language models. Release repo for Vicuna and Chatbot Arena.
      Python
      4.8k000 Updated Sep 17, 2024
    • Distributed ML Training and Fine-Tuning on Kubernetes
      Go
      859001 Updated Aug 22, 2024
    • Lightweight and extensible LLM Inference serving benchmark tool written in Rust.
      Rust
      0400 Updated Apr 4, 2024
    • examples

      Public
      A set of examples around PyTorch in Vision, Text, Reinforcement Learning, etc.
      Python
      9.8k000 Updated Feb 28, 2024