llm-d incubation
Incubating components of llm-d, a Kubernetes-native, high-performance distributed LLM inference framework.
Popular repositories

- llm-d-modelservice: Helm charts for deploying models with llm-d
- llm-d-fast-model-actuation: Kubernetes controllers for fast model actuation using vLLM sleep/wake and launcher-based model swapping
- batch-gateway: An llm-d implementation of the OpenAI batch inference API
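Since batch-gateway implements the OpenAI batch inference API, the shape of a batch submission may help orient readers: a batch input is a JSONL file in which each line is one self-describing request. The sketch below builds a single such line in plain Python. The model name, endpoint path, and helper name are illustrative assumptions, not taken from the batch-gateway repository.

```python
import json

def batch_request_line(custom_id: str, model: str, prompt: str) -> str:
    # One line of an OpenAI-style batch input file. The custom_id is a
    # caller-chosen identifier that the API echoes back in the output file
    # so results can be matched to requests. The endpoint path and model
    # name here are illustrative placeholders.
    request = {
        "custom_id": custom_id,
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": model,
            "messages": [{"role": "user", "content": prompt}],
        },
    }
    return json.dumps(request)

# A full batch file is just one such line per request, newline-separated.
line = batch_request_line("req-1", "example-model", "Hello")
print(line)
```

The client then uploads the JSONL file and creates a batch job referencing it; the gateway's job is to accept that same request shape in front of llm-d.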
Repositories
Showing 9 of 9 repositories
- ig-wva: Workload Variant Autoscaler, a service to compute the cost-optimal provisioning of heterogeneous accelerators for inference workloads with varying request latency objectives
- llm-d-fast-model-actuation: Kubernetes controllers for fast model actuation using vLLM sleep/wake and launcher-based model swapping
- llm-d-ci
- llm-d-async
- secure-inference
- hermes: A cluster configuration scanning and self-test generation tool for llm-d inference workloads