LLM Wrapper

A lightweight OpenAI & Anthropic protocol aggregation wrapper, similar to LiteLLM but with a more minimal feature set.

Chinese documentation (中文文档)

Features

  • Multi-upstream Aggregation: Configure multiple upstream LLM APIs
  • Multi-protocol Support: Chat Completions, Responses, Anthropic Messages with automatic conversion
  • Model Aliases: Define local aliases for upstream models
  • Parameter Settings: Supports override (force) and default (fallback) modes
  • CLIProxyAPI Auth: Built-in OAuth management for Claude/Codex upstreams via CLIProxyAPI sidecar
  • Hot Config Reload: WebUI config changes take effect immediately without restart
  • YAML Configuration: Persistent config file support
  • Single-file WebUI: Management interface built with pure HTML + JS
  • API Key Masking: Auto-masks API keys in management endpoints
  • Auto Alias: One-click passthrough alias creation by clicking upstream model tags

Quick Start

Prerequisites

CLIProxyAPI is included as a git submodule, so clone with submodules:

git clone --recursive <repo-url>
# Or if already cloned:
git submodule update --init

Build

cargo build --release

Run

./target/release/llm-wrapper

CLI Arguments

llm-wrapper -c config.yaml -a 0.0.0.0:3000

-c, --config <PATH>   Config file path (default: config.yaml)
-a, --addr <ADDR>     Bind address (default: 0.0.0.0:3000)
-v, --version         Print version
-h, --help            Print help

CLI args take precedence over environment variables.

Docker Deployment

Run with Docker (Recommended):

docker run -d \
  --name llm-wrapper \
  -p 3000:3000 \
  -p 8317:8317 \
  -v $(pwd)/config:/app/config \
  -v llm-wrapper-data:/app/.llm-wrapper \
  -e BIND_ADDR=0.0.0.0:3000 \
  -e CONFIG_PATH=/app/config/config.yaml \
  sczhengyabin/llm-wrapper:latest

Ports:

  • 3000 - Main API and WebUI
  • 8317 - CLIProxyAPI (OAuth management for Claude/Codex)

Volumes:

  • /app/config - Configuration directory
  • /app/.llm-wrapper - Token cache and CLIProxyAPI data

With docker-compose:

# Start
docker-compose up -d

# View logs
docker-compose logs -f

# Stop
docker-compose down

Build image locally (optional):

git submodule update --init
docker build -t llm-wrapper:latest .

Environment Variables

  • CONFIG_PATH - Config file path (default: config.yaml)
  • BIND_ADDR - Bind address (default: 0.0.0.0:3000)
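
For example, running the release binary with environment variables instead of CLI flags (CLI arguments still take precedence when both are set):

CONFIG_PATH=./config.yaml BIND_ADDR=0.0.0.0:3000 ./target/release/llm-wrapper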

Configuration Example

# CLIProxyAPI endpoint (for OAuth upstreams, default: http://127.0.0.1:8317)
cli_proxy_api_endpoint: http://127.0.0.1:8317

# Upstream config (name as unique identifier)
upstreams:
  - name: qwen-test
    base_url: http://192.168.100.7:30002
    auth:
      type: api_key
      key: null  # or "your-api-key"
    enabled: true
    support_chat_completions: true   # Supports OpenAI chat/completions
    support_responses: false          # Supports OpenAI responses
    support_anthropic_messages: false # Supports Anthropic messages

  - name: claude
    base_url: https://api.anthropic.com
    auth:
      type: anthropic_oauth  # OAuth managed by CLIProxyAPI
    enabled: true
    support_chat_completions: false
    support_responses: false
    support_anthropic_messages: true

# Model alias config
aliases:
  - alias: qwen
    target_model: Qwen/Qwen3.5-122B-A10B-GPTQ-Int4
    upstream: qwen-test
    param_overrides:
      - key: temperature
        value: 0.7
        mode: default  # or override
      # extra_body configured separately
      - key: extra_body
        value:
          chat_template_kwargs:
            enable_thinking: false
        mode: default
    source: manual  # manually created alias

Auth Types

Type             Description
api_key          Static API key (default)
anthropic_oauth  OAuth for Anthropic, managed by CLIProxyAPI
codex_oauth      OAuth for Codex, managed by CLIProxyAPI
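
By analogy with the configuration example above, a codex_oauth upstream is declared the same way as the anthropic_oauth one, only with a different auth type (the base_url and support flags below are placeholders, not verified defaults):

upstreams:
  - name: codex
    base_url: https://example-codex-endpoint  # placeholder; set your actual upstream URL
    auth:
      type: codex_oauth  # OAuth managed by CLIProxyAPI
    enabled: true
    support_chat_completions: false
    support_responses: true           # illustrative; set according to your upstream
    support_anthropic_messages: false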

CLIProxyAPI Auth Flow

For anthropic_oauth / codex_oauth upstreams, login via WebUI or API:

# Start login
curl -X POST http://localhost:3000/api/cli-proxy-api/login/claude

# The response contains an auth URL; open it in a browser to authenticate
# Token is automatically cached and refreshed

Routing Rules

  • Alias Matching: The model parameter in requests only matches the alias field
  • Target Model is Not Routed: target_model is only used to replace the model name when forwarding, not for routing
  • Direct Upstream Call: If no alias match is found and model matches an enabled upstream name, use that upstream directly

This means:

  • With alias: my-model -> target_model: gpt-4, requests must use model: "my-model"
  • To accept model: "gpt-4" directly, create an alias gpt-4 -> target_model: gpt-4 (this is exactly what an auto alias does; see the sketch below)
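
In YAML terms, the two cases above look roughly like this (the upstream name is illustrative):

aliases:
  # Requests must use model: "my-model"; "gpt-4" is only the name forwarded upstream
  - alias: my-model
    target_model: gpt-4
    upstream: some-upstream
  # Passthrough alias so that model: "gpt-4" is also routable
  - alias: gpt-4
    target_model: gpt-4
    upstream: some-upstream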

API Endpoints

Config Management

  • GET /api/config - Get current config
  • PUT /api/config - Update config (saves to YAML file)
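
For example, fetching the current config (API keys are masked in the response, as noted in the feature list):

curl http://localhost:3000/api/config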

Upstream Model Management

  • GET /api/upstream-models - Get model list from all upstreams
  • POST /api/upstream-models/alias - Create auto alias for upstream model

Authentication

  • POST /api/auth/login/{upstream_name} - OAuth login for upstream
  • DELETE /api/auth/token/{upstream_name} - Clear OAuth token

CLIProxyAPI Auth

  • POST /api/cli-proxy-api/login/{upstream_name} - Start CLIProxyAPI login
  • POST /api/cli-proxy-api/complete-login/{upstream_name} - Complete login callback
  • GET /api/cli-proxy-api/login-stream/{upstream_name} - SSE login progress
  • GET /api/cli-proxy-api/status - Get CLIProxyAPI account status
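
For example, following login progress for an upstream named claude as an SSE stream (-N disables curl's output buffering):

curl -N http://localhost:3000/api/cli-proxy-api/login-stream/claude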

OpenAI Compatible API

  • POST /v1/chat/completions - Chat completions
  • POST /v1/responses - Responses API (upstream support required)
  • POST /v1/messages - Anthropic Messages API (upstream support required)
  • GET /v1/models - Model list (returns all aliases)

Debug Endpoints

  • GET /api/debug - Get latest debug info
  • DELETE /api/debug - Clear debug info
  • GET /api/debug/stream - SSE streaming debug info
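
For example, tailing debug info as an SSE stream:

curl -N http://localhost:3000/api/debug/stream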

WebUI

  • GET / - WebUI management interface

WebUI Features

Aggregated Model List

Displays all model aliases accessible via /v1/models at the top of the page, grouped by upstream.

Upstream Model Tags

  • Blue dashed border: Available model, click to create auto alias
  • Green solid border: Enabled auto alias, click to delete
  • Red background: Alias name conflict, cannot create

Auto Alias

Auto alias is a passthrough alias: alias = target_model = upstream model name, with no parameter overrides.
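
Expressed in the configuration format above, an auto alias for the Qwen model from the configuration example is conceptually equivalent to the following (a sketch; any bookkeeping fields the WebUI writes, such as source, are omitted):

aliases:
  - alias: Qwen/Qwen3.5-122B-A10B-GPTQ-Int4
    target_model: Qwen/Qwen3.5-122B-A10B-GPTQ-Int4
    upstream: qwen-test
    param_overrides: []  # passthrough: no overrides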

Create:

  • WebUI: Click model tags in upstream config cards
  • API: POST /api/upstream-models/alias

Delete:

  • WebUI: Click the enabled (green) model tag
  • Or delete the alias entry manually

Usage Examples

Chat Completions

curl -X POST http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen",
    "messages": [
      {"role": "user", "content": "Hello"}
    ]
  }'

List Models

curl http://localhost:3000/v1/models

Create Auto Alias

curl -X POST http://localhost:3000/api/upstream-models/alias \
  -H "Content-Type: application/json" \
  -d '{
    "upstream": "qwen-test",
    "model": "Qwen/Qwen3.5-122B-A10B-GPTQ-Int4"
  }'

Responses API

curl -X POST http://localhost:3000/v1/responses \
  -H "Content-Type: application/json" \
  -d '{
    "model": "qwen",
    "input": "Hello"
  }'

Note: Responses API requires upstream support. If the upstream only supports Chat Completions, the response format may not comply with the Responses API spec.

Anthropic Messages API

curl -X POST http://localhost:3000/v1/messages \
  -H "Content-Type: application/json" \
  -H "x-api-key: your-anthropic-api-key" \
  -d '{
    "model": "claude-sonnet-4",
    "max_tokens": 1024,
    "messages": [
      {"role": "user", "content": "Hello"}
    ]
  }'

Note: The Messages API requires the upstream to support the Anthropic protocol (e.g., the Anthropic API). If the upstream does not support it, requests will fail with a 404/405 error.

Debug Mode

Enable debug mode with the X-Debug-Mode: true header to get full request/response debug info:

curl -X POST http://localhost:3000/v1/chat/completions \
  -H "Content-Type: application/json" \
  -H "X-Debug-Mode: true" \
  -d '{
    "model": "qwen",
    "messages": [
      {"role": "user", "content": "Hello"}
    ]
  }'

Response includes:

  • client_request: Original request sent to the Wrapper
  • client_ip: Client source IP
  • client_url: Client request URL
  • endpoint: Called endpoint
  • upstream_url: Upstream request URL
  • upstream_request: Request sent to upstream (with param overrides applied)
  • upstream_response: Response from upstream
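
Put together, a debug record has roughly this shape (field names from the list above; values and exact nesting are illustrative, not a guaranteed schema):

{
  "client_request": { "model": "qwen", "messages": [...] },
  "client_ip": "192.168.1.10",
  "client_url": "http://localhost:3000/v1/chat/completions",
  "endpoint": "/v1/chat/completions",
  "upstream_url": "http://192.168.100.7:30002/v1/chat/completions",
  "upstream_request": { "model": "Qwen/Qwen3.5-122B-A10B-GPTQ-Int4", "temperature": 0.7 },
  "upstream_response": { "choices": [...] }
}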
