
tooti 🦜


A decentralized AI inference protocol with an OpenAI-compatible gateway: coordination, routing, and economics over libp2p.

tooti connects model providers and application developers through a shared protocol layer:

  • operators run nodes and advertise available models
  • the network discovers healthy providers
  • the gateway routes requests to the best available node
  • clients use a familiar OpenAI-style API

Why tooti

AI apps need reliable inference, but today teams often face:

  • single-provider dependency
  • regional outages and latency spikes
  • limited control over routing and cost
  • difficult migration paths between model backends

tooti exists to make inference portable, resilient, and open:

  • portable for developers (standard API, minimal lock-in)
  • resilient for production (multi-node discovery and failover)
  • open for operators (bring your own backend: Ollama, vLLM, llama.cpp, and more)

Architecture

flowchart LR
    A[AI App / Client] -->|OpenAI-compatible HTTP| B[Tooti Gateway]
    B --> C[Router]
    C --> D[(Node Registry)]
    D --> E[Node Agent A]
    D --> F[Node Agent B]
    D --> G[Node Agent C]
    E --> H[Ollama / vLLM / llama.cpp]
    F --> I[Ollama / vLLM / llama.cpp]
    G --> J[Ollama / vLLM / llama.cpp]
    E <-. libp2p heartbeat + capabilities .-> D
    F <-. libp2p heartbeat + capabilities .-> D
    G <-. libp2p heartbeat + capabilities .-> D

What each part does

  • Node Agent: advertises model capabilities, health, and availability over libp2p
  • Registry + Router: tracks live nodes and selects where each request should go (see the sketch after this list)
  • Gateway: exposes /v1/models and /v1/chat/completions for client apps
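
To make the registry's role concrete, here is a sketch in Go of the kind of record a node agent might advertise and the router might index. The type and field names are illustrative assumptions, not tooti's actual wire format.

// Illustrative sketch only: tooti's real capability record and wire
// format are defined by the protocol, not reproduced here.
package registry

import "time"

// NodeRecord is a hypothetical shape for what a node agent advertises
// over libp2p and the registry tracks for routing decisions.
type NodeRecord struct {
	PeerID   string    // libp2p peer identity
	Models   []string  // advertised models, e.g. "llama3.2:latest"
	Backend  string    // "ollama", "vllm", "llama.cpp", ...
	Healthy  bool      // result of the last health probe
	LastSeen time.Time // time of the last libp2p heartbeat
}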

Network bootstrap

Set network.bootstrap_peers in your node config. An entry can be a /dnsaddr/ address (e.g. /dnsaddr/discover.tooti.network, which expands to peer multiaddrs via DNS TXT records) or a full multiaddr such as /ip4/.../p2p/....
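
A minimal excerpt of what this looks like in node.yaml. Only network.bootstrap_peers is documented above; the placeholder entries are illustrative, not real peers:

# node.yaml (excerpt)
network:
  bootstrap_peers:
    - /dnsaddr/discover.tooti.network           # expands peer multiaddrs from DNS TXT
    - /ip4/203.0.113.7/tcp/4001/p2p/<peer-id>   # or a full multiaddr, as above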

Quick start

1) Build

go build -o tooti ./cmd/tooti

2) Validate and run

# Validate config first
./tooti config-check -file ./node.yaml

# Start node
./tooti node start -file ./node.yaml

# Start gateway (same host or another host)
./tooti gateway start -file ./node.yaml

3) Test the API

# List models
curl -s http://127.0.0.1:8080/v1/models

# Stream chat completion
curl -N http://127.0.0.1:8080/v1/chat/completions \
  -H 'Content-Type: application/json' \
  -d '{"model":"llama3.2:latest","stream":true,"messages":[{"role":"user","content":"say hi"}]}'
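
Because the gateway speaks the OpenAI wire format, any HTTP client works. Below is a minimal non-streaming sketch in Go using only the standard library; the address and model name mirror the curl example above, and everything else is the standard OpenAI request shape. Existing OpenAI SDKs should also work if you point their base URL at http://127.0.0.1:8080/v1.

package main

import (
	"bytes"
	"encoding/json"
	"fmt"
	"io"
	"net/http"
)

func main() {
	// Request body in the OpenAI chat-completions shape (no "stream",
	// so the gateway returns a single JSON object).
	body, err := json.Marshal(map[string]any{
		"model": "llama3.2:latest",
		"messages": []map[string]string{
			{"role": "user", "content": "say hi"},
		},
	})
	if err != nil {
		panic(err)
	}

	resp, err := http.Post("http://127.0.0.1:8080/v1/chat/completions",
		"application/json", bytes.NewReader(body))
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	out, _ := io.ReadAll(resp.Body)
	fmt.Println(string(out)) // raw JSON completion from the gateway
}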

OpenClaw integration

  • Guide: docs/openclaw.md
  • Example config: docs/openclaw.json.example

Contributing

See CONTRIBUTING.md.

License

MIT
