Deployment Modes

Overview

The system supports two deployment patterns, standalone and distributed. In both, the communication protocol is detected automatically from the execution environment.

Standalone Server Mode

Single process containing all services on one port (default: 6280). This mode combines:

MCP server accessible via /mcp and /sse endpoints
Web interface for job management
Embedded worker for document processing
API (tRPC over HTTP) for programmatic access

Use Cases

Development environments
Single-container deployments
Simple production setups
Local documentation indexing

Service Configuration

Services can be selectively enabled via AppServerConfig:

enableMcpServer: MCP protocol endpoint
enableWebInterface: Web UI and management API
enableWorker: Embedded job processing
enableApiServer: HTTP API for pipeline and data operations (served at /api)
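As a rough illustration, the four flags might be combined as follows. The field names come from the list above; the boolean types and the `enabledServices` helper are assumptions made for this sketch, not the project's actual definitions.

```typescript
// Sketch only: field names follow the list above; types and helper are assumed.
interface AppServerConfig {
  enableMcpServer: boolean;    // MCP protocol endpoint
  enableWebInterface: boolean; // Web UI and management API
  enableWorker: boolean;       // Embedded job processing
  enableApiServer: boolean;    // HTTP API served at /api
}

// Hypothetical helper: lists the services a given config would start.
function enabledServices(config: AppServerConfig): string[] {
  const services: string[] = [];
  if (config.enableMcpServer) services.push("mcp");
  if (config.enableWebInterface) services.push("web");
  if (config.enableWorker) services.push("worker");
  if (config.enableApiServer) services.push("api");
  return services;
}

// Standalone mode: every service in one process.
const standalone: AppServerConfig = {
  enableMcpServer: true,
  enableWebInterface: true,
  enableWorker: true,
  enableApiServer: true,
};

// Interfaces only, with no embedded worker.
const interfacesOnly: AppServerConfig = { ...standalone, enableWorker: false };

console.log(enabledServices(standalone));
console.log(enabledServices(interfacesOnly));
```

Turning off `enableWorker` while leaving the other flags on gives a process that serves the interfaces but does not process jobs itself.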

Distributed Mode

Separate coordinator and worker processes for scaling. The coordinator handles interfaces while workers process jobs.

Architecture

Coordinator: Runs MCP server, web interface, and API
Workers: Execute document processing jobs
Communication: Coordinator uses the API (tRPC over HTTP) to talk to workers

Use Cases

High-volume processing
Container orchestration (Kubernetes, Docker Swarm)
Horizontal scaling requirements
Resource isolation

Worker Management

Workers may expose a simple /health endpoint or a container-level healthcheck for monitoring. Coordinators communicate with workers via Pipeline RPC over the API (tRPC over HTTP).

Protocol Auto-Detection

The system automatically selects communication protocol based on execution environment:

Detection Logic

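// Illustrative wrapper; the function name is an assumption, not taken from the source.
function detectProtocol(): "stdio" | "http" {
  // Neither stdin nor stdout is a terminal: launched by an AI tool or CI/CD
  if (!process.stdin.isTTY && !process.stdout.isTTY) {
    return "stdio";
  }
  // At least one stream is attached to an interactive terminal
  return "http";
}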

Stdio Mode

Direct MCP communication via stdin/stdout
Used by VS Code, Claude Desktop, other AI tools
No HTTP server required
Minimal resource usage

HTTP Mode

Server-Sent Events transport for MCP
Full web interface available
API accessible at /api
Suitable for browser access

Manual Override

The protocol can be set explicitly with the --protocol stdio|http flag, which bypasses auto-detection.
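The interaction between the manual override and auto-detection can be sketched as a single pure function. The name `resolveProtocol` and its parameters are hypothetical; the TTY flags are passed in as arguments so that the logic can be exercised without a real terminal.

```typescript
type Protocol = "stdio" | "http";

// Hypothetical helper combining the --protocol override with TTY detection.
// In Node.js the two flags would come from Boolean(process.stdin.isTTY)
// and Boolean(process.stdout.isTTY).
function resolveProtocol(
  override: Protocol | undefined,
  stdinIsTTY: boolean,
  stdoutIsTTY: boolean,
): Protocol {
  // An explicit flag always wins over detection.
  if (override !== undefined) return override;
  // Same rule as the detection logic: stdio only when neither stream is a TTY.
  return !stdinIsTTY && !stdoutIsTTY ? "stdio" : "http";
}

console.log(resolveProtocol(undefined, false, false)); // piped by a tool
console.log(resolveProtocol(undefined, true, true));   // interactive terminal
console.log(resolveProtocol("http", false, false));    // override beats detection
```

Keeping the decision separate from `process` makes each combination of override and TTY state easy to check in isolation.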

Configuration

Deployment mode, ports, and embedding settings are resolved through the shared configuration loader. Sources are applied in this order, with later sources overriding earlier ones: defaults → docs-mcp.config.yaml or DOCS_MCP_CONFIG → legacy envs → generic env DOCS_MCP_<KEY> → CLI flags for the current run. For persistent changes, use YAML or env keys such as DOCS_MCP_PROTOCOL, DOCS_MCP_PORT, and DOCS_MCP_EMBEDDING_MODEL; for per-invocation changes, use CLI flags like --protocol, --port, --server-url, or --resume.
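The layering can be pictured as a left-to-right merge. The sketch below assumes that later sources override earlier ones, which the arrow order suggests. The `Settings` shape, the `resolveSettings` helper, and all sample values except the default port 6280 are invented for illustration and are not the loader's real API.

```typescript
// Hypothetical, simplified settings shape.
type Settings = { protocol?: string; port?: number; embeddingModel?: string };

// Merge layers left to right; a later layer overrides an earlier one,
// but an undefined value never erases a value that is already set.
function resolveSettings(...layers: Settings[]): Settings {
  const result: Settings = {};
  for (const layer of layers) {
    for (const [key, value] of Object.entries(layer)) {
      if (value !== undefined) {
        (result as Record<string, unknown>)[key] = value;
      }
    }
  }
  return result;
}

const defaults: Settings = { port: 6280 };
const yamlFile: Settings = { port: 7000, embeddingModel: "example-model" };
const envVars: Settings = { port: 8000 };         // e.g. DOCS_MCP_PORT=8000
const cliFlags: Settings = { protocol: "stdio" }; // e.g. --protocol stdio

const resolved = resolveSettings(defaults, yamlFile, envVars, cliFlags);
console.log(resolved);
```

Here the port ends up as 8000 because the environment layer is applied after the YAML file, while the protocol comes from the CLI layer, the only source that sets it.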

Job Recovery

Job recovery behavior depends on deployment mode:

Standalone Server

Embedded worker recovers pending jobs from database
Enabled by default for persistent job processing
Prevents job loss during server restarts

Distributed Mode

Workers handle their own job recovery
Coordinators do not recover jobs to avoid conflicts
Each worker maintains independent job state

CLI Commands

No job recovery to prevent conflicts
Immediate execution model
Safe for concurrent CLI usage
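The recovery rules above amount to a small decision table. The following sketch encodes it; the `RunContext` type and `shouldRecoverJobs` function are hypothetical names used only to make the rules concrete.

```typescript
// Hypothetical labels for the kinds of process described above.
type RunContext = "standalone" | "coordinator" | "worker" | "cli";

// Only processes that own a worker recover pending jobs.
function shouldRecoverJobs(context: RunContext): boolean {
  switch (context) {
    case "standalone": // embedded worker recovers pending jobs from the database
    case "worker":     // each worker handles its own recovery
      return true;
    case "coordinator": // skips recovery to avoid conflicts with workers
    case "cli":         // immediate execution, safe for concurrent use
      return false;
  }
}

for (const context of ["standalone", "coordinator", "worker", "cli"] as RunContext[]) {
  console.log(`${context}: recover=${shouldRecoverJobs(context)}`);
}
```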

Container Deployment

Single Container

FROM ghcr.io/arabold/docs-mcp-server:latest
EXPOSE 6280
CMD ["--protocol", "http", "--port", "6280"]

Multi-Container (Docker Compose)

services:
  coordinator:
    image: ghcr.io/arabold/docs-mcp-server:latest
    ports: ["6280:6280"]
    command: ["mcp", "--server-url", "http://worker:8080/api"]

  worker:
    image: ghcr.io/arabold/docs-mcp-server:latest
    ports: ["8080:8080"]
    command: ["worker", "--port", "8080"]

Load Balancing

Multiple Workers

Use a load balancer (or DNS) in front of multiple worker instances. The coordinator is configured with a single --server-url that points to the balancer.

Health Checks

Expose a lightweight /health endpoint or container healthcheck for load balancers and monitoring.
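One possible shape for such an endpoint is sketched below as a pure function, so it carries no dependency on a particular HTTP framework. The `healthReply` name, the JSON payload, and the 404 fallback are assumptions for illustration; the source does not specify the response format.

```typescript
interface HealthReply {
  status: number;
  body: string;
}

// Hypothetical handler logic: answers GET /health with a small JSON payload
// and everything else with 404.
function healthReply(method: string, path: string, uptimeSeconds: number): HealthReply {
  if (method === "GET" && path === "/health") {
    return { status: 200, body: JSON.stringify({ status: "ok", uptimeSeconds }) };
  }
  return { status: 404, body: JSON.stringify({ error: "not found" }) };
}

console.log(healthReply("GET", "/health", 12));
console.log(healthReply("GET", "/other", 12));
```

In Node.js this could be wired into `http.createServer` by passing `req.method` and `req.url` to the function and writing the returned status and body to the response.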

Scaling Strategies

Horizontal: Add more worker containers
Vertical: Increase worker resource allocation
Hybrid: Combine both strategies based on workload
