🦀 ferris-compute-cuda

A remote-execution command line tool for CUDA programming. Write CUDA code on your local machine (macOS/Linux/Windows) and execute it instantly on a remote server equipped with NVIDIA GPUs.

🏗 Project Structure

This project is a Rust Workspace optimized for cross-platform GPU development.

crates/client/: The CLI tool used to send code, manage workspaces, and query telemetry.
crates/host/: The daemon featuring auto-discovery for MSVC, Token Interceptors, and Workspace Reconstruction.
crates/common/: Shared logic and gRPC Protobuf definitions.

📚 Living Lab Manual (Changelog)

We maintain a historical record of all architectural pivots:

🚀 Quick Start

1. Setup Authentication

Create a .env file in the root directory (or use environment variables) on both the Host and Client.

# .env
FERRIS_AUTH_TOKEN=your-secure-token
FERRIS_SERVER=http://<SERVER_IP>:50051

2. Start the Host (GPU Server)

The host performs a Pre-flight Check to locate Visual Studio (Windows) and NVCC. It requires a token to authorize incoming requests. If FERRIS_AUTH_TOKEN doesn't exist in environment variable (defined in .env file), you will have to provide a --token argument:

# Start the daemon
cargo run -p host -- --token "optional-override-token"

3. Use the Client (Local Machine)

The client supports subcommands for monitoring and execution.

🔧 Install the CLI (Optional)

For a shorter command, install the binary globally:

cargo install --path crates/client
# Now use `ferris-run` directly:
ferris-run status
ferris-run run ./samples/helloworld/matrix_addition.cu

Alternatively, use the built-in cargo aliases (no install required):

cargo ferris-status
cargo ferris-run ./samples/helloworld/matrix_addition.cu

If FERRIS_SERVER is set in your .env or environment, you can omit --server entirely.

📊 Check GPU Status

# With FERRIS_SERVER in .env (shortest form):
cargo ferris-status

# Or with explicit server:
cargo run -p client -- status --server "http://<SERVER_IP>:50051"

🔍 Discover hosts on the LAN

The discover subcommand scans the local network for hosts advertising the ferris-compute mDNS service (_ferris-compute._tcp). The scan runs for about 3 seconds, then prints each host’s base URL and hostname. The GPU host must be running so it can advertise; discovery does not require a token.

# From the workspace:
cargo run -p client -- discover

# If the client binary is on your PATH (e.g. after `cargo install --path crates/client`):
client discover

If nothing is found, ensure the host is up, both machines share a LAN (or loopback), and multicast/mDNS is not blocked by a firewall. On Windows, if mDNS does not resolve the local host, the client may still list http://127.0.0.1:50051 when something is listening on the default port (see 0029-windows-local-discovery-fallback.md).

Discovered hosts also appear in the interactive server prompt when you run run or status without --server, env, or config (TTY only).

🏃 Run CUDA Code

The client automatically detects if you are sending a single file or an entire project (multi-file support with directories). You have to supply entry point file as first file argument to the run subcommand:

# Single File (with FERRIS_SERVER in .env):
cargo ferris-run ./samples/helloworld/matrix_addition.cu

# Single File (explicit server):
cargo run -p client -- run --server "http://<SERVER_IP>:50051" ./samples/helloworld/matrix_addition.cu

# Multiple Files (supports .cu, .cuh, .cpp. .hpp, .h)
cargo run -p client -- run --server "http://<SERVER_IP>:50051" "<ENTRY_POINT_FILE>" "<INCLUDE_FILE_1>" "<INCLUDE_FILE_2>"

Note: The first path provided is treated as the primary entry point.

🧪 Run Tests

Run unit and integration tests:

cargo test -p client   # Client: CLI parsing, file gathering, ignore rules
cargo test -p host     # Host: auth, workspace prep, nvcc command building

Tests live under crates/client/tests/ and crates/host/tests/.

Pre-commit hook: Run ./scripts/setup-git-hooks.sh once to run client and host tests automatically before each commit.

Real example - multiple files:

cargo run -p client -- run --server "http://10.0.0.181:50051" ./samples/deep_project/main.cu ./samples/deep_project/core/wrapper.cuh ./samples/deep_project/core/math/operations.cuh ./samples/deep_project/core/math/constants.cuh
# Equivalent to:
cargo run -p client -- run --server "http://10.0.0.181:50051" ./samples/deep_project/main.cu ./samples/deep_project/

📂 Directory Structures

Run this command cargo run -p client -- run --server "http://<HOST_IP>:<PORT>" ./project/main.cu ./project/include/utils.cuh will sync local folder to remote workspace /scratch/<UUID>/

When syncing a folder, ferris-compute-cuda maintains the internal hierarchy:

Local:                     Remote NVCC Workspace on Host:
project/                   scratch/<UUID>/
 ├── main.cu        --->    ├── main.cu
 └── include/               └── include/
      └── utils.cuh              └── utils.cuh

🛠 Configuration Priority

Priority	Method	Token Example	Server Example
1 (Highest)	CLI Argument	`--token "xyz"`	`--server "http://..."`
2	Shell Variable	`export FERRIS_AUTH_TOKEN="xyz"`	`export FERRIS_SERVER="http://..."`
3	`.env` File	`FERRIS_AUTH_TOKEN=xyz`	`FERRIS_SERVER=http://...`
4 (Lowest)	Config File	—	`~/.ferris-compute/config.toml`

🛡 Features & Security

Smart Compilation: Automatically injects -rdc=true for multi-file projects to enable device-side linking.
Telemetry: Real-time reporting of remote GPU temperature, memory usage, and load.
Token Authentication: gRPC Interceptors reject unauthorized requests before spawning tasks.
Isolation: Every job runs in a unique UUID-based scratchpad; workspaces are purged immediately after execution.
Compiler Streaming: Full nvcc stdout and stderr are streamed back to the client for real-time debugging.
Workspace Size: Sync is limited to 50MB to ensure fast transfers and host stability.

Name		Name	Last commit message	Last commit date
Latest commit History 50 Commits
.cargo		.cargo
.cursor		.cursor
.githooks		.githooks
.github/workflows		.github/workflows
crates		crates
docs		docs
samples		samples
scripts		scripts
.gitignore		.gitignore
CLAUDE.md		CLAUDE.md
Cargo.toml		Cargo.toml
README.md		README.md
rustfmt.toml		rustfmt.toml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

🦀 ferris-compute-cuda

🏗 Project Structure

📚 Living Lab Manual (Changelog)

🚀 Quick Start

1. Setup Authentication

2. Start the Host (GPU Server)

3. Use the Client (Local Machine)

🔧 Install the CLI (Optional)

📊 Check GPU Status

🔍 Discover hosts on the LAN

🏃 Run CUDA Code

🧪 Run Tests

📂 Directory Structures

🛠 Configuration Priority

🛡 Features & Security

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

🦀 ferris-compute-cuda

🏗 Project Structure

📚 Living Lab Manual (Changelog)

🚀 Quick Start

1. Setup Authentication

2. Start the Host (GPU Server)

3. Use the Client (Local Machine)

🔧 Install the CLI (Optional)

📊 Check GPU Status

🔍 Discover hosts on the LAN

🏃 Run CUDA Code

🧪 Run Tests

📂 Directory Structures

🛠 Configuration Priority

🛡 Features & Security

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages