Add harbor.rl + Tinker RL training example by benediktstroebl · Pull Request #28 · harbor-framework/harbor-cookbook

benediktstroebl · 2026-04-09T22:37:11Z

Adds a minimal RL training example that uses harbor.rl's step()/grade() interface with Tinker for the training loop.

train.py bridges harbor.rl environments to tinker's Env/EnvGroupBuilder abstractions
Downloads Terminal-Bench 2.0 tasks via the harbor registry
Adapts harbor.rl's BashTool and grade() as tinker-compatible tool and reward functions

benediktstroebl added 17 commits April 9, 2026 15:36

Add harbor.rl + Tinker RL training example

4006c5c

Make sandbox type a CLI arg, default to modal

2c185c0

Add -s alias for --sandbox

cb46537

Add sandbox provider API key to prerequisites

5e37ebb

Pin harbor[rl] to feature branch

ed2b600

Pin harbor[rl] to PR commit SHA

165f6fd

Drop [rl] extra from harbor dependency

b764b7d

Pin harbor dep back to branch name

d6d2d7e

Fix BashTool import path

0d46d95

Fix tinker-cookbook import paths

f7b654d

Fix download_dataset call: version in name, async

5959456

Make build_datasets async

bab6e54

Fix Task constructor call

eeabd12

Fix sandbox creation to use SandboxFactory.create_sandbox()

ebb40de

Add logging for tool calls and grading

867e3f8

Include output preview in bash ok log

cda188c

Remove manual dataset download from README

fe68c56

benediktstroebl merged commit 5a95710 into main Apr 11, 2026
2 checks passed

benediktstroebl deleted the feature/harbor-rl-example-5r9 branch April 11, 2026 01:15

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add harbor.rl + Tinker RL training example#28

Add harbor.rl + Tinker RL training example#28
benediktstroebl merged 17 commits intomainfrom
feature/harbor-rl-example-5r9

benediktstroebl commented Apr 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

benediktstroebl commented Apr 9, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant