Add harbor.rl + Tinker RL training example #28

Merged
benediktstroebl merged 17 commits into main from feature/harbor-rl-example-5r9 on Apr 11, 2026

@benediktstroebl
Collaborator

Adds a minimal RL training example that uses harbor.rl's step()/grade() interface with Tinker for the training loop.

  • train.py bridges harbor.rl environments to tinker's Env/EnvGroupBuilder abstractions
  • Downloads Terminal-Bench 2.0 tasks via the harbor registry
  • Adapts harbor.rl's BashTool and grade() as tinker-compatible tool and reward functions
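
The bridging idea in the bullets above can be pictured with a small, self-contained sketch. It does not import harbor or tinker and is not the code in train.py: `ShellTaskEnv`, `BridgedEnv`, `StepResult`, `build_group`, `ToyEnv`, and the `"submit"` sentinel are all hypothetical names chosen for illustration. It assumes only what the description states, namely that the environment exposes `step()` and `grade()`, and it treats `grade()` as an end-of-episode reward.

```python
from dataclasses import dataclass, field
from typing import Callable, List, Protocol


class ShellTaskEnv(Protocol):
    """Hypothetical stand-in for an environment exposing step()/grade()."""

    def step(self, command: str) -> str: ...
    def grade(self) -> float: ...


@dataclass
class StepResult:
    """What the training loop sees after each action (illustrative shape)."""

    observation: str
    reward: float
    done: bool


@dataclass
class BridgedEnv:
    """Wraps a step()/grade() environment as a turn-based RL environment."""

    inner: ShellTaskEnv
    max_turns: int = 8
    turns: int = 0

    def step(self, action: str) -> StepResult:
        self.turns += 1
        submitted = action.strip() == "submit"  # assumed end-of-episode signal
        # Run the command as a tool call unless the policy chose to submit.
        observation = "" if submitted else self.inner.step(action)
        done = submitted or self.turns >= self.max_turns
        # The reward is sparse: grade() is called only when the episode ends.
        reward = self.inner.grade() if done else 0.0
        return StepResult(observation, reward, done)


def build_group(
    make_env: Callable[[], ShellTaskEnv], group_size: int
) -> List[BridgedEnv]:
    """Create several independent copies of one task, one per rollout."""
    return [BridgedEnv(make_env()) for _ in range(group_size)]


@dataclass
class ToyEnv:
    """In-memory fake task: succeeds if 'touch done.txt' was ever run."""

    commands: List[str] = field(default_factory=list)

    def step(self, command: str) -> str:
        self.commands.append(command)
        return f"ran: {command}"

    def grade(self) -> float:
        return 1.0 if "touch done.txt" in self.commands else 0.0


if __name__ == "__main__":
    env = build_group(ToyEnv, group_size=2)[0]
    print(env.step("touch done.txt"))
    print(env.step("submit"))
```

Building a group of identical environments per task reflects a common pattern in group-based policy-gradient training, where several rollouts of the same task are compared against each other; whether train.py uses exactly this shape is an assumption.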

@benediktstroebl benediktstroebl merged commit 5a95710 into main Apr 11, 2026
2 checks passed
@benediktstroebl benediktstroebl deleted the feature/harbor-rl-example-5r9 branch April 11, 2026 01:15