[Feature Request] Agent trajectory / trace collection for Agentic RL training

Hi maintainers,

I've been exploring OpenSandbox as an execution environment for Agentic RL training, 
and I think it's a great fit — isolated sandboxes with multi-language support are 
exactly what's needed for safe agent execution.

One capability I'd find very valuable is **native agent trajectory collection**: 
the ability to automatically record agent interaction traces (inputs, tool calls, 
stdout/stderr, code execution results, file changes, errors, timestamps) within 
a sandbox session, and export them in a structured format (e.g. JSON / JSONL) 
suitable for RL training pipelines.

Currently this requires wrapping every SDK call manually, which is error-prone 
and not portable across projects.

**My questions:**

1. Is there an existing plan or roadmap item for built-in trace/trajectory 
   collection in OpenSandbox?
2. If not, would the maintainers be open to accepting a contribution that adds 
   this feature? My intended use case is Agentic RL training — collecting 
   (observation, action, reward signal) trajectories from agent runs inside 
   sandboxes.

I'm willing to implement this and submit a PR. Before doing so, I'd love to 
align on the design (where it lives, what format, opt-in vs always-on, etc.) 
to make sure it fits the project's direction.

Thanks for building this — looking forward to your thoughts!


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Feature Request] Agent trajectory / trace collection for Agentic RL training #609

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Feature Request] Agent trajectory / trace collection for Agentic RL training #609

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions