[Feature Request] Agent trajectory / trace collection for Agentic RL training #609

@khazic

Hi maintainers,

I've been exploring OpenSandbox as an execution environment for Agentic RL training,
and I think it's a great fit — isolated sandboxes with multi-language support are
exactly what's needed for safe agent execution.

One capability I'd find very valuable is native agent trajectory collection:
the ability to automatically record agent interaction traces (inputs, tool calls,
stdout/stderr, code execution results, file changes, errors, timestamps) within
a sandbox session, and export them in a structured format (e.g. JSON / JSONL)
suitable for RL training pipelines.
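
To make the export format concrete, here is a rough sketch of what one trace record and a JSONL export could look like. Everything in it is an assumption for discussion: `TraceEvent`, its field names, and `to_jsonl` are hypothetical and not part of any existing OpenSandbox API.

```python
import json
import time
from dataclasses import asdict, dataclass, field
from typing import Any


@dataclass
class TraceEvent:
    """One recorded step of a sandbox session (hypothetical schema)."""
    session_id: str
    step: int
    kind: str                 # e.g. "input", "tool_call", "exec_result", "file_change", "error"
    payload: dict[str, Any]   # kind-specific data such as stdout/stderr or a file diff
    timestamp: float = field(default_factory=time.time)


def to_jsonl(events: list[TraceEvent]) -> str:
    """Serialize events as JSON Lines: one JSON object per line."""
    return "\n".join(json.dumps(asdict(e), sort_keys=True) for e in events)


# Example events with fixed timestamps so the output is reproducible.
events = [
    TraceEvent("sess-1", 0, "input", {"text": "print the sum of 2 and 3"}, timestamp=0.0),
    TraceEvent("sess-1", 1, "exec_result",
               {"stdout": "5\n", "stderr": "", "exit_code": 0}, timestamp=1.0),
]
print(to_jsonl(events))
```

One object per line keeps the file appendable during a run and easy to stream into a training pipeline, which is why I lean toward JSONL over a single JSON document.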

Currently this requires wrapping every SDK call manually, which is error-prone
and not portable across projects.
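
For reference, the manual approach looks roughly like the sketch below. `run_code` is only a stand-in for an SDK call, not a real OpenSandbox function, and `traced` and `TRACE` are hypothetical helper names. Every project ends up re-implementing some version of this decorator and applying it to each call by hand.

```python
import functools
import time
from typing import Any, Callable

TRACE: list[dict[str, Any]] = []  # in-memory log; a real collector would stream to disk


def traced(name: str) -> Callable:
    """Decorator that records arguments, result or error, and timing of a call."""
    def decorator(fn: Callable) -> Callable:
        @functools.wraps(fn)
        def wrapper(*args: Any, **kwargs: Any) -> Any:
            record: dict[str, Any] = {
                "call": name,
                "args": repr(args),
                "kwargs": repr(kwargs),
                "start": time.time(),
            }
            try:
                result = fn(*args, **kwargs)
                record["result"] = repr(result)
                return result
            except Exception as exc:
                record["error"] = f"{type(exc).__name__}: {exc}"
                raise  # tracing must not swallow the original failure
            finally:
                record["end"] = time.time()
                TRACE.append(record)
        return wrapper
    return decorator


@traced("run_code")
def run_code(source: str) -> str:
    """Stand-in for a sandbox SDK call; NOT a real OpenSandbox function."""
    if "boom" in source:
        raise RuntimeError("execution failed")
    return "ok"


run_code("print(1)")
try:
    run_code("boom")
except RuntimeError:
    pass  # the failure is still captured in TRACE
```

Any call that is not decorated is silently missing from the trace, which is the main reason I'd prefer collection to live inside the SDK or the sandbox itself.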

My questions:

  1. Is there an existing plan or roadmap item for built-in trace/trajectory
    collection in OpenSandbox?
  2. If not, would the maintainers be open to accepting a contribution that adds
    this feature? My intended use case is Agentic RL training — collecting
    (observation, action, reward signal) trajectories from agent runs inside
    sandboxes.
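
As an illustration of the target shape, recorded steps could be turned into (observation, action, reward) tuples along these lines. The step dictionary keys, `Transition`, `to_transitions`, and the exit-code reward are all assumptions made up for this sketch; a real reward signal would come from the training setup, not from the sandbox.

```python
from typing import Callable, NamedTuple


class Transition(NamedTuple):
    observation: str   # what the agent saw before acting
    action: str        # what the agent did, e.g. the command it submitted
    reward: float      # scalar signal assigned after the step


def exit_code_reward(step: dict) -> float:
    """Toy reward (an assumption): 1.0 if the command exited cleanly, else 0.0."""
    return 1.0 if step.get("exit_code") == 0 else 0.0


def to_transitions(
    steps: list[dict], reward_fn: Callable[[dict], float]
) -> list[Transition]:
    """Map recorded steps to transitions using a pluggable reward function."""
    return [Transition(s["observation"], s["action"], reward_fn(s)) for s in steps]


steps = [
    {"observation": "task: list files", "action": "ls /tmp", "exit_code": 0},
    {"observation": "task: read config", "action": "cat /missing", "exit_code": 1},
]
trajectory = to_transitions(steps, exit_code_reward)
```

Keeping the reward function pluggable means the trace itself only needs to store raw observations and outcomes, and each training pipeline can score them its own way.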

I'm willing to implement this and submit a PR. Before doing so, I'd love to
align on the design (where it lives, what format, opt-in vs always-on, etc.)
to make sure it fits the project's direction.

Thanks for building this — looking forward to your thoughts!

Labels: feature (New feature or request)