Description
Hi maintainers,
I've been exploring OpenSandbox as an execution environment for Agentic RL training,
and I think it's a great fit — isolated sandboxes with multi-language support are
exactly what's needed for safe agent execution.
One capability I'd find very valuable is native agent trajectory collection:
the ability to automatically record agent interaction traces (inputs, tool calls,
stdout/stderr, code execution results, file changes, errors, timestamps) within
a sandbox session, and export them in a structured format (e.g. JSON / JSONL)
suitable for RL training pipelines.
Currently this requires wrapping every SDK call manually, which is error-prone
and not portable across projects.
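To make the request concrete, here is a rough sketch of the kind of recorder I currently hand-roll around SDK calls. Everything in it is hypothetical (`TraceEvent`, `TrajectoryRecorder`, the field names, the event kinds); none of it is an existing OpenSandbox API, and the stand-in `fake_run` function only simulates a sandbox call. It is meant to illustrate the record shape and the JSONL export, not to propose a final design.

```python
import io
import json
import time
from dataclasses import asdict, dataclass, field
from typing import Any, Callable, Dict, List, Optional, TextIO


@dataclass
class TraceEvent:
    """One step of an agent trajectory (hypothetical schema)."""
    step: int
    kind: str                      # e.g. "tool_call", "code_execution", "file_change"
    payload: Dict[str, Any]        # inputs and/or results for this step
    stdout: str = ""
    stderr: str = ""
    error: Optional[str] = None
    timestamp: float = field(default_factory=time.time)


class TrajectoryRecorder:
    """Collects events for one sandbox session and exports them as JSONL."""

    def __init__(self, session_id: str) -> None:
        self.session_id = session_id
        self.events: List[TraceEvent] = []

    def record(self, kind: str, payload: Dict[str, Any], **fields: Any) -> TraceEvent:
        # Steps are numbered in the order they are recorded.
        event = TraceEvent(step=len(self.events), kind=kind, payload=payload, **fields)
        self.events.append(event)
        return event

    def traced(self, kind: str, fn: Callable[..., Any], *args: Any, **kwargs: Any) -> Any:
        """Call fn, record its result or its exception, and re-raise on failure."""
        call = {"args": repr(args), "kwargs": repr(kwargs)}
        try:
            result = fn(*args, **kwargs)
        except Exception as exc:
            self.record(kind, call, error=repr(exc))
            raise
        self.record(kind, {**call, "result": repr(result)})
        return result

    def export_jsonl(self, out: TextIO) -> None:
        # One JSON object per line, tagged with the session id.
        for event in self.events:
            row = {"session_id": self.session_id, **asdict(event)}
            out.write(json.dumps(row) + "\n")


def fake_run(code: str) -> str:
    """Stand-in for a real sandbox call (assumption, not an SDK function)."""
    return f"ran {len(code)} chars"


recorder = TrajectoryRecorder(session_id="demo-session")
recorder.record("tool_call", {"name": "list_files"}, stdout="a.txt\n")
recorder.traced("code_execution", fake_run, "print('hi')")

buffer = io.StringIO()
recorder.export_jsonl(buffer)
```

With built-in support, the `traced` wrapper is the part I would hope to drop: the sandbox itself would emit these events, and users would only call something like the export step.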
My questions:
- Is there an existing plan or roadmap item for built-in trace/trajectory
  collection in OpenSandbox?
- If not, would the maintainers be open to accepting a contribution that adds
  this feature?

My intended use case is Agentic RL training: collecting
(observation, action, reward signal) trajectories from agent runs inside
sandboxes.
I'm willing to implement this and submit a PR. Before doing so, I'd love to
align on the design (where it lives, what format, opt-in vs always-on, etc.)
to make sure it fits the project's direction.
Thanks for building this — looking forward to your thoughts!