[rollout, tool] feat: add experimental agent framework and gateway runtime by zackcxb · Pull Request #5931 · verl-project/verl

zackcxb · 2026-04-08T18:50:33Z

What does this PR do?

This PR adds an experimental agent framework and gateway runtime for multi-turn agent-style rollout
in VERL, according to #5790.

Specifically, it:

adds verl.experimental.agent_framework for a new abstraction for agent systems, with an example implementation,
adds verl.experimental.agent_gateway for OpenAI-compatible session serving and sticky session
routing,
integrates gateway-backed session runtime into AsyncLLMServerManager,
adds focused tests for framework assembly, gateway actor/manager behavior, and session runtime
ownership.

WIP:

CLAassistant · 2026-04-08T18:50:41Z

Thank you for your submission! We really appreciate it. Like many open source projects, we ask that you sign our Contributor License Agreement before we can accept your contribution.
_{You have signed the CLA already but the status is still pending? Let us recheck it.}

gemini-code-assist

Code Review

This pull request introduces a new experimental agent framework for collecting and assembling agent trajectories. It includes a GatewayActor for managing OpenAI-compatible chat sessions, a GatewayManager for routing sessions across multiple gateways, and a TrajectoryAssembler to convert collected trajectories into a training-ready DataProto. Additionally, the AsyncLLMServerManager has been updated to support this new gateway-backed session runtime. I have no feedback to provide as there are no review comments.

wuxibin89 · 2026-04-09T06:12:10Z

+        if "rm_scores" in batch_tensors:
+            meta_info["reward_extra_keys"] = reward_extra_keys
+
+        return DataProto(


Do not use DataProto, use TensorDict instead.

wuxibin89 · 2026-04-09T06:13:19Z

        self.config = config
        self._load_balancer = load_balancer_handle
-        self._server_id_to_handle: dict[str, ray.actor.ActorHandle] = dict(servers)
+        self._server_id_to_handle: dict[str, ray.actor.ActorHandle] = dict(servers or [])


Do not modify agent_loop, we will adapt to agent gateway once it's mature.

so we rewrite the AsyncLLMServerManager class, put it together with anything we inherit from the current agent_loop.py under a new path (e.g. verl/agent)?

wangtiance · 2026-04-13T08:15:37Z

-            await asyncio.gather(*[_await_ray_ref(gateway.shutdown.remote()) for gateway in self.owned_gateway_actors])
-        self.owned_gateway_actors = []
-        self.gateway_manager = None
+        self._server_id_to_handle: dict[str, ray.actor.ActorHandle] = dict(servers)


self._server_id_to_handle: dict[str, ray.actor.ActorHandle] = dict(servers or [])

Polish agent gateway merge prep

e086823

gemini-code-assist bot reviewed Apr 8, 2026

View reviewed changes

wuxibin89 mentioned this pull request Apr 9, 2026

[roadmap] verl 26Q2 roadmap #5836

Open

33 tasks

wuxibin89 reviewed Apr 9, 2026

View reviewed changes

[rollout, tool] refactor: move agent framework and gateway to verl.agent

9bb1d8a

wangtiance reviewed Apr 13, 2026

View reviewed changes

Add a minimal example for agent framework

91b5be5

zackcxb force-pushed the main branch from 11d0aaf to 91b5be5 Compare April 14, 2026 11:27

Unit tests adjustment

f15710f

zackcxb force-pushed the main branch from d8c153e to f15710f Compare April 16, 2026 01:19

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[rollout, tool] feat: add experimental agent framework and gateway runtime#5931

[rollout, tool] feat: add experimental agent framework and gateway runtime#5931
zackcxb wants to merge 4 commits intoverl-project:mainfrom
zackcxb:main

zackcxb commented Apr 8, 2026

Uh oh!

CLAassistant commented Apr 8, 2026

Uh oh!

gemini-code-assist bot left a comment

Uh oh!

wuxibin89 Apr 9, 2026

Uh oh!

wuxibin89 Apr 9, 2026

Uh oh!

wangtiance Apr 9, 2026

Uh oh!

wangtiance Apr 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Conversation

zackcxb commented Apr 8, 2026

What does this PR do?

Checklist Before Starting

Test

API and Usage Example

Design & Code Changes

Checklist Before Submitting

Uh oh!

CLAassistant commented Apr 8, 2026

Uh oh!

gemini-code-assist bot left a comment

Choose a reason for hiding this comment

Code Review

Uh oh!

wuxibin89 Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

wuxibin89 Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

wangtiance Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

wangtiance Apr 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants