Revert VLM support in parse_response#5561
Conversation
The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
I'm not sure about this PR, as it seems to overlap quite a bit with what I'm currently addressing in #5489. The approach I'm taking there is intentionally incremental:
- First, ensure that we can consistently pass only tokenizer instances to `parse_response` (by introducing `self._tokenizer` across trainers).
- Then, in a follow-up step, simplify `parse_response` to only accept tokenizers: Make `parse_response` accept only tokenizer
Given that, this PR feels somewhat like duplicated effort. Would it make sense to wait for #5489 to land instead?
For context, I’m already working through the relevant discussion here: #5489 (comment)
`parse_response` only needs a tokenizer instance, but it had to handle both because we did not have a simple way to pass only the tokenizer. Once we implement `self._tokenizer` in all trainers, `parse_response` could be simplified to accept only tokenizer instances.
and here: #5489 (comment)
More broadly, the underlying goal of this PR is to centralize the processor/tokenizer disambiguation within `processing_class` in a single place, so that the rest of the code can rely on a well-defined and consistent interface, with a clear expected class instance.
In that sense, the current change in calling `parse_response` is an intermediate step toward that simplification, rather than a deviation from it.
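The call-site disambiguation described above can be sketched as follows. This is a minimal illustration with stand-in classes: `FakeTokenizer`, `FakeProcessor`, and `resolve_tokenizer` are hypothetical names, standing in for `PreTrainedTokenizer`, a VLM processor, and the per-trainer selection logic.

```python
class FakeTokenizer:
    """Stands in for a PreTrainedTokenizer."""
    def decode(self, ids):
        return " ".join(str(i) for i in ids)


class FakeProcessor:
    """Stands in for a VLM processor that wraps an inner tokenizer."""
    def __init__(self):
        self.tokenizer = FakeTokenizer()


def resolve_tokenizer(processing_class):
    # The call site selects the inner tokenizer when given a processor,
    # so downstream helpers only ever see a tokenizer instance.
    return getattr(processing_class, "tokenizer", processing_class)


# Both a bare tokenizer and a wrapping processor resolve to a tokenizer:
assert isinstance(resolve_tokenizer(FakeProcessor()), FakeTokenizer)
assert isinstance(resolve_tokenizer(FakeTokenizer()), FakeTokenizer)
```

Centralizing this check once per trainer keeps the helper's signature narrow while still supporting VLM processors at the boundary.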
Ah yes, ok. LGTM @albertvillanova
Cursor Bugbot has reviewed your changes and found 1 potential issue.
Reviewed by Cursor Bugbot for commit 973cf25.

`parse_response` previously accepted either a tokenizer or a processor (from #5323) and unwrapped the inner tokenizer on the fly. Now that call sites can easily pass the tokenizer directly, we move that disambiguation to the call sites and keep `parse_response` strictly tokenizer-only. This centralizes the "processor vs tokenizer" logic in one place per trainer and makes `parse_response`'s contract simpler.

Note
Medium Risk
Touches response parsing used during RLHF tool-call decoding; missed/unhandled call sites or incorrect tokenizer selection could break parsing for some models, especially VLM processors.
Overview
`parse_response` now only accepts a `PreTrainedTokenizer` (removing implicit VLM processor support/auto-unwrapping) and updates its docstring accordingly.

All affected call sites (notably `GRPOTrainer`/`DPPOTrainer` tool-call decoding paths and `TestParseResponse`) now explicitly select `processing_class.tokenizer` for VLM processors before calling `parse_response`, keeping response/tool-call parsing behavior the same while simplifying the helper's contract.

Reviewed by Cursor Bugbot for commit cc3905c.
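The updated call-site pattern from the overview can be sketched like this. The stub classes and the simplified `parse_response` below are hypothetical, written only to illustrate the tokenizer-only contract; the real TRL helper does tool-call parsing, not a bare `decode`.

```python
def parse_response(tokenizer, token_ids):
    # Tokenizer-only contract: no processor unwrapping happens in here.
    return tokenizer.decode(token_ids)


class StubTokenizer:
    """Stands in for a PreTrainedTokenizer."""
    def decode(self, ids):
        return "".join(chr(i) for i in ids)


class StubVLMProcessor:
    """Stands in for a VLM processor wrapping a tokenizer."""
    def __init__(self):
        self.tokenizer = StubTokenizer()


processing_class = StubVLMProcessor()

# The call site explicitly selects the inner tokenizer for VLM processors
# before calling the helper, instead of letting the helper unwrap it:
tokenizer = (
    processing_class.tokenizer
    if hasattr(processing_class, "tokenizer")
    else processing_class
)
text = parse_response(tokenizer, [104, 105])  # → "hi"
```

Behavior at the call site is unchanged; only the location of the disambiguation moves.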