qwen3-coder tool call parser #16755

marceldev89 · 2025-10-24T10:40:56Z

Note

Original work and PR by bold84 @ #15019

This pull request resolves #15012 and introduces comprehensive support for the Qwen3-Coder model family's XML-based tool-calling format. It includes a new, robust XML parser and updated chat template detection logic to ensure reliable function calling.

Key Changes:

New XML Parser (common/chat-parser.cpp):
- A dedicated, non-streaming XML parser has been implemented to handle the Qwen3-Coder's specific output format.
- Features include robust attribute parsing, improved error reporting, and efficient function lookups using a hash set.
Chat Template Detection (common/chat.h, common/chat.cpp):
- The chat template detection logic has been updated to correctly identify Qwen3-Coder models, preventing conflicts with other formats like Hermes 2.
- Ensures the QWEN3_CODER_XML format is applied consistently, even when no tools are explicitly provided in the request.
Comprehensive tests (tests/test-chat.cpp):
- Comprehensive tests for the parser logic has been implemented.

Known issues:

The model (Qwen3-Coder-30B-A3B-Instruct-UD-Q*_K_XL.gguf) occasionally stops prefixing tool calls with the proper <tool_call>. This seems to be an issue with the model itself(?).

…r_edit Fix grammar, hide tool_call from output

Add missing closing brace to terminate test_template_output_parsers() function. This resolves compilation errors that prevented successful build of the test-chat target.

Co-authored-by: Kashyap Jois <[email protected]>

Co-authored-by: Marcel de Vries <[email protected]>

…d84/llama.cpp into qwen3-coder_tool_call_parser

…ranches; add tests - chat-parser: support schema.type as array (e.g. ["number","null"]) in convert_qwen3_param_value() - chat: resolve $refs; allow unions including "string" as freeform; sanitize empty {"not":{}} in anyOf/oneOf before add_schema - tests: add Qwen3-Coder regression ensuring grammar builds with unions and ignores {"not":{}}

See https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct/blob/main/chat_template.jinja

coder543 · 2025-10-24T14:50:47Z

Anecdotally, I observed that the previous PR (and presumably this PR too) essentially fixed tool calling for qwen3-coder. Although when trying to use it with codex, qwen3-coder absolutely refuses to use the apply_patch tool, opting to use sed instead, which is probably just a training issue?

It would be nice to get this PR merged in.

marceldev89 · 2025-10-24T14:57:11Z

Anecdotally, I observed that the previous PR (and presumably this PR too) essentially fixed tool calling for qwen3-coder. Although when trying to use it with codex, qwen3-coder absolutely refuses to use the apply_patch tool, opting to use sed instead, which is probably just a training issue?

It would be nice to get this PR merged in.

I guess you could test it through openrouter or something and check if you see the same behavior there as well. My guess would be that it's a model thing and not so much this PR. Or maybe even a codex thing since it's probably heavily optimized for GPT models in terms of system prompt and tool descriptions.

MartyLake · 2025-10-24T20:47:53Z

Hey, just to confirm that running this branch fixes the integration with Qwen3-Coder-30B-A3B.

Reproduction steps:

# Compile this branch
mkdir $HOME/bin; cd $HOME/bin
git clone https://github.com/marceldev89/llama.cpp.git llama.cpp-fork-sources && cd llama.cpp-fork-sources
cmake -Bbuild && cmake --build build --target llama-server --parallel

# Install qwen
brew install qwen-coder

# Launch model
$HOME/bin/llama.cpp-fork-sources/build/bin/llama-server --port 8012 --host 0.0.0.0 --jinja -ngl 99 -c 300000 -m $HOME/.lmstudio/models/hf.co/hf.co-unsloth-Qwen3-Coder-30B-A3B-Instruct-GGUF-UD-Q4-K-XL-GGUF/hf.co-unsloth-Qwen3-Coder-30B-A3B-Instruct-GGUF-UD-Q4-K-XL.gguf

# Launch qwen
OPENAI_API_KEY=no OPENAI_BASE_URL=http://localhost:8012/v1 OPENAI_MODEL=models/hf.co-unsloth-Qwen3-Coder-30B-A3B-Instruct-GGUF-UD-Q4-K-XL.gguf qwen

PS: I opened too many tabs to figure it out, and I can’t find the sources any more to properly source them. I invented nothing here, credits goes to whoever wrote the pieces first.

bold84 and others added 23 commits August 2, 2025 02:02

qwen3-coder tool call parser

90dd63a

reset template

c920daf

Merge branch 'master' into qwen3-coder_tool_call_parser

5c7c5dd

Fix grammar, hide tool_call from output

2de36f5

Merge pull request ggml-org#1 from bold84/qwen3-coder_tool_call_parse…

dda43af

…r_edit Fix grammar, hide tool_call from output

Fix C++ compilation error in tests/test-chat.cpp

89daf6b

Add missing closing brace to terminate test_template_output_parsers() function. This resolves compilation errors that prevented successful build of the test-chat target.

Update common/chat.cpp

b5e3747

Co-authored-by: Kashyap Jois <[email protected]>

Update common/chat.cpp

dc6c4f2

Co-authored-by: Kashyap Jois <[email protected]>

Fix for test

6e1fb00

revert

9b512e4

Update common/chat.cpp

ccad78f

Co-authored-by: Marcel de Vries <[email protected]>

Update common/chat.cpp

e33da80

Co-authored-by: Marcel de Vries <[email protected]>

Merge branch 'qwen3-coder_tool_call_parser' of https://github.com/bol…

a7f2105

…d84/llama.cpp into qwen3-coder_tool_call_parser

removed test

9a2cca8

Moved common_chat_parse_qwen3_coder_xml

ca51625

Merge branch 'master' into qwen3-coder_tool_call_parser

f43719f

Fix merge oopsie

11f3dbd

Merge branch 'master' into qwen3-coder_tool_call_parser

2520059

Merge branch 'master' into qwen3-coder_tool_call_parser

1ba0322

Sync bundled template with upstream

d1fe943

See https://huggingface.co/Qwen/Qwen3-Coder-30B-A3B-Instruct/blob/main/chat_template.jinja

Merge branch 'master' into qwen3-coder_tool_call_parser

0563a5d

Fix crash when tool call doesn't start with <tool_call>

e52c95c

marceldev89 requested a review from ggerganov as a code owner October 24, 2025 10:40

marceldev89 mentioned this pull request Oct 24, 2025

qwen3-coder tool call parser #15019

Closed

github-actions bot added the testing Everything test related label Oct 24, 2025

Merge branch 'master' into qwen3-coder_tool_call_parser

08cc2af

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

qwen3-coder tool call parser #16755

qwen3-coder tool call parser #16755

marceldev89 commented Oct 24, 2025 •

edited

Loading

Uh oh!

coder543 commented Oct 24, 2025

Uh oh!

marceldev89 commented Oct 24, 2025 •

edited

Loading

Uh oh!

MartyLake commented Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

qwen3-coder tool call parser #16755

Are you sure you want to change the base?

qwen3-coder tool call parser #16755

Conversation

marceldev89 commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Key Changes:

Known issues:

Uh oh!

coder543 commented Oct 24, 2025

Uh oh!

marceldev89 commented Oct 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

MartyLake commented Oct 24, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

marceldev89 commented Oct 24, 2025 •

edited

Loading

marceldev89 commented Oct 24, 2025 •

edited

Loading