Skip to content

vllm/transformer cannot load GLM-4.7-flash due to module name mismatch #134

@blairyeung

Description

@blairyeung

System Info / 系統信息

Driver Version: 575.57.08 CUDA Driver Version: 12.9 RTX 5090 (Blackwell)
vllm 0.14.1
transformers 5.0.0.dev0 (在4.57.6显示 transformers不支持)

Who can help? / 谁可以帮助到您?

@zRzRzRzRzRzRzR

Information / 问题信息

  • The official example scripts / 官方的示例脚本
  • My own modified scripts / 我自己修改的脚本和任务

Reproduction / 复现过程

CUDA_VISIBLE_DEVICES=2,3
vllm serve /data2/models/ZhipuAI/GLM-4.7-Flash
--tensor-parallel-size 2
--tool-call-parser glm47
--reasoning-parser glm45
--quantization fp8
--max-model-len 32768
--enable-auto-tool-choice
--served-model-name glm-4.7-flash


(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] File "/home/user/miniconda3/envs/vllm-nightly/lib/python3.12/site-packages/vllm/model_executor/models/utils.py", line 335, in load_weights
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] autoloaded_weights = set(self._load_module("", self.module, weights))
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] File "/home/user/miniconda3/envs/vllm-nightly/lib/python3.12/site-packages/vllm/model_executor/models/utils.py", line 288, in _load_module
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] yield from self._load_module(
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] File "/home/user/miniconda3/envs/vllm-nightly/lib/python3.12/site-packages/vllm/model_executor/models/utils.py", line 288, in _load_module
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] yield from self._load_module(
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] File "/home/user/miniconda3/envs/vllm-nightly/lib/python3.12/site-packages/vllm/model_executor/models/utils.py", line 319, in _load_module
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] raise ValueError(msg)
(Worker_TP0 pid=884060) ERROR 01-26 14:08:37 [multiproc_executor.py:749] ValueError: There is no module or parameter named 'model.layers.47' in TransformersMoEForCausalLM

Expected behavior / 期待表现

launches without error

Metadata

Metadata

Labels

No labels
No labels

Type

No type
No fields configured for issues without a type.

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions