[fix] Fix qwen3.5 MoE vp_stage propagation under VPP #112

Open
none0663 wants to merge 1 commit into ISEEKYAN:main from none0663:vpp-for-qwen3.5

@none0663 none0663 commented Apr 1, 2026

  • Pass vp_stage from Qwen3_5VlBaseBridge provider to Qwen3_5VLModel.

  • Add vp_stage arg in Qwen3_5VLModel and forward to GPTModel only when the current Megatron version supports it.

  • Keep backward compatibility with older Megatron versions whose GPTModel.__init__ may not accept vp_stage.
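
For readers who want to see the shape of the compatibility check, here is a minimal sketch of forwarding `vp_stage` only when the target constructor accepts it. It is not the code from this PR: `OldGPTModel`, `NewGPTModel`, and `build_language_model` are hypothetical stand-ins so the example runs without Megatron installed, and using `inspect.signature` for the version check is an assumption about one reasonable way to do it.

```python
import inspect


class OldGPTModel:
    """Hypothetical stand-in for an older GPTModel without vp_stage."""

    def __init__(self, config):
        self.config = config
        self.vp_stage = None


class NewGPTModel:
    """Hypothetical stand-in for a newer GPTModel that accepts vp_stage."""

    def __init__(self, config, vp_stage=None):
        self.config = config
        self.vp_stage = vp_stage


def build_language_model(gpt_cls, config, vp_stage=None):
    """Construct gpt_cls, passing vp_stage only if its __init__ declares it."""
    kwargs = {}
    init_params = inspect.signature(gpt_cls.__init__).parameters
    if vp_stage is not None and "vp_stage" in init_params:
        kwargs["vp_stage"] = vp_stage
    return gpt_cls(config, **kwargs)


# The newer constructor receives the stage; the older one is called without it.
new_model = build_language_model(NewGPTModel, {}, vp_stage=1)
old_model = build_language_model(OldGPTModel, {}, vp_stage=1)
```

Checking the signature instead of comparing version strings keeps the guard tied to the actual constructor, so it should also behave correctly on forks or patched builds where the version number is not a reliable indicator.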
