Conversation

@MaoZiming
Member

Description

Please include a summary of the changes and the related issue.

Fixes # (issue)

Type of Change

  • Bug fix
  • New feature
  • Documentation update

How Has This Been Tested?

Include any tests here.

  • Unit tests
  • Integration tests
  • Manual testing

Checklist

  • My code follows the style guidelines, e.g. format.sh.
  • I have run build_and_install.sh to verify compilation.
  • I have removed redundant variables and comments.
  • I have updated the documentation.
  • I have added tests.

@MaoZiming
Member Author

bash launch_vllm_head.sh 10.4.147.22 13345 deepseek-ai/DeepSeek-V3-0324 deepep_low_latency 2 1 8 1
bash launch_vllm_worker.sh 10.4.147.22 13345 deepseek-ai/DeepSeek-V3-0324 deepep_low_latency 2 1 8 1

This works.
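For reference, the two commands above can be parameterized so the shared arguments stay in sync between the head and worker invocations. This is only a sketch: the variable names and the interpretation of the arguments (rendezvous address/port, model, all-to-all backend mode) are assumptions; the trailing numeric arguments are kept verbatim because their meaning is not documented in this thread.

```shell
#!/usr/bin/env sh
# Hedged sketch: parameterize the launch commands from the comment above.
# Only the literal values come from the original commands; the names and
# comments are assumptions about what each argument means.
HEAD_IP=10.4.147.22                 # assumed: head-node / rendezvous address
PORT=13345                          # assumed: rendezvous port
MODEL=deepseek-ai/DeepSeek-V3-0324  # model identifier
MODE=deepep_low_latency             # assumed: DeepEP all-to-all backend mode
EXTRA_ARGS="2 1 8 1"                # trailing args kept verbatim (meaning unverified)

# Print the command run on the head node:
echo bash launch_vllm_head.sh "$HEAD_IP" "$PORT" "$MODEL" "$MODE" $EXTRA_ARGS
# Print the command run on each worker node:
echo bash launch_vllm_worker.sh "$HEAD_IP" "$PORT" "$MODEL" "$MODE" $EXTRA_ARGS
```

Using `echo` keeps the sketch runnable without the launch scripts present; drop the `echo` to actually launch.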

@MaoZiming MaoZiming changed the title from "[EP] Debug vLLM; high-throughput works but low-latency stuck at initialization" to "[EP] Support vLLM; high-throughput and low-latency" on Jan 6, 2026
@MaoZiming MaoZiming marked this pull request as ready for review January 6, 2026 07:36
@MaoZiming
Member Author

I am using commit b1029ad right now; there seem to be some changes from merging master. I need to look into this more.

@MaoZiming
Member Author

MaoZiming commented Jan 6, 2026

#620 seems to break the vLLM high-throughput mode integration. Marking it here to double-check later.
I got:

[wait_until_cmd_consumed nvl:5 cmd:4 label:3] waiting slot=565

@MaoZiming MaoZiming merged commit e0233e9 into vllm-launch Jan 7, 2026
@MaoZiming MaoZiming deleted the vllm-launch-zm-debug branch January 7, 2026 21:41