[VLM] Offline scenario, performance-only mode of the reference implementation #2381

wangshangsam · 2025-10-28T15:59:56Z

This is the first PR towards the VLM reference implementation for the v6.0 round.
This PR currently supports the Offline scenario in performance-only mode. The Server scenario and accuracy mode will be introduced in subsequent PRs.
The issue_query implementation adopts the purely asyncio-based design from the DSR1 reference implementation, but the code here is simpler, mostly because we only access the inference endpoint through OpenAI APIs.
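
For readers unfamiliar with the pattern, here is a minimal sketch of what an asyncio-based issue_query could look like. It is not the code from this PR: `OfflineSUT`, `fake_infer`, `on_complete`, and `run_demo` are hypothetical names, the endpoint call is replaced by a stub coroutine so the example runs without a server, and the completion callback stands in for reporting a finished sample back to LoadGen.

```python
import asyncio
import threading
from typing import Awaitable, Callable, List, Tuple


async def fake_infer(sample_id: int) -> str:
    """Stub for the endpoint call (hypothetical).

    A real SUT would await an OpenAI-compatible request here instead.
    """
    await asyncio.sleep(0.01)
    return f"response-{sample_id}"


class OfflineSUT:
    """Hypothetical SUT wrapper that owns an event loop on a background thread."""

    def __init__(
        self,
        infer: Callable[[int], Awaitable[str]],
        on_complete: Callable[[int, str], None],
    ) -> None:
        self._infer = infer
        self._on_complete = on_complete
        self._loop = asyncio.new_event_loop()
        self._thread = threading.Thread(target=self._loop.run_forever, daemon=True)
        self._thread.start()
        self._pending = []

    def issue_query(self, sample_ids: List[int]) -> None:
        # Called from the load generator's thread: schedule and return at once.
        for sample_id in sample_ids:
            future = asyncio.run_coroutine_threadsafe(
                self._handle(sample_id), self._loop
            )
            self._pending.append(future)

    async def _handle(self, sample_id: int) -> None:
        text = await self._infer(sample_id)
        # Report each response individually, as soon as it is ready.
        self._on_complete(sample_id, text)

    def flush(self) -> None:
        # Block until every scheduled request has finished (re-raises errors).
        for future in self._pending:
            future.result()
        self._pending.clear()

    def close(self) -> None:
        # Drain outstanding work, then stop the loop and join its thread.
        self.flush()
        self._loop.call_soon_threadsafe(self._loop.stop)
        self._thread.join()
        self._loop.close()


def run_demo() -> List[Tuple[int, str]]:
    completed: List[Tuple[int, str]] = []
    sut = OfflineSUT(fake_infer, lambda sid, text: completed.append((sid, text)))
    sut.issue_query([0, 1, 2])
    sut.close()
    return sorted(completed)
```

The design choice illustrated is that issue_query never blocks: all requests run concurrently on one event loop, and the wrapper that created the loop and its thread is also responsible for shutting them down.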

github-actions · 2025-10-28T16:00:06Z

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

hanyunfan

LGTM

wangshangsam and others added 10 commits October 7, 2025 03:25

Initial commit.

81b993d

WIP

37f1f14

[Automated Commit] Format Codebase

d032513

misc

41c94c4

Merge branch 'wangshangsam/vlm-sut-prototype' of github.com:CentML/mlperf-inference into wangshangsam/vlm-sut-prototype

40e62bc

adding pydantic_typer

a240d7c

offline WIP

990503c

[Automated Commit] Format Codebase

0bc8773

Merge branch 'master' into wangshangsam/vlm-sut-prototype

ed021c5

[Automated Commit] Format Codebase

7a7c1bc

wangshangsam and others added 4 commits October 28, 2025 12:46

rename the notebook

ab7eeee

Merge branch 'wangshangsam/vlm-sut-prototype' of github.com:CentML/mlperf-inference into wangshangsam/vlm-sut-prototype

e4f5a7e

clean-up

754207e

[Automated Commit] Format Codebase

83ccca4

wangshangsam marked this pull request as ready for review November 4, 2025 08:31

wangshangsam requested a review from a team as a code owner November 4, 2025 08:31

wangshangsam changed the title ~~VLM reference implementation~~ [VLM] Offline scenario, performance-only mode for the reference implementation Nov 4, 2025

wangshangsam changed the title ~~[VLM] Offline scenario, performance-only mode for the reference implementation~~ [VLM] Offline scenario, performance-only mode of the reference implementation Nov 4, 2025

johncalesp reviewed Nov 4, 2025

multimodal/vl2l/pyproject.toml (outdated)

Downgrade from 3.13 to 3.12

36ba877

wangshangsam requested a review from johncalesp November 4, 2025 20:31

[Automated Commit] Format Codebase

a2176d2

johncalesp reviewed Nov 4, 2025

multimodal/vl2l/src/mlperf_inference_multimodal_vl2l/task.py (outdated)

johncalesp reviewed Nov 4, 2025

multimodal/vl2l/src/mlperf_inference_multimodal_vl2l/cli.py

johncalesp reviewed Nov 5, 2025

multimodal/vl2l/src/mlperf_inference_multimodal_vl2l/cli.py (outdated)

johncalesp reviewed Nov 5, 2025

multimodal/vl2l/src/mlperf_inference_multimodal_vl2l/cli.py

send the response back to LoadGen one at a time

126f945

johncalesp reviewed Nov 5, 2025

multimodal/vl2l/src/mlperf_inference_multimodal_vl2l/cli.py (outdated)

Move the ownership of the AsyncOpenAI client into Task, and clean up the client, event loop and the event loop thread

b2400a0

wangshangsam and others added 5 commits November 5, 2025 17:29

Merge branch 'wangshangsam/vlm-sut-prototype' of github.com:CentML/mlperf-inference into wangshangsam/vlm-sut-prototype

cdd0a4a

[Automated Commit] Format Codebase

5ac23a5

fixing typo

0ff5f13

Merge branch 'wangshangsam/vlm-sut-prototype' of github.com:CentML/mlperf-inference into wangshangsam/vlm-sut-prototype

272e31d

[Automated Commit] Format Codebase

ecf95ed

johncalesp reviewed Nov 6, 2025

multimodal/vl2l/src/mlperf_inference_multimodal_vl2l/task.py (outdated)

wangshangsam and others added 10 commits November 6, 2025 21:23

allowing --settings.min_duration to take in float or int as seconds

6e14163
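
As an illustration of the commit above, the following sketch shows one way a setting could accept a float or an int as seconds. It is not the code from this PR: `parse_min_duration` and `to_milliseconds` are hypothetical helpers, and the conversion to milliseconds is shown only on the assumption that the value ends up in a millisecond-based setting.

```python
from datetime import timedelta
from typing import Union


def parse_min_duration(value: Union[int, float, timedelta]) -> timedelta:
    """Interpret a bare number as seconds; pass a timedelta through (hypothetical)."""
    if isinstance(value, timedelta):
        return value
    # bool is a subclass of int, so reject it explicitly.
    if isinstance(value, bool) or not isinstance(value, (int, float)):
        raise TypeError(f"unsupported duration type: {type(value).__name__}")
    if value < 0:
        raise ValueError("duration must be non-negative")
    return timedelta(seconds=value)


def to_milliseconds(duration: timedelta) -> int:
    """Convert a duration to whole milliseconds."""
    return round(duration.total_seconds() * 1000)
```

With these helpers, `to_milliseconds(parse_min_duration(600))` and `to_milliseconds(parse_min_duration(0.5))` both yield integer millisecond counts regardless of whether the input was an int or a float.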

fix lint

51592ae

Merge branch 'wangshangsam/vlm-sut-prototype' of github.com:CentML/mlperf-inference into wangshangsam/vlm-sut-prototype

a3bc387

[Automated Commit] Format Codebase

71493fc

Parametrize use_token_latencies

faa5d8b

Merge branch 'wangshangsam/vlm-sut-prototype' of github.com:CentML/mlperf-inference into wangshangsam/vlm-sut-prototype

9fa624d

[Automated Commit] Format Codebase

4fb5924

fix typos

474f785

Merge branch 'wangshangsam/vlm-sut-prototype' of github.com:CentML/mlperf-inference into wangshangsam/vlm-sut-prototype

9e6346b

[Automated Commit] Format Codebase

579e588

wangshangsam requested a review from johncalesp November 7, 2025 08:40

johncalesp reviewed Nov 7, 2025

multimodal/vl2l/README.md

johncalesp approved these changes Nov 7, 2025

wangshangsam added 2 commits November 10, 2025 15:42

update README

69ce423

Merge branch 'wangshangsam/vlm-sut-prototype' of github.com:CentML/mlperf-inference into wangshangsam/vlm-sut-prototype

e53d279

hanyunfan approved these changes Nov 11, 2025

hanyunfan merged commit 808e2d7 into mlcommons:master Nov 11, 2025
29 checks passed

github-actions bot locked and limited conversation to collaborators Nov 11, 2025

wangshangsam deleted the wangshangsam/vlm-sut-prototype branch November 12, 2025 00:39
