
Conversation

echarlaix
Contributor

@echarlaix commented Sep 12, 2025

echarlaix and others added 4 commits September 16, 2025 13:38
Co-authored-by: Helena Kloosterman <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
Co-authored-by: Pedro Cuenca <[email protected]>
echarlaix and others added 11 commits September 16, 2025 13:48
@echarlaix
Contributor Author

Thanks a lot for the great review @pcuenca! I didn't have time to include everything, but will do so in a second pass. The blog post is not ready for publication yet; once it is, I'll let you know.

Contributor

@merveenoyan left a comment

super cool!

@echarlaix
Contributor Author

Thanks a lot for your reviews @pcuenca @merveenoyan! The blog post is not ready yet (it was set to draft, but I should have clarified that in the description). A lot is likely to change in the coming days, so I don't want you to waste time on corrections that may not make it into the final post. I'll let you know once it's ready!

Co-authored-by: Nikita Savelyev <[email protected]>
openvino-vlm.md Outdated
| openvino-8bit-woq| 0.247 | 0.016 | 0.482 | 63.928 |


This benchmark shows that small, optimized multimodal models like [SmolVLM2-256M](https://huggingface.co/HuggingFaceTB/SmolVLM2-256M-Video-Instruct) can run efficiently on Intel CPUs. Weight-only quantization significantly reduces model size and improves efficiency without a major impact on throughput.
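For reference, loading such a model with OpenVINO and 8-bit weight-only quantization would look roughly like this (a minimal sketch assuming optimum-intel's `OVModelForVisualCausalLM` API; `load_quantized` is a hypothetical helper, and the exact export flags may differ):

```python
MODEL_ID = "HuggingFaceTB/SmolVLM2-256M-Video-Instruct"

def load_quantized(model_id: str):
    # Assumed optimum-intel API: exports the model to OpenVINO IR and
    # applies INT8 weight-only quantization on load.
    from optimum.intel import OVModelForVisualCausalLM
    return OVModelForVisualCausalLM.from_pretrained(model_id, load_in_8bit=True)

if __name__ == "__main__":
    # Downloads the model; requires optimum-intel with the OpenVINO extras.
    model = load_quantized(MODEL_ID)
```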
Contributor Author

cc @ezelanza, would you mind updating this once the benchmark is validated on your side?

Co-authored-by: Nikita Savelyev <[email protected]>
Co-authored-by: Eze Lanza (Eze) <[email protected]>
Comment on lines +152 to +156
| Configuration | Time To First Token (TTFT, s) | Time Per Output Token (TPOT, s) | End-to-End Latency (s) | Decoding Throughput (tokens/s) |
|------------------|--------------------------|----------------------------|-----------------------|-------------------------------|
| pytorch | 5.150 | 1.385 | 25.927 | 0.722 |
| openvino | 0.420 | 0.021 | 0.738 | 47.237 |
| openvino-8bit-woq| 0.247 | 0.016 | 0.482 | 63.928 |
Member

@ezelanza are these numbers from the notebook or from optimum-benchmark? CPU vs CPU? I'm asking because the ~65x acceleration seems too good to be true 😅
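The ~65x figure can at least be checked against the table itself; a quick sanity check of the implied speedups (numbers copied from the benchmark table above; a sketch, not the benchmark code):

```python
# Speedups implied by the benchmark table (PyTorch vs OpenVINO on CPU)
pytorch_tpot, openvino_tpot = 1.385, 0.021  # time per output token
pytorch_thr, openvino_thr = 0.722, 47.237   # decoding throughput

tpot_speedup = pytorch_tpot / openvino_tpot      # ~66x lower per-token latency
throughput_speedup = openvino_thr / pytorch_thr  # ~65x higher decoding throughput
```

So the ~65x claim is internally consistent between the TPOT and throughput columns, whatever its source.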

Member

It would be great to have a reference to the benchmark code.
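In the meantime, a rough sketch of how TTFT, TPOT, and end-to-end latency can be measured with a per-token timing loop (the `measure_latency` helper is hypothetical and stands in for the actual benchmark code; any iterable of streamed tokens, e.g. a transformers `TextIteratorStreamer`, would plug in):

```python
import time

def measure_latency(token_stream, n_tokens=32):
    """Time a token stream: returns (ttft, tpot, end_to_end) in seconds."""
    start = time.perf_counter()
    stamps = []
    for i, _ in enumerate(token_stream):
        stamps.append(time.perf_counter())
        if i + 1 >= n_tokens:
            break
    ttft = stamps[0] - start  # time to first token
    # TPOT: mean gap between consecutive tokens after the first
    tpot = (stamps[-1] - stamps[0]) / max(len(stamps) - 1, 1)
    end_to_end = stamps[-1] - start
    return ttft, tpot, end_to_end

# Dummy stream standing in for model.generate(...) with a streamer:
ttft, tpot, e2e = measure_latency(iter(range(8)), n_tokens=8)
```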

Member

@IlyasMoutawwakil left a comment

LGTM, great work everyone! I left one question about the benchmark numbers / reproduction.
