Feat: Add ONNX export support for LightOn OCR models by remidesbois1 · Pull Request #129 · huggingface/optimum-onnx

remidesbois1 · 2026-04-28T12:11:09Z

Note: This PR replaces and supersedes #128, which was an experimental draft. This PR provides the clean, final implementation.

This PR adds full ONNX export support for LightOn OCR Vision-Language Models (e.g., lightonai/LightOnOCR-2-1B) to optimum.

The export pipeline splits the model into three dedicated ONNX sub-components: vision_encoder, embed_tokens, and decoder_model_merged.
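As a quick sanity check on the three-way split, here is a minimal sketch that reports which components are missing from a list of exported file names. The component names come from this description (using the decoder_model_merged spelling from the summary); the `.onnx` suffix and the helper name `missing_components` are assumptions for illustration, not part of the PR.

```python
from pathlib import Path

# Component names are taken from the PR description; the ".onnx" suffix
# is an assumption about how the exported files are named on disk.
EXPECTED_COMPONENTS = ("vision_encoder", "embed_tokens", "decoder_model_merged")


def missing_components(file_names):
    """Return the expected components that have no matching .onnx file.

    `file_names` is any iterable of file names, for example the names
    found in the export directory. Files with other suffixes (such as
    external-data side files) are ignored.
    """
    present = {Path(name).stem for name in file_names if Path(name).suffix == ".onnx"}
    return [component for component in EXPECTED_COMPONENTS if component not in present]
```

It could be called with something like `missing_components(p.name for p in Path("out").iterdir())`, where `out` is a hypothetical export directory.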

Since the official models on the Hub use "mistral3" as their model_type in config.json, LightonOcrOnnxConfig is registered directly for the mistral3 architecture for image-text-to-text tasks. This may not be the right way to do it, so I'm open to suggestions.

The config sets FORCE_ONNX_EXTERNAL_DATA="1" only during the merge_decoders step, to work around the 2GB Protobuf limit.
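One way to scope an environment variable to a single step is a small context manager that restores the previous value afterwards. This is a sketch of the general pattern only: `temporary_env` and `run_merge_step` are hypothetical names, and the PR's actual implementation is not shown in this thread.

```python
import os
from contextlib import contextmanager


@contextmanager
def temporary_env(name, value):
    """Set an environment variable for the duration of a block, then restore it."""
    previous = os.environ.get(name)
    os.environ[name] = value
    try:
        yield
    finally:
        if previous is None:
            # The variable did not exist before, so remove it again.
            os.environ.pop(name, None)
        else:
            os.environ[name] = previous


def run_merge_step(merge_fn):
    """Call `merge_fn` with FORCE_ONNX_EXTERNAL_DATA=1 set only while it runs.

    `merge_fn` is a placeholder for whatever callable performs the merge.
    """
    with temporary_env("FORCE_ONNX_EXTERNAL_DATA", "1"):
        return merge_fn()
```

Restoring the previous value in `finally` keeps the override from leaking into later export steps, even if the merge raises.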

Testing: Added lighton_ocr to the CI test suite with a tiny dummy model.

Note: The export completes successfully and the output files are valid, but the validation step fails with a ShapeError on the present keys (e.g. (2, 2, 32, 8) vs (2, 2, 16, 8)). Since the logits are accurate, this appears to be a false positive caused by a difference between how the PyTorch model returns its DynamicCache and what the ONNX model outputs. Let me know if there's a preferred way to handle this validation check!
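To make the reported mismatch easier to read, here is a small sketch that lists the axes on which two shapes disagree. The helper name `mismatched_axes` is hypothetical, and which of the two shapes belongs to the PyTorch reference and which to the ONNX output is not stated above, so the argument order here is an assumption.

```python
def mismatched_axes(reference_shape, onnx_shape):
    """Return (axis, reference_dim, onnx_dim) for every axis where the shapes differ."""
    if len(reference_shape) != len(onnx_shape):
        raise ValueError("shapes have different ranks")
    return [
        (axis, ref, got)
        for axis, (ref, got) in enumerate(zip(reference_shape, onnx_shape))
        if ref != got
    ]


# The two shapes quoted in the note above differ only on axis 2.
print(mismatched_axes((2, 2, 32, 8), (2, 2, 16, 8)))
```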

Registers lighton_ocr as a model type and exports it as three separate ONNX files: vision_encoder (ViT + projector), embed_tokens (embedding table), and decoder_model_merged (language model with merged KV cache support). Handles weight key remapping from lighton_ocr to Mistral3 internals and works around the >2GB protobuf limit during decoder merge.

remidesbois1 · 2026-04-28T15:03:57Z

Note on Dynamo:
I've experimented with the new --dynamo exporter. The vision_encoder and embed_tokens components export correctly once their dynamic axes are adjusted, but the decoder_with_past export currently fails. This appears to be because convert_dynamic_axes_into_dynamic_shapes does not yet handle the nested tuple structure of past_key_values (it fails the sum(_ is not None for _ in v) == 1 check). For now, this PR relies on the standard TorchScript-based export, which is fully functional. If you have any ideas on how to make the Dynamo path work, I'd be happy to learn.
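For reference, the quoted check passes only when exactly one entry of `v` is not None. The sketch below reproduces just that expression; the example values of `v` are hypothetical, since the structure the exporter actually passes is not shown in this thread.

```python
def exactly_one_non_none(v):
    """Mirror of the quoted check: True when exactly one entry of `v` is not None."""
    return sum(_ is not None for _ in v) == 1


# Hypothetical inputs, for illustration only.
single_entry = (None, {0: "batch_size"}, None)
# A (key, value) style pair where both entries carry axis information.
pair_entry = ({0: "batch_size"}, {0: "batch_size"})

print(exactly_one_non_none(single_entry))
print(exactly_one_non_none(pair_entry))
```

Under these assumed inputs, the pair fails the check because it has two non-None entries, consistent with nested past_key_values tuples tripping it.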

remidesbois1 added 5 commits April 19, 2026 14:12

Add lighton_ocr tiny test model to CI test suite

4426aea

Fix ruff formatting and lint issues

eaedd00

Feat: Add native support for the official LightOnOCR model via mistral3 model_type

e20a0d3

Fix: Automatically use external data for >2GB models during merge_decoders

5f8bb2b

remidesbois1 marked this pull request as draft April 28, 2026 13:29

remidesbois1 marked this pull request as ready for review April 28, 2026 13:50

Optimized LightOnOCR dynamic axes for stricter exporter compatibility.

a7a0e86

remidesbois1 wants to merge 6 commits into huggingface:main from remidesbois1:feat/lighton-ocr-support
