[Fast image processors] Improve handling of image-like inputs other than images (segmentation_maps) #39489

yonigozlan · 2025-07-17T21:12:55Z

What does this PR do?

As the title says, unbloats (😉) a lot of the fast processing code for models needing to processed segmentation maps, trimaps, depth_maps etc.

HuggingFaceDocBuilderDev · 2025-07-17T21:33:41Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

ArthurZucker

Does seem to simplify a lot! Not 100% sure it BC?

ArthurZucker · 2025-07-21T12:36:41Z

src/transformers/models/eomt/image_processing_eomt_fast.py

+            segmentation_maps_kwargs = kwargs.copy()
+            segmentation_maps_kwargs["do_normalize"] = False
+            segmentation_maps_kwargs["do_rescale"] = False
+            segmentation_maps_kwargs["input_data_format"] = ChannelDimension.FIRST
+            # Nearest interpolation is used for segmentation maps instead of BILINEAR.
+            segmentation_maps_kwargs["interpolation"] = pil_torch_interpolation_mapping[PILImageResampling.NEAREST]
+


I would pass these explicitly instead of updating them but a small nit maybe complicated

I would still need to remove them from the kwargs as they have the default values for image processing there, so not sure if that simplifies...

yonigozlan · 2025-07-21T15:07:40Z

@ArthurZucker Should be 100% BC 🤗

…n-maps-handling

github-actions · 2025-07-21T15:59:57Z

[For maintainers] Suggested jobs to run (before merge)

run-slow: beit, dpt, eomt, idefics2, idefics3, llava_next, llava_onevision, mobilenet_v2, mobilevit, qwen2_vl, sam, smolvlm, vitmatte

…han images (segmentation_maps) (huggingface#39489) * improve handlike of other image-like inputs in fast image processors * fix issues with _prepare_images_structure * update sam image processor fast * use dict update

improve handlike of other image-like inputs in fast image processors

d09d890

yonigozlan requested a review from ArthurZucker July 17, 2025 21:13

fix issues with _prepare_images_structure

5a0c81c

yonigozlan and others added 3 commits July 18, 2025 09:53

Merge branch 'main' into improve-segmentation-maps-handling

da95daa

Merge branch 'main' into improve-segmentation-maps-handling

f29eff9

update sam image processor fast

9b1af7f

ArthurZucker approved these changes Jul 21, 2025

View reviewed changes

yonigozlan added 2 commits July 21, 2025 15:52

Merge remote-tracking branch 'upstream/main' into improve-segmentatio…

d8cf593

…n-maps-handling

use dict update

15e6da3

yonigozlan merged commit b3ebc76 into huggingface:main Jul 21, 2025
25 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[Fast image processors] Improve handling of image-like inputs other than images (segmentation_maps) #39489

[Fast image processors] Improve handling of image-like inputs other than images (segmentation_maps) #39489

Uh oh!

yonigozlan commented Jul 17, 2025

Uh oh!

HuggingFaceDocBuilderDev commented Jul 17, 2025

Uh oh!

ArthurZucker left a comment

Uh oh!

ArthurZucker Jul 21, 2025

Uh oh!

yonigozlan Jul 21, 2025

Uh oh!

yonigozlan commented Jul 21, 2025

Uh oh!

github-actions bot commented Jul 21, 2025

Uh oh!

Uh oh!

Uh oh!

[Fast image processors] Improve handling of image-like inputs other than images (segmentation_maps) #39489

[Fast image processors] Improve handling of image-like inputs other than images (segmentation_maps) #39489

Uh oh!

Conversation

yonigozlan commented Jul 17, 2025

What does this PR do?

Uh oh!

HuggingFaceDocBuilderDev commented Jul 17, 2025

Uh oh!

ArthurZucker left a comment

Choose a reason for hiding this comment

Uh oh!

ArthurZucker Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

yonigozlan Jul 21, 2025

Choose a reason for hiding this comment

Uh oh!

yonigozlan commented Jul 21, 2025

Uh oh!

github-actions bot commented Jul 21, 2025

Uh oh!

Uh oh!

Uh oh!