Request to add SIBench evaluation code #1310

song2yu · 2025-11-09T15:20:43Z

Added evaluation code for the SIBench paper: "How Far are VLMs from Visual Spatial Intelligence? A Benchmark-Driven Perspective https://arxiv.org/abs/2509.18905 ".

Includes inference_mixed.py to support mixed inference for both images and videos.
Includes SIBench.py for processing the SIBenchmark.
Introduced a new MixedOutput format.
Added post-processing support for the MixedOutput format in run.py.

tonysy · 2025-11-12T09:40:10Z

Please fix the lint issue

song2yu · 2025-11-18T07:43:34Z

Thanks for the reminder. I have already checked the code according to the development guide pre-commit run --all-files, and now shows no formatting errors.

song2yu added 6 commits November 9, 2025 23:02

Update run.py

9e7202f

Update __init__.py

a126610

Add SIBench to dataset classes

84e18a8

Add files via upload

c598c82

Add files via upload

32dcfb1

Update run.py

4963cc3

song2yu added 6 commits November 12, 2025 21:43

Refactor SIBench.py for improved readability

a028227

Update inference_mixed.py

c9e7b89

Update SIBench.py

7929563

Refactor parameter formatting in inference_mixed.py

9ddfc26

Update SIBench.py

96bc750

Update inference_mixed.py

e6947bd

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Request to add SIBench evaluation code #1310

Request to add SIBench evaluation code #1310

song2yu commented Nov 9, 2025

Uh oh!

tonysy commented Nov 12, 2025

Uh oh!

song2yu commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Request to add SIBench evaluation code #1310

Are you sure you want to change the base?

Request to add SIBench evaluation code #1310

Conversation

song2yu commented Nov 9, 2025

Uh oh!

tonysy commented Nov 12, 2025

Uh oh!

song2yu commented Nov 18, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants