Beyond Text-Dominance: Understanding Modality Preference of Omni-modal Large Language Models

🎯 What this paper does

Proposing a modality preference evaluation framework for OLLMs: Constructing a tri-modal semantic conflict dataset with quantitative metrics to systematically measure model modality preferences.
Revealing the modality preference landscape of OLLMs: Under tri-modal conflicts, most OLLMs exhibit significant visual preference; under bi-modal conflicts, all models favor the visual modality; across all input combinations, the audio modality is systematically neglected.
Revealing the internal evolution patterns of modality preference: Employing layer-wise linear probing to reveal that modality preference signals are absent in shallow layers and gradually emerge in mid-to-late layers.
Leveraging linear probes for hallucination detection: Discovering that hallucination generation is accompanied by abnormally elevated preference probability toward the interfering modality, enabling effective hallucination detection via linear probes.

🔮 Usage

📍 Data:

data/conflict_triplets_processed.json

📍 Eval:

eval/run_tri-modal.py

📍 Probe:

probe/train.py   # Train linear probes
probe/acc.py     # Calculate accuracy
probe/pred.py    # Predict preference probabilities

📖 Citation

If you find this project helpful, please use the following to cite it:

@article{yan2026beyond,
  title={Beyond Text-Dominance: Understanding Modality Preference of Omni-modal Large Language Models},
  author={Yan, Xinru and Cao, Boxi and Lu, Yaojie and Lin, Hongyu and Zhou, Weixiang and Sun, Le and Han, Xianpei},
  journal={arXiv preprint arXiv:2604.16902},
  year={2026}
}

Name		Name	Last commit message	Last commit date
Latest commit History 70 Commits
assets		assets
data		data
eval		eval
probe		probe
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Beyond Text-Dominance: Understanding Modality Preference of Omni-modal Large Language Models

🎯 What this paper does

🔮 Usage

📖 Citation

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Beyond Text-Dominance: Understanding Modality Preference of Omni-modal Large Language Models

🎯 What this paper does

🔮 Usage

📖 Citation

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages