Skip to content

icip-cas/OmniPreference

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

70 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Beyond Text-Dominance: Understanding Modality Preference of Omni-modal Large Language Models

arxiv License: MIT

🎯 What this paper does

  • Proposing a modality preference evaluation framework for OLLMs: Constructing a tri-modal semantic conflict dataset with quantitative metrics to systematically measure model modality preferences.
  • Revealing the modality preference landscape of OLLMs: Under tri-modal conflicts, most OLLMs exhibit significant visual preference; under bi-modal conflicts, all models favor the visual modality; across all input combinations, the audio modality is systematically neglected.
  • Revealing the internal evolution patterns of modality preference: Employing layer-wise linear probing to reveal that modality preference signals are absent in shallow layers and gradually emerge in mid-to-late layers.
  • Leveraging linear probes for hallucination detection: Discovering that hallucination generation is accompanied by abnormally elevated preference probability toward the interfering modality, enabling effective hallucination detection via linear probes.

🔮 Usage

📍 Data:

data/conflict_triplets_processed.json

📍 Eval:

eval/run_tri-modal.py

📍 Probe:

probe/train.py   # Train linear probes
probe/acc.py     # Calculate accuracy
probe/pred.py    # Predict preference probabilities

📖 Citation

If you find this project helpful, please use the following to cite it:

@article{yan2026beyond,
  title={Beyond Text-Dominance: Understanding Modality Preference of Omni-modal Large Language Models},
  author={Yan, Xinru and Cao, Boxi and Lu, Yaojie and Lin, Hongyu and Zhou, Weixiang and Sun, Le and Han, Xianpei},
  journal={arXiv preprint arXiv:2604.16902},
  year={2026}
}

About

No description, website, or topics provided.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages