This repo contains the code for an evaluation framework that assesses three types of input feature explanations with respect to four metrics. For demonstration, ./examples contains three types of highlight explanations for a subset of SNLI, generated from BERT with the attention method.
- ./configs: base model configurations and explanation paths.
- ./data: gold explanation files and intermediate-step files.
- ./explain_interactions and ./tools: helper functions.
See the main scripts and their corresponding READMEs for the four metrics:
- faithfulness_eval.py (README_faithfulness.md)
- agreement_eval.py (README_agreement.md)
- complexity_eval.py (README_complexity.md)
- simulatability_eval.py (README_simulatability.md)
Please refer to https://github.com/copenlu/spanex for span pair explanation generation.
Please cite our paper if you use this repo in your work:
@inproceedings{sun2025evaluating,
  title={Evaluating Input Feature Explanations through a Unified Diagnostic Evaluation Framework},
  author={Sun, Jingyi and Atanasova, Pepa and Augenstein, Isabelle},
  booktitle={Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)},
  pages={10559--10577},
  year={2025}
}