A multitude of explainability methods has been proposed to help users better understand how modern AI systems make decisions. However, most of the performance metrics developed to evaluate these methods remain largely theoretical, with little consideration for the human end-user. In particular, it is not yet clear (1) how useful current explainability methods are in real-world scenarios, and (2) whether current performance metrics accurately reflect the usefulness of explanation methods for the end-user. To fill this gap, we conducted psychophysics experiments at scale (n = 1,150) to evaluate the usefulness of representative attribution methods in three real-world scenarios. Our results demonstrate that the degree to which individual attribution methods help human participants better understand an AI system varies widely across these scenarios. This suggests the need to move beyond quantitative improvements of current attribution methods and toward the development of complementary approaches that provide qualitatively different sources of information to human end-users.
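The abstract does not name the attribution methods that were tested. As a minimal sketch of how a typical gradient-based attribution method assigns per-pixel importance, the snippet below computes a vanilla-gradient saliency map; the toy PyTorch classifier, input shape, and choice of vanilla gradients are illustrative assumptions, not details taken from the paper.

```python
import torch
import torch.nn as nn

# Toy stand-in classifier; the paper's actual models are not specified
# in this abstract, so this network is purely illustrative.
model = nn.Sequential(nn.Flatten(), nn.Linear(3 * 32 * 32, 10))
model.eval()

# Dummy input image with gradients enabled so the prediction can be
# attributed back to individual pixels.
image = torch.rand(1, 3, 32, 32, requires_grad=True)

# Forward pass, then backpropagate the score of the predicted class.
logits = model(image)
predicted_class = logits.argmax(dim=1).item()
logits[0, predicted_class].backward()

# Vanilla-gradient saliency map: per-pixel importance taken as the
# maximum absolute gradient across color channels (one common convention).
saliency = image.grad.abs().max(dim=1).values.squeeze(0)
print(saliency.shape)  # torch.Size([32, 32]) heatmap over pixels
```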


Source: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10544769
