A Survey of Methods for Explaining Black Box Models

doi:10.1145/3236009

Open AccessJournal Article10.1145/3236009

A Survey of Methods for Explaining Black Box Models

Riccardo Guidotti, +5 more

- 22 Aug 2018

- ACM Computing Surveys

- Vol. 51, Iss: 5, pp 1-42

4.4K

TL;DR: In this paper, the authors provide a classification of the main problems addressed in the literature with respect to the notion of explanation and the type of black box decision support systems, given a problem definition, a black box type, and a desired explanation, this survey should help the researcher to find the proposals more useful for his own work.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Figures

Table 3. Summary of methods for opening and explaining black boxes with respect to the problem faced.

Table 2. Legend of Table 1. In the following are described the features reported and the abbreviations adopted.

Fig. 11. Saliency Masks for explanation of deep neural network. (Left) From [108] the elements of the image highlighted. (Right) From [25] the mask and the level of accuracy on the image considering and not considering the learned mask.

Table 4. Summary of methods for opening and explaining black boxes with respect to the explanator adopted.

Fig. 9. (Left) Generalizable reverse engineering approach: internal peculiarities of the black box are not exploited to build the comprehensible predictor. (Right) Not Generalizable reverse engineering approach: the comprehensible predictor is the result of a procedure involving internal characteristics of the black box.

Citations

Proceedings Article•10.1145/3491101.3503727

Human-Centered Explainable AI (HCXAI): Beyond Opening the Black-Box of AI

Upol Ehsan, +7 more

- 27 Apr 2022

TL;DR: This second CHI workshop on Human-centered XAI (HCXAI), which builds on the success of the first installment from CHI 2021 to expand the conversation around XAI, examines how human-centered perspectives in XAI can be operationalized at the conceptual, methodological, and technical levels.

...read moreread less

80

•Proceedings Article•10.1109/ASE.2019.00079

Property inference for deep neural networks

Divya Gopinath, +3 more

- 10 Nov 2019

TL;DR: In this article, the authors propose to extract patterns based on neuron decisions as preconditions that imply certain desirable output properties, e.g., the prediction being a certain class.

...read moreread less

80

•Journal Article•10.1016/j.patter.2022.100489

An artificial intelligence life cycle: From conception to production

01 Jun 2022

- Patterns

TL;DR: The CDAC AI life cycle as discussed by the authors is a comprehensive life cycle for the design, development, and deployment of artificial intelligence (AI) systems and solutions, which addresses the void of a practical and inclusive approach that spans beyond the technical constructs to also focus on the challenges of risk analysis of AI adoption, transferability of prebuilt models, increasing importance of ethics and governance, and the composition, skills, and knowledge of an AI team required for successful completion.

...read moreread less

80

Proceedings Article•10.48550/arXiv.2206.11104

OpenXAI: Towards a Transparent Evaluation of Model Explanations

Chirag Agarwal, +7 more

- 22 Jun 2022

TL;DR: Overall, OpenXAI provides an automated end-to-end pipeline that not only simplifies and standardizes the evaluation of post hoc explanation methods, but also promotes transparency and reproducibility in benchmarking these methods.

...read moreread less

79

•Journal Article•10.1145/3495013

It’s Complicated: The Relationship between User Trust, Model Accuracy and Explanations in AI

Andrea Papenmeier, +3 more

- 31 Mar 2022

- ACM Transactions on Computer-Human Inter...

TL;DR: In this article , the authors examined the practical consequences of adding explanations for user trust and found that the influence of their explanations on trust differs depending on the classifier's accuracy, revealing discrepancies between self-reported and behavioural trust.

...read moreread less

79

...

Expand

References

Book Chapter•10.1017/CBO9781139207249.009

I and J

William Marsden

- 01 Jan 2012

154.7K

•Journal Article•10.1136/BMJ.323.7325.1375/A

I and i

Kevin Barraclough

- 08 Dec 2001

- BMJ

TL;DR: There is, I think, something ethereal about i —the square root of minus one, which seems an odd beast at that time—an intruder hovering on the edge of reality.

...read moreread less

38.1K

•Book

C4.5: Programs for Machine Learning

J. Ross Quinlan

- 15 Oct 1992

TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and over hitting.

...read moreread less

27.2K

Journal Article•10.1002/WIDM.8

Classification and regression trees

Wei-Yin Loh

- 01 Jan 2011

- Wiley Interdisciplinary Reviews-Data Min...

TL;DR: This article gives an introduction to the subject of classification and regression trees by reviewing some widely available algorithms and comparing their capabilities, strengths, and weakness in two examples.

...read moreread less

18.7K

Proceedings Article•10.1145/2939672.2939778

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Marco Tulio Ribeiro, +2 more

- 13 Aug 2016

TL;DR: In this article, the authors propose LIME, a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem.

...read moreread less

17.3K