A Survey of Methods for Explaining Black Box Models

doi:10.1145/3236009

Open AccessJournal Article10.1145/3236009

A Survey of Methods for Explaining Black Box Models

Riccardo Guidotti, +5 more

- 22 Aug 2018

- ACM Computing Surveys

- Vol. 51, Iss: 5, pp 1-42

4.4K

TL;DR: In this paper, the authors provide a classification of the main problems addressed in the literature with respect to the notion of explanation and the type of black box decision support systems, given a problem definition, a black box type, and a desired explanation, this survey should help the researcher to find the proposals more useful for his own work.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Figures

Table 3. Summary of methods for opening and explaining black boxes with respect to the problem faced.

Table 2. Legend of Table 1. In the following are described the features reported and the abbreviations adopted.

Fig. 11. Saliency Masks for explanation of deep neural network. (Left) From [108] the elements of the image highlighted. (Right) From [25] the mask and the level of accuracy on the image considering and not considering the learned mask.

Table 4. Summary of methods for opening and explaining black boxes with respect to the explanator adopted.

Fig. 9. (Left) Generalizable reverse engineering approach: internal peculiarities of the black box are not exploited to build the comprehensible predictor. (Right) Not Generalizable reverse engineering approach: the comprehensible predictor is the result of a procedure involving internal characteristics of the black box.

Citations

Proceedings Article•10.1145/3351095.3372833

What to account for when accounting for algorithms: a systematic literature review on algorithmic accountability

Maranke Wieringa

- 27 Jan 2020

TL;DR: A definition ofgorithmic accountability based on accountability theory and algorithmic accountability literature is provided, which pays extra attention to accountability risks in algorithmic systems.

...read moreread less

233

Journal Article•10.1016/J.ESWA.2021.115736

Explaining anomalies detected by autoencoders using Shapley Additive Explanations

Liat Antwarg, +3 more

- 30 Dec 2021

- Expert Systems With Applications

TL;DR: This paper proposes a black-box explanation method, which uses Kernel SHAP to explain anomalies detected by an autoencoder, which is an unsupervised model, and demonstrates that the explanations it provides are more effective at reducing the anomaly score than other methods.

...read moreread less

232

•Journal Article•10.1038/S42256-020-0212-3

Making deep neural networks right for the right scientific reasons by interacting with their explanations

Patrick Schramowski, +8 more

- 01 Aug 2020

- Nature Machine Intelligence

TL;DR: The novel learning setting of explanatory interactive learning is introduced and its benefits on a plant phenotyping research task are illustrated and it is demonstrated that explanatory interactiveLearning can help to avoid Clever Hans moments in machine learning.

...read moreread less

230

Journal Article•10.1109/TKDE.2020.2983930

A Survey of Data-driven and Knowledge-aware eXplainable AI

Xiao-Hui Li, +10 more

- 30 Mar 2020

- IEEE Transactions on Knowledge and Data ...

TL;DR: A survey, reviewing and taxonomizing existing efforts from the view-point of DKE, summarizing their contribution, technical essence and comparative characteristics, and categorizing methods into data-driven methods where explanation comes from the task-related data, and knowledge-aware methods where extraneous knowledge is incorporated.

...read moreread less

229

•Journal Article•10.3390/app13127082

Re-Thinking Data Strategy and Integration for Artificial Intelligence: Concepts, Opportunities, and Challenges

Abdulaziz Aldoseri, +2 more

- 13 Jun 2023

- Applied Sciences

TL;DR: In this article , the authors comprehensively review and critically examine the challenges of using data for AI, including data quality, data volume, privacy and security, bias and fairness, interpretability and explainability, ethical concerns, and technical expertise and skills.

...read moreread less

228

...

Expand

References

Book Chapter•10.1017/CBO9781139207249.009

I and J

William Marsden

- 01 Jan 2012

154.7K

•Journal Article•10.1136/BMJ.323.7325.1375/A

I and i

Kevin Barraclough

- 08 Dec 2001

- BMJ

TL;DR: There is, I think, something ethereal about i —the square root of minus one, which seems an odd beast at that time—an intruder hovering on the edge of reality.

...read moreread less

38.1K

•Book

C4.5: Programs for Machine Learning

J. Ross Quinlan

- 15 Oct 1992

TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and over hitting.

...read moreread less

27.2K

Journal Article•10.1002/WIDM.8

Classification and regression trees

Wei-Yin Loh

- 01 Jan 2011

- Wiley Interdisciplinary Reviews-Data Min...

TL;DR: This article gives an introduction to the subject of classification and regression trees by reviewing some widely available algorithms and comparing their capabilities, strengths, and weakness in two examples.

...read moreread less

18.7K

Proceedings Article•10.1145/2939672.2939778

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Marco Tulio Ribeiro, +2 more

- 13 Aug 2016

TL;DR: In this article, the authors propose LIME, a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem.

...read moreread less

17.3K