A Survey of Methods for Explaining Black Box Models

doi:10.1145/3236009

Open Access•Journal Article•10.1145/3236009

Riccardo Guidotti, +5 more

- 22 Aug 2018

- ACM Computing Surveys

- Vol. 51, Iss: 5, pp 1-42

4.4K

TL;DR: This survey classifies the main problems addressed in the literature with respect to the notion of explanation and the type of black box decision support system. Given a problem definition, a black box type, and a desired explanation, it should help researchers find the proposals most useful for their own work.

Figures

Table 3. Summary of methods for opening and explaining black boxes with respect to the problem faced.

Table 2. Legend of Table 1, describing the features reported and the abbreviations adopted.

Fig. 11. Saliency masks for explaining deep neural networks. (Left) From [108], the highlighted elements of the image. (Right) From [25], the mask and the level of accuracy on the image with and without the learned mask.

Table 4. Summary of methods for opening and explaining black boxes with respect to the explanator adopted.

Fig. 9. (Left) Generalizable reverse engineering approach: internal peculiarities of the black box are not exploited to build the comprehensible predictor. (Right) Not Generalizable reverse engineering approach: the comprehensible predictor is the result of a procedure involving internal characteristics of the black box.

Citations

Book Chapter•10.1007/978-3-030-51924-7_4

Decision Theory Meets Explainable AI

Kary Främling, +1 more

- 09 May 2020

TL;DR: CIU provides a universal and model-agnostic foundation for XAI and extends the notions of importance and utility for the non-linear models of AI systems and notably those produced by Machine Learning methods.

37

Journal Article•10.1109/TMAG.2021.3063141

Explainable Deep Neural Network for Design of Electric Motors

Hidenori Sasaki, +2 more

- 02 Mar 2021

- IEEE Transactions on Magnetics

TL;DR: This study presents a novel two-step optimization method that incorporates explainable neural networks into topology optimization and is shown to increase the average torque of an interior permanent magnet (IPM) motor and reduce the torque ripple by 79% compared with the original model.

37

Journal Article•10.1016/j.cogsys.2024.101243

Post-hoc vs ante-hoc explanations: xAI design guidelines for data scientists

Charles Retzlaff, +6 more

- 01 Aug 2024

- Cognitive Systems Research

TL;DR: This study presents a decision support framework for data scientists to choose suitable xAI approaches, comparing six ante-hoc and post-hoc methods through expert interviews, to aid in selecting xAI tools that demystify decision-making and enrich user understanding.

37

Journal Article•10.1016/j.inffus.2024.102303

Adversarial attacks and defenses in explainable artificial intelligence: A survey

Hubert Baniecki, +1 more

- 01 Feb 2024

- Information Fusion

TL;DR: Adversarial attacks on explanations of machine learning models raise concerns about the trustworthiness and security of XAI methods. Recent advances in adversarial machine learning highlight the vulnerabilities of explanation methods and their susceptibility to manipulation and fooling.

37

Journal Article•10.48550/arXiv.2212.10693

Requirements Engineering for Artificial Intelligence Systems: A Systematic Mapping Study

Khlood Ahmad, +4 more

- 20 Dec 2022

- Information & Software Technology

TL;DR: In this paper, the authors performed a systematic mapping study to find papers on current RE4AI approaches, identified 43 primary studies, and analyzed the existing methodologies, models, tools, and techniques used to specify and model requirements in real-world scenarios.

36

References

Book Chapter•10.1017/CBO9781139207249.009

I and J

William Marsden

- 01 Jan 2012

154.7K

Journal Article•10.1136/BMJ.323.7325.1375/A

I and i

Kevin Barraclough

- 08 Dec 2001

- BMJ

TL;DR: There is, I think, something ethereal about i —the square root of minus one, which seems an odd beast at that time—an intruder hovering on the edge of reality.

38.1K

Book

C4.5: Programs for Machine Learning

J. Ross Quinlan

- 15 Oct 1992

TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and overfitting.

27.2K

Journal Article•10.1002/WIDM.8

Classification and regression trees

Wei-Yin Loh

- 01 Jan 2011

- Wiley Interdisciplinary Reviews-Data Min...

TL;DR: This article introduces classification and regression trees by reviewing some widely available algorithms and comparing their capabilities, strengths, and weaknesses in two examples.

18.7K

Proceedings Article•10.1145/2939672.2939778

"Why Should I Trust You?": Explaining the Predictions of Any Classifier

Marco Tulio Ribeiro, +2 more

- 13 Aug 2016

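TL;DR: The authors propose LIME, a technique that explains the predictions of any classifier by learning an interpretable model locally around the prediction. They also propose a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem.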

17.3K