Open AccessPosted Content
Axiomatic Attribution for Deep Networks
TL;DR: The problem of attributing the prediction of a deep network to its input features, a problem previously studied by several other works, is studied and two fundamental axioms— Sensitivity and Implementation Invariance that attribution methods ought to satisfy are identified.
read more
Abstract: We study the problem of attributing the prediction of a deep network to its input features, a problem previously studied by several other works. We identify two fundamental axioms---Sensitivity and Implementation Invariance that attribution methods ought to satisfy. We show that they are not satisfied by most known attribution methods, which we consider to be a fundamental weakness of those methods. We use the axioms to guide the design of a new attribution method called Integrated Gradients. Our method requires no modification to the original network and is extremely simple to implement; it just needs a few calls to the standard gradient operator. We apply this method to a couple of image models, a couple of text models and a chemistry model, demonstrating its ability to debug networks, to extract rules from a network, and to enable users to engage with models better.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
Rearchitecting Classification Frameworks For Increased Robustness
TL;DR: It is found that applying invariants to the classification task makes robustness and accuracy feasible together, and designs a classification paradigm that leverages these invariances to improve the robustness accuracy trade-off.
•Dissertation
Anomaly detection through explanations
Leilani H. Gilpin
- 01 Jan 2020
TL;DR: This thesis contributes a system architecture that reconciles local errors and inconsistencies amongst parts, and a systemwide architecture, Anomaly Detection through Explanations (ADE), which reconciles system-wide failures.
9
•Posted Content
Contextual Prediction Difference Analysis for Explaining Individual Image Classifications
Jindong Gu,Volker Tresp +1 more
TL;DR: This work first shows that PDA can suffer from saturated classifiers, then proposes Contextual PDA, which runs hundreds of times faster than PDA and is shown to be superior by explaining image classifications of the state-of-the-art deep convolutional neural networks.
9
Spatio-Temporal Perturbations for Video Attribution
TL;DR: A generic perturbation-based attribution method that is compatible with diversified video understanding networks is investigated and a novel regularization term is proposed to enhance the method by constraining the smoothness of its attribution results in both spatial and temporal dimensions.
9
•Posted Content
Understanding Image Captioning Models beyond Visualizing Attention
TL;DR: Variants of layer-wise relevance backpropagation (LRP) and gradient back Propagation, tailored to image captioning models with attention mechanisms, are developed and shown to correlate to object locations with higher precision than attention.
9
References
Going deeper with convolutions
Christian Szegedy,Wei Liu,Yangqing Jia,Pierre Sermanet,Scott Reed,Dragomir Anguelov,Dumitru Erhan,Vincent Vanhoucke,Andrew Rabinovich +8 more
- 07 Jun 2015
TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky,Jia Deng,Hao Su,Jonathan Krause,Sanjeev Satheesh,Sean Ma,Zhiheng Huang,Andrej Karpathy,Aditya Khosla,Michael S. Bernstein,Alexander C. Berg,Li Fei-Fei +11 more
TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.
•Posted Content
Neural Machine Translation by Jointly Learning to Align and Translate
TL;DR: In this paper, the authors propose to use a soft-searching model to find the parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
20.9K
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro,Sameer Singh,Carlos Guestrin +2 more
- 13 Aug 2016
TL;DR: In this article, the authors propose LIME, a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem.
Visualizing and Understanding Convolutional Networks
Matthew D. Zeiler,Rob Fergus +1 more
- 06 Sep 2014
TL;DR: A novel visualization technique is introduced that gives insight into the function of intermediate feature layers and the operation of the classifier in large Convolutional Network models, used in a diagnostic role to find model architectures that outperform Krizhevsky et al on the ImageNet classification benchmark.
16.6K
Related Papers (5)
Matthew D. Zeiler,Rob Fergus +1 more
- 06 Sep 2014
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016