Open AccessPosted Content
Axiomatic Attribution for Deep Networks
TL;DR: The problem of attributing the prediction of a deep network to its input features, a problem previously studied by several other works, is studied and two fundamental axioms— Sensitivity and Implementation Invariance that attribution methods ought to satisfy are identified.
read more
Abstract: We study the problem of attributing the prediction of a deep network to its input features, a problem previously studied by several other works. We identify two fundamental axioms---Sensitivity and Implementation Invariance that attribution methods ought to satisfy. We show that they are not satisfied by most known attribution methods, which we consider to be a fundamental weakness of those methods. We use the axioms to guide the design of a new attribution method called Integrated Gradients. Our method requires no modification to the original network and is extremely simple to implement; it just needs a few calls to the standard gradient operator. We apply this method to a couple of image models, a couple of text models and a chemistry model, demonstrating its ability to debug networks, to extract rules from a network, and to enable users to engage with models better.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Enhanced Integrated Gradients: improving interpretability of deep learning models using splicing codes as a case study.
TL;DR: It is demonstrated that EIG improves upon the original Integrated Gradients method and produces sets of informative features and is applied to identify A1CF as a key regulator of liver-specific alternative splicing.
Reinforcement Learning Interpretation Methods: A Survey
TL;DR: The main objective of this paper is to show and explain RL interpretation methods, the metrics used to classify them, and how these metrics were applied to understand the internal details of RL models.
•Posted Content
Symbolic Execution for Deep Neural Networks.
TL;DR: DeepCheck is introduced, a new approach for validating DNNs based on core ideas from program analysis, specifically from symbolic execution to translate a DNN into an imperative program, thereby enabling program analysis to assist with DNN validation.
72
Classifying Alzheimer's disease with brain imaging and genetic data using a neural network framework.
Kaida Ning,Bo Chen,Fengzhu Sun,Zachary Hobel,Lu Zhao,Will Matloff,Alzheimer’s Disease Neuroimaging Initiative,Arthur W. Toga +7 more
TL;DR: The ability of NN models to not only classify and predict AD occurrence but also to identify important AD risk factors and interactions among them is shown.
72
ToxDL: deep learning using primary structure and domain embeddings for assessing protein toxicity
Xiaoyong Pan,Xiaoyong Pan,Jasper Zuallaert,Xi Wang,Hong-Bin Shen,Elda Posada Campos,Denys Marushchak,Wesley De Neve +7 more
TL;DR: The proposed ToxDL network outperforms traditional homology-based approaches and state-of-the-art machine learning techniques and allows for directed in silico modification of a sequence, thus making it possible to alter its predicted protein toxicity.
References
Going deeper with convolutions
Christian Szegedy,Wei Liu,Yangqing Jia,Pierre Sermanet,Scott Reed,Dragomir Anguelov,Dumitru Erhan,Vincent Vanhoucke,Andrew Rabinovich +8 more
- 07 Jun 2015
TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
ImageNet Large Scale Visual Recognition Challenge
Olga Russakovsky,Jia Deng,Hao Su,Jonathan Krause,Sanjeev Satheesh,Sean Ma,Zhiheng Huang,Andrej Karpathy,Aditya Khosla,Michael S. Bernstein,Alexander C. Berg,Li Fei-Fei +11 more
TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.
•Posted Content
Neural Machine Translation by Jointly Learning to Align and Translate
TL;DR: In this paper, the authors propose to use a soft-searching model to find the parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.
20.9K
"Why Should I Trust You?": Explaining the Predictions of Any Classifier
Marco Tulio Ribeiro,Sameer Singh,Carlos Guestrin +2 more
- 13 Aug 2016
TL;DR: In this article, the authors propose LIME, a method to explain models by presenting representative individual predictions and their explanations in a non-redundant way, framing the task as a submodular optimization problem.
Visualizing and Understanding Convolutional Networks
Matthew D. Zeiler,Rob Fergus +1 more
- 06 Sep 2014
TL;DR: A novel visualization technique is introduced that gives insight into the function of intermediate feature layers and the operation of the classifier in large Convolutional Network models, used in a diagnostic role to find model architectures that outperform Krizhevsky et al on the ImageNet classification benchmark.
16.6K
Related Papers (5)
Matthew D. Zeiler,Rob Fergus +1 more
- 06 Sep 2014
Kaiming He,Xiangyu Zhang,Shaoqing Ren,Jian Sun +3 more
- 27 Jun 2016