Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV).

doi:10.48550/arxiv.1711.11279

10.48550/arxiv.1711.11279

Interpretability Beyond Feature Attribution: Quantitative Testing with Concept Activation Vectors (TCAV).

Been Kim, +6 more

223

TL;DR: Researchers introduce Concept Activation Vectors (CAVs) to interpret deep learning models, enabling quantitative testing of concept importance through directional derivatives, and demonstrate its application in image classification and medical domains for hypothesis exploration and insight generation.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1016/J.INFFUS.2019.12.012

Explainable Artificial Intelligence (XAI): Concepts, Taxonomies, Opportunities and Challenges toward Responsible AI

Alejandro Barredo Arrieta, +13 more

- 01 Jun 2020

- Information Fusion

TL;DR: In this paper, a taxonomy of recent contributions related to explainability of different machine learning models, including those aimed at explaining Deep Learning methods, is presented, and a second dedicated taxonomy is built and examined in detail.

...read moreread less

4.7K

•Journal Article•10.1109/JPROC.2021.3060483

Explaining Deep Neural Networks and Beyond: A Review of Methods and Applications

Wojciech Samek, +4 more

- 17 Mar 2020

- arXiv: Learning

TL;DR: In this paper, the authors provide a timely overview of explainable AI, with a focus on 'post-hoc' explanations, explain its theoretical foundations, and put interpretability algorithms to a test both from a theory and comparative evaluation perspective using extensive simulations.

...read moreread less

709

•Journal Article•10.1109/TETCI.2021.3100641

A Survey on Neural Network Interpretability

Yu Zhang, +3 more

- 24 Aug 2021

TL;DR: A comprehensive review of the neural network interpretability research can be found in this paper, where a novel taxonomy organized along three dimensions: type of engagement (passive vs. active interpretation approaches), the type of explanation, and the focus (from local to global interpretability).

...read moreread less

708

•Book Chapter•10.1007/978-3-030-28954-6_1

Towards Explainable Artificial Intelligence

Wojciech Samek, +3 more

- 10 Sep 2019

TL;DR: This introductory paper presents recent developments and applications in deep learning, and makes a plea for a wider use of explainable learning algorithms in practice.

...read moreread less

589

•Journal Article•10.1073/PNAS.1907375117

Understanding the role of individual units in a deep neural network.

David Bau, +5 more

- 01 Sep 2020

- Proceedings of the National Academy of S...

TL;DR: This work presents network dissection, an analytic framework to systematically identify the semantics of individual hidden units within image classification and image generation networks, and applies it to understanding adversarial attacks and to semantic image editing.

...read moreread less

478

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

•Journal Article•10.1007/S11263-015-0816-Y

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, +11 more

- 01 Dec 2015

- International Journal of Computer Vision

TL;DR: The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) as mentioned in this paper is a benchmark in object category classification and detection on hundreds of object categories and millions of images, which has been run annually from 2010 to present, attracting participation from more than fifty institutions.

...read moreread less

41.6K

•Proceedings Article•10.1109/CVPR.2016.308

Rethinking the Inception Architecture for Computer Vision

Christian Szegedy, +4 more

- 27 Jun 2016

TL;DR: In this article, the authors explore ways to scale up networks in ways that aim at utilizing the added computation as efficiently as possible by suitably factorized convolutions and aggressive regularization.

...read moreread less

27.9K

•Proceedings Article•10.1109/ICCV.2017.244

Unpaired Image-to-Image Translation Using Cycle-Consistent Adversarial Networks

Jun-Yan Zhu, +3 more

- 01 Oct 2017

TL;DR: CycleGAN as discussed by the authors learns a mapping G : X → Y such that the distribution of images from G(X) is indistinguishable from the distribution Y using an adversarial loss.

...read moreread less

19.5K

...

Expand