Local Interpretations for Explainable Natural Language Processing: A Survey.

Open AccessPosted Content

Local Interpretations for Explainable Natural Language Processing: A Survey.

- 20 Mar 2021

14

TL;DR: This article investigated various methods to improve the interpretability of deep neural networks for natural language processing (NLP) tasks, including machine translation and sentiment analysis, and provided a comprehensive discussion on the definition of the term ''interpretability'' and its various aspects at the beginning of this work.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Ethics Sheet for Automatic Emotion Recognition and Sentiment Analysis.

Saif M. Mohammad

- 17 Sep 2021

- arXiv: Computation and Language

TL;DR: In this article, the authors present fifty ethical considerations relevant to automatic emotion recognition (AER) and sentiment analysis and discuss the implications of AER systems on privacy and social groups.

...read moreread less

61

•Proceedings Article•10.18653/v1/2022.naacl-main.192

Necessity and Sufficiency for Explaining Text Classifiers: A Case Study in Hate Speech Detection

01 Jan 2022

TL;DR: Esma Balkir, Isar Nejadgholi, Kathleen Fraser, Svetlana Kiritchenko as discussed by the authors presented the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies.

...read moreread less

21

Journal Article•10.3389/frai.2023.1225093

Rationalization for explainable NLP: a survey

Sai Gurrapu, +4 more

- 25 Sep 2023

- Frontiers in artificial intelligence

TL;DR: Rationalization is a new subfield in XAI that provides natural language explanations for NLP models. It is intuitive, human-comprehensible, and accessible to non-technical users.

...read moreread less

19

Journal Article•10.1016/j.compeleceng.2024.109246

Explainable AI approaches in deep learning: Advancements, applications and challenges

Md. Tanzib Hosain, +3 more

- 01 Jul 2024

- Computers & Electrical Engineering

18

•Proceedings Article•10.18653/v1/2022.bigscience-1.5

UNIREX: A Unified Learning Framework for Language Model Rationale Extraction

01 Jan 2022

TL;DR: Chan, Maziar Sanjabi, Lambert Mathias, Liang Tan, Shaoliang Nie, Xiaochang Peng, Xiang Ren, Hamed Firooz as mentioned in this paper presented a large language modeling workshop.

...read moreread less

14

References

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

99K

•Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

94.2K

•Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

81.7K

•Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 01 Jan 2015

TL;DR: In this paper, the authors investigated the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting and showed that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 layers.

...read moreread less

51.9K

Preprint•10.48550/arxiv.1706.03762

Attention Is All You Need

Ashish Vaswani, +7 more

- 01 Jan 2017

Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

...read moreread less

51.8K