Generating Natural Language Inference Chains.

Open Access • Posted Content

- 04 Jun 2016

26

TL;DR: This paper proposes a new task that measures how well a model can generate an entailed sentence from a source sentence. An LSTM with attention is trained on entailment pairs from the Stanford Natural Language Inference corpus and then applied recursively to input-output pairs, thereby generating natural language inference chains.

Abstract: The ability to reason with natural language is a fundamental prerequisite for many NLP tasks such as information extraction, machine translation and question answering. To quantify this ability, systems are commonly tested on whether they can recognize textual entailment, i.e., whether one sentence can be inferred from another one. However, in most NLP applications, only single source sentences are available rather than sentence pairs. Hence, we propose a new task that measures how well a model can generate an entailed sentence from a source sentence. We take entailment pairs from the Stanford Natural Language Inference corpus and train an LSTM with attention. On a manually annotated test set we found that 82% of generated sentences are correct, an improvement of 10.3% over an LSTM baseline. A qualitative analysis shows that this model is capable not only of shortening input sentences, but also of inferring new statements via paraphrasing and phrase entailment. We then apply this model recursively to input-output pairs, thereby generating natural language inference chains that can be used to automatically construct an entailment graph from source sentences. Finally, by swapping source and target sentences, we can also train a model that, given an input sentence, invents additional information to generate a new sentence.
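
The recursive step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the trained LSTM is replaced by a hypothetical lookup table (`TOY_MODEL`), the example sentences are invented for illustration, and the function names `inference_chain` and `entailment_graph` are assumptions. The sketch only shows how feeding each output back in as the next input yields a chain, and how chains from several source sentences can be merged into a graph of premise-to-hypothesis edges.

```python
from typing import Callable, Dict, Iterable, List, Set

# Hypothetical stand-in for the trained generator: maps a sentence to one
# entailed sentence. The sentences are invented, not taken from the paper.
TOY_MODEL: Dict[str, str] = {
    "A young girl is riding a horse outdoors": "A girl is riding a horse",
    "A girl is riding a horse": "A person is on an animal",
}


def toy_generate(sentence: str) -> str:
    """Return an entailed sentence, or "" if the toy model has none."""
    return TOY_MODEL.get(sentence, "")


def inference_chain(
    source: str, generate: Callable[[str], str], max_steps: int = 5
) -> List[str]:
    """Apply `generate` recursively, feeding each output back in as input."""
    chain = [source]
    seen = {source}
    current = source
    for _ in range(max_steps):
        nxt = generate(current)
        # Stop on an empty output or a repeated sentence (avoids cycles).
        if not nxt or nxt in seen:
            break
        chain.append(nxt)
        seen.add(nxt)
        current = nxt
    return chain


def entailment_graph(
    sources: Iterable[str], generate: Callable[[str], str], max_steps: int = 5
) -> Dict[str, Set[str]]:
    """Merge the chains of several sources into premise -> hypotheses edges."""
    graph: Dict[str, Set[str]] = {}
    for source in sources:
        chain = inference_chain(source, generate, max_steps)
        for premise, hypothesis in zip(chain, chain[1:]):
            graph.setdefault(premise, set()).add(hypothesis)
    return graph


if __name__ == "__main__":
    for sentence in inference_chain(
        "A young girl is riding a horse outdoors", toy_generate
    ):
        print(sentence)
```

In practice `generate` would wrap a decoding call to the trained model; the `max_steps` cap and the repeated-sentence check are design choices of this sketch, not details reported in the abstract.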

Citations

•Proceedings Article

Neural Paraphrase Generation with Stacked Residual LSTM Networks

Aaditya Prakash, +6 more

- 01 Dec 2016

TL;DR: The authors proposed a stacked residual LSTM network for paraphrase generation, which adds residual connections between LSTM layers for efficient training, and achieved state-of-the-art performance on three different datasets: PPDB, WikiAnswers and MSCOCO.

268

•Posted Content

Transforming Question Answering Datasets Into Natural Language Inference Datasets

Dorottya Demszky, +2 more

- 09 Sep 2018

- arXiv: Computation and Language

TL;DR: This work proposes a new method for automatically deriving NLI datasets from the growing abundance of large-scale question answering datasets, and relies on learning a sentence transformation model which converts question-answer pairs into their declarative forms.

210

•Proceedings Article

Towards Text Generation with Adversarially Learned Neural Outlines

Sandeep Subramanian, +5 more

- 01 Dec 2018

TL;DR: This article proposed a combination of autoregressive and adversarial models with the goal of learning generative models of text, which produces a high-level sentence outline and then generates words sequentially, conditioning on both the outline and the previous outputs.

61

•Proceedings Article•10.18653/V1/D17-1292

Detecting and Explaining Causes From Text For a Time Series Event.

Dongyeop Kang, +4 more

- 01 Sep 2017

TL;DR: This paper proposed a method based on the Granger causality of time series between features extracted from text such as N-grams, topics, sentiments, and their composition to detect causal features from text.

41

•Proceedings Article

Grounded Textual Entailment.

Hoa Trong Vu, +8 more

- 01 Aug 2018

TL;DR: The authors compare blind and visual-augmented models of textual entailment and show that visual information is beneficial, but also conduct an in-depth error analysis that reveals that current multimodal models are not performing "grounding" in an optimal fashion.

25

References

•Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

138.5K

•Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient-based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

99K

•Proceedings Article•10.3115/1073083.1073135

Bleu: a Method for Automatic Evaluation of Machine Translation

Kishore Papineni, +3 more

- 06 Jul 2002

TL;DR: This paper proposed a method of automatic machine translation evaluation that is quick, inexpensive, and language-independent, that correlates highly with human evaluation, and that has little marginal cost per run.

28.9K

•Proceedings Article•10.3115/V1/D14-1179

Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation

Kyunghyun Cho, +8 more

- 01 Jan 2014

TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.

28.6K

•Proceedings Article

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau, +2 more

- 01 Jan 2015

TL;DR: It is conjectured that the use of a fixed-length vector is a bottleneck in improving the performance of the basic encoder-decoder architecture, and it is proposed to extend this architecture by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.

25.7K

Related Papers (5)

Long short-term memory

Bleu: a Method for Automatic Evaluation of Machine Translation

Neural Machine Translation by Jointly Learning to Align and Translate

A large annotated corpus for learning natural language inference

Reasoning about Entailment with Neural Attention