Incorporating Copying Mechanism in Sequence-to-Sequence Learning

doi:10.18653/V1/P16-1154

Open AccessProceedings Article10.18653/V1/P16-1154

Incorporating Copying Mechanism in Sequence-to-Sequence Learning

Jiatao Gu, +3 more

- 21 Mar 2016

- Vol. 1, pp 1631-1640

1.5K

TL;DR: CopyNet as discussed by the authors incorporates copying into neural network-based Seq2Seq learning and proposes a new model called CopyNet with encoder-decoder structure, which can nicely integrate the regular way of word generation in the decoder with the new copying mechanism which can choose sub-sequences in the input sequence and put them at proper places in the output sequence.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Break It Down: A Question Understanding Benchmark

Tomer Wolfson, +7 more

- 31 Jan 2020

- arXiv: Computation and Language

TL;DR: This work introduces a Question Decomposition Meaning Representation (QDMR) for questions, and demonstrates the utility of QDMR by showing that it can be used to improve open-domain question answering on the HotpotQA dataset, and can be deterministically converted to a pseudo-SQL formal language, which can alleviate annotation in semantic parsing applications.

...read moreread less

21

Journal Article•10.1007/S00521-018-3946-7

Neural abstractive summarization fusing by global generative topics

Yang Gao, +4 more

- 01 May 2020

- Neural Computing and Applications

TL;DR: This work proposes to incorporate a neural generative topic matrix as an abstractive level of topic information into a summarization generation system that is capable of generating succinct and recapitulative words or phrases.

...read moreread less

21

Journal Article•10.1007/S00521-018-3825-2

Cross-domain aspect/sentiment-aware abstractive review summarization by combining topic modeling and deep reinforcement learning

Min Yang, +4 more

- 01 Jun 2020

- Neural Computing and Applications

TL;DR: The novel model Abstractive review Summarization with Topic modeling and Reinforcement deep learning (ASTR) leverages the benefits of the supervised deep neural networks, reinforcement learning, and unsupervised probabilistic generative model to strengthen the aspect/sentiment-aware review representation learning.

...read moreread less

21

Proceedings Article•10.18653/V1/D19-1677

Softregex: Generating regex from natural language descriptions using softened regex equivalence

Park Jun U, +4 more

- 01 Nov 2019

TL;DR: A new regex generation model, SoftRegex, is proposed, us-ing the EQ_Reg model, and it is empirically demonstrated that SoftRe regex substantially reduces the training time and produces state-of-the-art results on three benchmark datasets.

...read moreread less

21

•Proceedings Article•10.18653/V1/D19-1056

Translate and label! An encoder-decoder approach for cross-lingual semantic role labeling

Angel Daza, +1 more

- 01 Nov 2019

TL;DR: This article proposed a cross-lingual encoder-decoder model that simultaneously translates and generates sentences with SRL annotations in a resource-poor target language, but their model does not need parallel data during inference time.

...read moreread less

21

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

Journal Article•10.1162/NECO.1997.9.8.1735

Long short-term memory

Sepp Hochreiter, +1 more

- 01 Nov 1997

- Neural Computation

TL;DR: A novel, efficient, gradient based method called long short-term memory (LSTM) is introduced, which can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units.

...read moreread less

99K

•Proceedings Article•10.3115/V1/D14-1179

Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation

Kyunghyun Cho, +8 more

- 01 Jan 2014

TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.

...read moreread less

28.6K

•Proceedings Article

Neural Machine Translation by Jointly Learning to Align and Translate

Dzmitry Bahdanau, +2 more

- 01 Jan 2015

TL;DR: It is conjecture that the use of a fixed-length vector is a bottleneck in improving the performance of this basic encoder-decoder architecture, and it is proposed to extend this by allowing a model to automatically (soft-)search for parts of a source sentence that are relevant to predicting a target word, without having to form these parts as a hard segment explicitly.

...read moreread less

25.7K

•Proceedings Article

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

- 08 Dec 2014

TL;DR: The authors used a multilayered Long Short-Term Memory (LSTM) to map the input sequence to a vector of a fixed dimensionality, and then another deep LSTM to decode the target sequence from the vector.

...read moreread less

20.1K

...

Expand

Incorporating Copying Mechanism in Sequence-to-Sequence Learning

Chat with Paper

AI Agents for this Paper

Citations

Break It Down: A Question Understanding Benchmark

Neural abstractive summarization fusing by global generative topics

Cross-domain aspect/sentiment-aware abstractive review summarization by combining topic modeling and deep reinforcement learning

Softregex: Generating regex from natural language descriptions using softened regex equivalence

Translate and label! An encoder-decoder approach for cross-lingual semantic role labeling

References

Deep Residual Learning for Image Recognition

Long short-term memory

Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation

Neural Machine Translation by Jointly Learning to Align and Translate

Sequence to Sequence Learning with Neural Networks

Related Papers (5)

Neural Machine Translation by Jointly Learning to Align and Translate

Bleu: a Method for Automatic Evaluation of Machine Translation

Sequence to Sequence Learning with Neural Networks

Attention is All you Need

Adam: A Method for Stochastic Optimization