Document-level relation extraction with Entity-Selection Attention

doi:10.1016/J.INS.2021.04.007

Journal Article10.1016/J.INS.2021.04.007

Document-level relation extraction with Entity-Selection Attention

Changsen Yuan, +4 more

- 01 Aug 2021

- Information Sciences

- Vol. 568, pp 163-174

29

TL;DR: This paper proposes a document-level relation extraction framework with two advantages that use the encoder to obtain the semantic features about the document and use the inter-sentence attention based on entity pairs to dynamically capture the features of multiple vital sentences.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1016/j.ins.2021.10.047

A pattern-aware self-attention network for distant supervised relation extraction

None Fachgesellschaften Socits des disciplines mdicales

- 01 Jan 2022

- Information Sciences

TL;DR: In this article , a self-attention network is designed to generate the probability distribution of all patterns in a sentence, and then a probability distribution is applied as a constraint in the first Transformer layer to encourage its attention heads to follow the relational pattern structures.

...read moreread less

27

•Journal Article•10.1016/J.INS.2021.05.045

Classifier-adaptation knowledge distillation framework for relation extraction and event detection with imbalanced data

Dandan Song, +3 more

- 26 May 2021

- Information Sciences

TL;DR: A Classifier-Adaptation Knowledge Distillation (CAKD) framework is proposed to address fundamental information extraction tasks, thus improving relation extraction and event detection performance and demonstrating the effectiveness of the proposed framework.

...read moreread less

25

Journal Article•10.1016/J.INS.2021.07.020

Document image layout analysis via explicit edge embedding network

Xingjiao Wu, +4 more

- 01 Oct 2021

- Information Sciences

TL;DR: A novel document layout analysis framework with the Explicit Edge Embedding Network (E3 Net), which contains the edge embedding block and dynamic skip connection block to produce detailed features, as well as a lightweight fully convolutional subnet as the backbone for the effectiveness of the framework.

...read moreread less

25

Journal Article•10.1016/J.INS.2021.10.047

A Pattern-aware Self-attention Network for Distant Supervised Relation Extraction

Yuming Shang, +5 more

- 01 Jan 2022

- Information Sciences

TL;DR: In this paper, a self-attention network is designed to generate the probability distribution of all patterns in a sentence, and then a probability distribution is applied as a constraint in the first Transformer layer to encourage its attention heads to follow the relational pattern structures.

...read moreread less

13

Journal Article•10.1016/j.knosys.2022.109146

Heterogenous affinity graph inference network for document-level relation extraction

Rongzhen Li, +3 more

- 01 May 2022

- Knowledge Based Systems

TL;DR: Wang et al. as discussed by the authors proposed to explicitly model the heterogeneous affinity graph, HAG, including a mention graph (MG) and a coreference graph (CG), to capture the reasoning clues from the adjacent affinity matrix.

...read moreread less

9

...

Expand

References

•Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

94.2K

•Proceedings Article•10.3115/V1/D14-1179

Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation

Kyunghyun Cho, +8 more

- 01 Jan 2014

TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.

...read moreread less

28.6K

Proceedings Article•10.18653/V1/N19-1423

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

TL;DR: BERT as mentioned in this paper pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

24.6K

•Proceedings Article•10.18653/V1/P16-1162

Neural Machine Translation of Rare Words with Subword Units

Rico Sennrich, +2 more

- 12 Aug 2016

TL;DR: This paper introduces a simpler and more effective approach, making the NMT model capable of open-vocabulary translation by encoding rare and unknown words as sequences of subword units, and empirically shows that subword models improve over a back-off dictionary baseline for the WMT 15 translation tasks English-German and English-Russian by 1.3 BLEU.

...read moreread less

9.3K

•Posted Content

Attention Is All You Need

Ashish Vaswani, +7 more

- 12 Jun 2017

- arXiv: Computation and Language

TL;DR: A new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely is proposed, which generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

...read moreread less

7.2K