Global-to-Local Neural Networks for Document-Level Relation Extraction

doi:10.18653/V1/2020.EMNLP-MAIN.303

Open AccessProceedings Article10.18653/V1/2020.EMNLP-MAIN.303

Global-to-Local Neural Networks for Document-Level Relation Extraction

Difeng Wang, +3 more

- 01 Nov 2020

- pp 3711-3721

143

TL;DR: A novel model to document-level RE is proposed, by encoding the document information in terms of entity global and local representations as well as context relation representations, which is particularly effective in extracting relations between entities of long distance and having multiple mentions.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.18653/V1/2021.FINDINGS-ACL.402

Enhancing Dialogue-based Relation Extraction by Speaker and Trigger Words Prediction

Tianyang Zhao, +3 more

- 01 Aug 2021

3

•Journal Article•10.1007/S12559-021-09917-7

Deep Neural Approaches to Relation Triplets Extraction: a Comprehensive Survey

Tapas Nayak, +3 more

- 31 Mar 2021

- Cognitive Computation

TL;DR: More recently, with the advances made in the continuous representation of words (word embeddings) and deep neural architectures, many research works are published in the area of relation extraction as discussed by the authors.

...read moreread less

3

•Posted Content

Eider: Evidence-enhanced Document-level Relation Extraction.

Yiqing Xie, +4 more

- 16 Jun 2021

- arXiv: Computation and Language

TL;DR: Li et al. as mentioned in this paper proposed a three-stage evidenceenhanced DocRE framework consisting of joint relation and evidence extraction, evidence-centered relation extraction (RE), and fusion of extraction results.

...read moreread less

3

Journal Article•10.3390/e26030210

Enhanced Heterogeneous Graph Attention Network with a Novel Multilabel Focal Loss for Document-Level Relation Extraction

Yang Chen, +1 more

- 28 Feb 2024

- Entropy

TL;DR: This paper proposes a two-stage mention-level framework for document-level relation extraction using an enhanced heterogeneous graph attention network, addressing class imbalance and multilabel prediction with a novel multilabel focal loss function, outperforming existing methods.

...read moreread less

2

Proceedings Article

Document-Level Relation Extraction via Pair-Aware and Entity-Enhanced Representation Learning

Xiusheng Huang, +6 more

TL;DR: This paper designs a Pair-Aware Representation module to predict potential relational entity pairs, which constrains the relation extraction to the predicted entity pairs subset rather than all pairs, and introduces a Entity-Enhanced representation module to assemble directional entity pairs and obtain a holistic understanding of the entire document.

...read moreread less

2

...

Expand

References

•Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

138.5K

•Proceedings Article

Attention is All you Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

...read moreread less

94.2K

•Posted Content

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 22 Dec 2014

- arXiv: Learning

TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.

...read moreread less

82.5K

•Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

...read moreread less

81.7K

Preprint•10.48550/arxiv.1706.03762

Attention Is All You Need

Ashish Vaswani, +7 more

- 01 Jan 2017

Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

...read moreread less

51.8K