Global-to-Local Neural Networks for Document-Level Relation Extraction

doi:10.18653/V1/2020.EMNLP-MAIN.303

Open Access • Proceedings Article

Difeng Wang, +3 more

- 01 Nov 2020

- pp 3711-3721

143

TL;DR: A novel model for document-level relation extraction (RE) is proposed that encodes document information as entity global and local representations together with context relation representations. It is particularly effective at extracting relations between entities that are far apart in the text or that have multiple mentions.

Citations

Journal Article•10.48550/arxiv.2311.07314

Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models

Junpeng Li, +2 more

- 13 Nov 2023

- arXiv.org

TL;DR: This work integrates a large language model (LLM) with a natural language inference (NLI) module to generate relation triples, thereby augmenting document-level relation datasets. It demonstrates the approach by introducing an enhanced dataset, DocGNRE, which excels at re-annotating numerous long-tail relation types.

Proceedings Article•10.48550/arXiv.2205.14393

Relation-Specific Attentions over Entity Mentions for Enhanced Document-Level Relation Extraction

Jiaxin Yu, +2 more

- 28 May 2022

TL;DR: This paper proposes RSMAN, which applies selective attention over an entity's different mentions with respect to each candidate relation, yielding flexible, relation-specific entity representations that benefit relation classification.

Journal Article•10.48550/arxiv.2402.14521

Malaysian English News Decoded: A Linguistic Resource for Named Entity and Relation Extraction

Mohan Raj Chanthran, +3 more

- 22 Feb 2024

- arXiv.org

TL;DR: This paper constructed a Malaysian English News (MEN) dataset, which contains 200 news articles that are manually annotated with entities and relations, and fine-tuned the spaCy NER tool and validated that having a dataset tailor-made for Malaysian English could improve the performance of NER in Malaysian English significantly.

Journal Article•10.1016/j.aiopen.2024.08.002

Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature

Chenyang Song, +7 more

- 01 Jan 2024

- AI open

Proceedings Article•10.1109/ialp61005.2023.10337090

RADM-DRE: Retrieval Augmentation for Document-Level Relation Extraction with Diffusion Model

Qing Zhang, +7 more

- 18 Nov 2023

TL;DR: The authors argue that data generated from the distribution of the raw data, beyond the raw data itself, can provide more informative augmentation and can relax the strong assumption that the original raw data must be accessible at the testing stage.

References

•Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

138.5K

•Proceedings Article

Attention Is All You Need

Ashish Vaswani, +7 more

- 12 Jun 2017

TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.

94.2K

•Posted Content

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 22 Dec 2014

- arXiv: Learning

TL;DR: This article introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments.

82.5K

•Posted Content

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

Jacob Devlin, +3 more

- 11 Oct 2018

- arXiv: Computation and Language

TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.

81.7K

Preprint•10.48550/arxiv.1706.03762

Attention Is All You Need

Ashish Vaswani, +7 more

- 01 Jan 2017

Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.

51.8K