Global-to-Local Neural Networks for Document-Level Relation Extraction
Difeng Wang,Wei Hu,Ermei Cao,Weijian Sun +3 more
- 01 Nov 2020
- pp 3711-3721
TL;DR: A novel model to document-level RE is proposed, by encoding the document information in terms of entity global and local representations as well as context relation representations, which is particularly effective in extracting relations between entities of long distance and having multiple mentions.
read more
Abstract: Relation extraction (RE) aims to identify the semantic relations between named entities in text. Recent years have witnessed it raised to the document level, which requires complex reasoning with entities and mentions throughout an entire document. In this paper, we propose a novel model to document-level RE, by encoding the document information in terms of entity global and local representations as well as context relation representations. Entity global representations model the semantic information of all entities in the document, entity local representations aggregate the contextual information of multiple mentions of specific entities, and context relation representations encode the topic information of other relations. Experimental results demonstrate that our model achieves superior performance on two public datasets for document-level RE. It is particularly effective in extracting relations between entities of long distance and having multiple mentions.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Semi-automatic Data Enhancement for Document-Level Relation Extraction with Distant Supervision from Large Language Models
Junpeng Li,Zixia Jia,Zilong Zheng +2 more
TL;DR: This work proposes a method integrating a large language model (LLM) and a natural language inference (NLI) module to generate relation triples, thereby augmenting document-level relation datasets and demonstrates the effectiveness of the approach by introducing an enhanced dataset known as DocGNRE, which excels in re-annotating numerous long-tail relation types.
Relation-Specific Attentions over Entity Mentions for Enhanced Document-Level Relation Extraction
Jiaxin Yu,Deqing Yang,Shuyu Tian +2 more
- 28 May 2022
TL;DR: RSMAN is proposed in this paper which performs selective attentions over different entity mentions with respect to candidate relations so that the flexible and relation-specific representations of entities are obtained which indeed benefit relation classification.
Malaysian English News Decoded: A Linguistic Resource for Named Entity and Relation Extraction
Mohan Raj Chanthran,Lay-Ki Soon,Huey Fang Ong,Bhawani Selvaretnam +3 more
TL;DR: This paper constructed a Malaysian English News (MEN) dataset, which contains 200 news articles that are manually annotated with entities and relations, and fine-tuned the spaCy NER tool and validated that having a dataset tailor-made for Malaysian English could improve the performance of NER in Malaysian English significantly.
Relation-aware deep neural network enables more efficient biomedical knowledge acquisition from massive literature
Chenyang Song,Zheni Zeng,Changyao Tian,Kuai Li,Yuan Yao,Suncong Zheng,Zhiyuan Liu,Maosong Sun +7 more
RADM-DRE:Retrieval Augmentation for Document-Level Relation Extraction with Diffusion Model
Qing Zhang,Qingsong Yuan,Jianyong Duan,Yuhang Jiang,Hao Wang,Zhengxin Gao,Li He,Jie Liu +7 more
- 18 Nov 2023
TL;DR: It is argued that the data generated from the distribution of raw data beyond the raw data itself can provide more informative augmentation and can relax the strong assumption that the original raw data must be accessible in testing stage.
References
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
•Proceedings Article
Attention is All you Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Lukasz Kaiser,Illia Polosukhin +7 more
- 12 Jun 2017
TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.
•Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.
82.5K
•Posted Content
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
81.7K
Attention Is All You Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Łukasz Kaiser,Illia Polosukhin +7 more
- 01 Jan 2017
Abstract: The dominant sequence transduction models are based on complex recurrent or convolutional neural networks in an encoder-decoder configuration. The best performing models also connect the encoder and decoder through an attention mechanism. We propose a new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely. Experiments on two machine translation tasks show these models to be superior in quality while being more parallelizable and requiring significantly less time to train. Our model achieves 28.4 BLEU on the WMT 2014 English-to-German translation task, improving over the existing best results, including ensembles by over 2 BLEU. On the WMT 2014 English-to-French translation task, our model establishes a new single-model state-of-the-art BLEU score of 41.8 after training for 3.5 days on eight GPUs, a small fraction of the training costs of the best models from the literature. We show that the Transformer generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.
51.8K