Journal Article10.1016/j.knosys.2023.111281
Document-level relation extraction with three channels
6
TL;DR: This paper proposes DRETC, a document-level relation extraction model with three channels, each extracting relation features from a different perspective, achieving better performance than previous models on the DocRED dataset through channel combination.
read more
Abstract: Relation extraction aims to identify the relation between two entities. Early works on relation extraction are performed within a sentence. Numerous entity pairs with relations are distributed across sentences, so the task of document-level relation extraction is proposed. The current works perform well on document-level relation extraction through graph neural network, transformer, etc. However, these works extract relation features from one perspective, resulting in limited extraction performance of the model. To break through the limitation, this paper proposes the DRETC model consisting of three channels. Channel one first constructs the mention matrix and then extracts the relation features between entities. Channel two performs sequence inference from mention to entity before generating the relation matrix. Moreover, the bridge method is proposed to introduce information in channel one that is beneficial to relation extraction. Channel three directly generates entity representations through text sequences to make up for the absence of text information related to the entity. The three channels extract relation features from three different perspectives, and the combination leads to better extraction performance. Experimental results show that the proposed DRETC model outperforms previous strong baseline models with one channel on the public dataset DocRED. Further analysis demonstrates that the combination of multiple channels has a better relation extraction performance than a single channel.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Semantic-guided attention and adaptive gating for document-level relation extraction
Xiaoyao Ding,Shaopeng Duan,Zheng Zhang +2 more
TL;DR: A novel SAAG model is proposed for document-level relation extraction, leveraging semantic-guided attention and adaptive gating to capture contextual interactions and distinguish between local and global contexts, outperforming previous models on two public datasets.
1
Document-Level Relation Extraction with Structural Encoding And Entity-Pair-Level Information Interaction
Wanlong Liu,Dingyi Zeng,Yunfeng Xiao,Zhou Li,Shaohuan Cheng,Wen Kong,Malu Zhang,Wenyu Chen +7 more
- 01 Jan 2024
Zero-shot relation triplet extraction as Next-Sentence Prediction
Wenxiong Liao,Zhengliang Liu,Yiyang Zhang,Xiaoke Huang,Ninghao Liu,Tianming Liu,Quanzheng Li,Xiang Li,Hongmin Cai +8 more
Document-level relation extraction via dual attention fusion and dynamic asymmetric loss
Xiaoyao Ding,Dongyan Ding,Gang Zhou,Jicang Lu,Taojie Zhu +4 more
Improving inference via rich path information and logic rules for document-level relation extraction
Huizhe Su,Shaorong Xie,Hang Yu,Changsen Yuan,Xinzhi Wang,Xiangfeng Luo +5 more
References
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
•Proceedings Article
Attention is All you Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Lukasz Kaiser,Illia Polosukhin +7 more
- 12 Jun 2017
TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.
Glove: Global Vectors for Word Representation
Jeffrey Pennington,Richard Socher,Christopher D. Manning +2 more
- 01 Oct 2014
TL;DR: A new global logbilinear regression model that combines the advantages of the two major model families in the literature: global matrix factorization and local context window methods and produces a vector space with meaningful substructure.
•Posted Content
Semi-Supervised Classification with Graph Convolutional Networks
Thomas Kipf,Max Welling +1 more
TL;DR: A scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs which outperforms related methods by a significant margin.
22.7K
Graph Attention Networks
Petar Veličković,Guillem Cucurull,Arantxa Casanova,Adriana Romero,Pietro Liò,Yoshua Bengio +5 more
- 15 Feb 2018
TL;DR: Graph Attention Networks (GATs) as mentioned in this paper leverage masked self-attentional layers to address the shortcomings of prior methods based on graph convolutions or their approximations.