Journal Article10.1016/J.INS.2021.04.007
Document-level relation extraction with Entity-Selection Attention
29
TL;DR: This paper proposes a document-level relation extraction framework with two advantages that use the encoder to obtain the semantic features about the document and use the inter-sentence attention based on entity pairs to dynamically capture the features of multiple vital sentences.
read more
About: This article is published in Information Sciences. The article was published on 01 Aug 2021. The article focuses on the topics: Relationship extraction & Relation (database).
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A pattern-aware self-attention network for distant supervised relation extraction
TL;DR: In this article , a self-attention network is designed to generate the probability distribution of all patterns in a sentence, and then a probability distribution is applied as a constraint in the first Transformer layer to encourage its attention heads to follow the relational pattern structures.
27
Classifier-adaptation knowledge distillation framework for relation extraction and event detection with imbalanced data
TL;DR: A Classifier-Adaptation Knowledge Distillation (CAKD) framework is proposed to address fundamental information extraction tasks, thus improving relation extraction and event detection performance and demonstrating the effectiveness of the proposed framework.
25
Document image layout analysis via explicit edge embedding network
TL;DR: A novel document layout analysis framework with the Explicit Edge Embedding Network (E3 Net), which contains the edge embedding block and dynamic skip connection block to produce detailed features, as well as a lightweight fully convolutional subnet as the backbone for the effectiveness of the framework.
25
A Pattern-aware Self-attention Network for Distant Supervised Relation Extraction
Yuming Shang,None Fachgesellschaften Socits des disciplines mdicales,Heyan Huang,Xin Sun,Wei Wei,Xian-Ling Mao +5 more
TL;DR: In this paper, a self-attention network is designed to generate the probability distribution of all patterns in a sentence, and then a probability distribution is applied as a constraint in the first Transformer layer to encourage its attention heads to follow the relational pattern structures.
13
Heterogenous affinity graph inference network for document-level relation extraction
TL;DR: Wang et al. as discussed by the authors proposed to explicitly model the heterogeneous affinity graph, HAG, including a mention graph (MG) and a coreference graph (CG), to capture the reasoning clues from the adjacent affinity matrix.
9
References
•Proceedings Article
Attention is All you Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Lukasz Kaiser,Illia Polosukhin +7 more
- 12 Jun 2017
TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.
Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation
Kyunghyun Cho,Bart van Merriënboer,Caglar Gulcehre,Dzmitry Bahdanau,Fethi Bougares,Holger Schwenk,Yoshua Bengio,Yoshua Bengio,Yoshua Bengio +8 more
- 01 Jan 2014
TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Jacob Devlin,Ming-Wei Chang,Kenton Lee,Kristina Toutanova +3 more
- 11 Oct 2018
TL;DR: BERT as mentioned in this paper pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
24.6K
Neural Machine Translation of Rare Words with Subword Units
Rico Sennrich,Barry Haddow,Alexandra Birch +2 more
- 12 Aug 2016
TL;DR: This paper introduces a simpler and more effective approach, making the NMT model capable of open-vocabulary translation by encoding rare and unknown words as sequences of subword units, and empirically shows that subword models improve over a back-off dictionary baseline for the WMT 15 translation tasks English-German and English-Russian by 1.3 BLEU.
•Posted Content
Attention Is All You Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Lukasz Kaiser,Illia Polosukhin +7 more
TL;DR: A new simple network architecture, the Transformer, based solely on attention mechanisms, dispensing with recurrence and convolutions entirely is proposed, which generalizes well to other tasks by applying it successfully to English constituency parsing both with large and limited training data.
7.2K