Proceedings Article10.1145/3477495.3531808
Translation-Based Implicit Annotation Projection for Zero-Shot Cross-Lingual Event Argument Extraction
Chenwei Lou,Jun Gao,Changlong Yu,Wei Wang,Hua Zhao,Weiwei Tu,Rui-Hua Xu +6 more
- 06 Jul 2022
18
TL;DR: A translation- based method to implicitly project annotations from the source language to the target language with the use of translation-based parallel corpora is investigated, which is more cost effective than previous works on zero-shot cross-lingual EAE.
read more
Abstract: Zero-shot cross-lingual event argument extraction (EAE) is a challenging yet practical problem in Information Extraction. Most previous works heavily rely on external structured linguistic features, which are not easily accessible in real-world scenarios. This paper investigates a translation-based method to implicitly project annotations from the source language to the target language. With the use of translation-based parallel corpora, no additional linguistic features are required during training and inference. As a result, the proposed approach is more cost effective than previous works on zero-shot cross-lingual EAE. Moreover, our implicit annotation projection approach introduces less noises and hence is more effective and robust than explicit ones. Experimental results show that our model achieves the best performance, outperforming a number of competitive baselines. The thorough analysis further demonstrates the effectiveness of our model compared to explicit annotation projection approaches.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Mask-then-Fill: A Flexible and Effective Data Augmentation Framework for Event Extraction
Jun Gao,Changlong Yu,Wei Wang,Hua Zhao,Ruifeng Xu +4 more
- 06 Jan 2023
TL;DR: This paper proposed Mask-then-Fill, a flexible and effective data augmentation framework for event extraction which allows for more flexible manipulation of text and thus can generate more diverse data while keeping the original event structure unchanged as much as possible.
Mask-then-Fill: A Flexible and Effective Data Augmentation Framework for Event Extraction
Jun Gao,Chunxiao Yu,Wei Wang,Hang Zhao,Ruifeng Xu +4 more
- 01 Jan 2022
TL;DR: Mask-then-Fill is a flexible and effective data augmentation framework for event extraction that generates diverse data while preserving the original event structure.
Zero-Shot Cross-Lingual Event Argument Extraction with Language-Oriented Prefix-Tuning
TL;DR: This paper proposed a language-oriented prefix generator module to handle the discrepancies between source and target languages and leverage a Language-agnostic template constructor module to design templates that can be adapted to any language.
Event Extraction: A Survey
TL;DR: This report provides the task definition, the evaluation method, as well as the benchmark datasets and a taxonomy of methodologies for event extraction, and presents the vision of future research direction in event detection.
5
Constructing Code-mixed Universal Dependency Forest for Unbiased Cross-lingual Relation Extraction
Hao Fei,Meishan Zhang,Min Zhang,Tat‐Seng Chua +3 more
- 01 Jan 2023
TL;DR: Constructing code-mixed universal dependency forest for unbiased cross-lingual relation extraction achieves significant performance gains on the ACE XRE benchmark datasets.
References
•Proceedings Article
Attention is All you Need
Ashish Vaswani,Noam Shazeer,Niki Parmar,Jakob Uszkoreit,Llion Jones,Aidan N. Gomez,Lukasz Kaiser,Illia Polosukhin +7 more
- 12 Jun 2017
TL;DR: This paper proposed a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely and achieved state-of-the-art performance on English-to-French translation.
•Posted Content
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
TL;DR: A new language representation model, BERT, designed to pre-train deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers, which can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.
81.7K
•Posted Content
Semi-Supervised Classification with Graph Convolutional Networks
Thomas Kipf,Max Welling +1 more
TL;DR: A scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs which outperforms related methods by a significant margin.
22.7K
The Stanford CoreNLP Natural Language Processing Toolkit
Christopher D. Manning,Mihai Surdeanu,John Bauer,Jenny Rose Finkel,Steven Bethard,David McClosky +5 more
- 01 Jun 2014
TL;DR: The design and use of the Stanford CoreNLP toolkit is described, an extensible pipeline that provides core natural language analysis, and it is suggested that this follows from a simple, approachable design, straightforward interfaces, the inclusion of robust and good quality analysis components, and not requiring use of a large amount of associated baggage.
•Proceedings Article
Sinkhorn Distances: Lightspeed Computation of Optimal Transport
Marco Cuturi
- 05 Dec 2013
TL;DR: This work smooths the classic optimal transport problem with an entropic regularization term, and shows that the resulting optimum is also a distance which can be computed through Sinkhorn's matrix scaling algorithm at a speed that is several orders of magnitude faster than that of transport solvers.