TL;DR: The empirical results suggest that the highest levels of performance can be obtained through relatively simple means: heuristic learning of phrase translations from word-based alignments and lexical weighting of phrase translations.

Abstract: We propose a new phrase-based translation model and decoding algorithm that enables us to evaluate and compare several, previously proposed phrase-based translation models. Within our framework, we carry out a large number of experiments to understand better and explain why phrase-based models out-perform word-based models. Our empirical results, which hold for all examined language pairs, suggest that the highest levels of performance can be obtained through relatively simple means: heuristic learning of phrase translations from word-based alignments and lexical weighting of phrase translations. Surprisingly, learning phrases longer than three words and learning phrases from high-accuracy word-level alignment models does not have a strong impact on performance. Learning only syntactically motivated phrases degrades the performance of our systems.

4,102 citations

Proceedings Article•10.18653/V1/P16-2034•

Attention-Based Bidirectional Long Short-Term Memory Networks for Relation Classification

Peng Zhou¹, Wei Shi, Jun Tian, Zhenyu Qi¹, Bingchen Li, Hao Hongwei, Bo Xu¹•Institutions (1)

Chinese Academy of Sciences¹

1 Aug 2016

TL;DR: The experimental results on the SemEval-2010 relation classification task show that the AttBLSTM method outperforms most existing methods while using only word vectors.

Abstract: Relation classification is an important semantic processing task in the field of natural language processing (NLP). State-of-the-art systems still rely on lexical resources such as WordNet or NLP systems like dependency parsers and named entity recognizers (NER) to get high-level features. Another challenge is that important information can appear at any position in the sentence. To tackle these problems, we propose Attention-Based Bidirectional Long Short-Term Memory Networks (AttBLSTM) to capture the most important semantic information in a sentence. The experimental results on the SemEval-2010 relation classification task show that our method outperforms most of the existing methods, with only word vectors.

2,362 citations

Proceedings Article•10.3115/V1/D14-1082•

A Fast and Accurate Dependency Parser using Neural Networks

Danqi Chen¹, Christopher D. Manning¹•Institutions (1)

Stanford University¹

1 Jan 2014

TL;DR: This work proposes a novel way of learning a neural network classifier for use in a greedy, transition-based dependency parser that runs very fast while improving unlabeled and labeled attachment scores by about 2% on both English and Chinese datasets.

Abstract: Almost all current dependency parsers classify based on millions of sparse indicator features. Not only do these features generalize poorly, but the cost of feature computation restricts parsing speed significantly. In this work, we propose a novel way of learning a neural network classifier for use in a greedy, transition-based dependency parser. Because this classifier learns and uses just a small number of dense features, it can work very fast, while achieving an about 2% improvement in unlabeled and labeled attachment scores on both English and Chinese datasets. Concretely, our parser is able to parse more than 1000 sentences per second at 92.2% unlabeled attachment score on the English Penn Treebank.

2,322 citations

Book•

Dependency Syntax: Theory and Practice

Igor Alexandrowitsch Meltschuk

1 Jan 1987

1,431 citations

Proceedings Article•10.3115/1596276.1596305•

CoNLL-X Shared Task on Multilingual Dependency Parsing

Sabine Buchholz¹, Erwin Marsi²•Institutions (2)

Toshiba¹, Tilburg University²

8 Jun 2006

TL;DR: This paper describes how treebanks for 13 languages were converted into the same dependency format and how parsing performance was measured, and draws general conclusions about multi-lingual parsing.

Abstract: Each year the Conference on Computational Natural Language Learning (CoNLL) features a shared task, in which participants train and test their systems on exactly the same data sets, in order to better compare systems. The tenth CoNLL (CoNLL-X) saw a shared task on Multilingual Dependency Parsing. In this paper, we describe how treebanks for 13 languages were converted into the same dependency format and how parsing performance was measured. We also give an overview of the parsing approaches that participants took and the results that they achieved. Finally, we try to draw general conclusions about multi-lingual parsing: What makes a particular language, treebank or annotation scheme easier or harder to parse and which phenomena are challenging for any dependency parser?

1,114 citations