Evaluating Evaluation Measures

Open Access

Evaluating Evaluation Measures

- 23 May 2007

- pp 372-379

20

TL;DR: An analysis of specic error types indicates that the dependency-based evaluation is most appropriate to reect parse quality, and shows that PARSEVAL should not be used to compare parser performance for parsers trained on treebanks with different annotation schemes.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article

Overview of the SPMRL 2013 Shared Task: A Cross-Framework Evaluation of Parsing Morphologically Rich Languages

Djamé Seddah, +22 more

- 18 Oct 2013

TL;DR: This paper presents and analyzes parsing results obtained by the task participants, and provides an analysis and comparison of the parsers across languages and frameworks, reported for gold input as well as more realistic parsing scenarios.

...read moreread less

201

Journal Article•10.1016/J.ESWA.2012.09.017

Effects of data set features on the performances of classification algorithms

Ohbyung Kwon, +1 more

- 01 Apr 2013

- Expert Systems With Applications

TL;DR: This research experimentally examines how data set characteristics affect algorithm performance, both in terms of accuracy and in elapsed time, and uses a multiple regression method to evaluate the causality between dataSet characteristics as independent variables, and performance metrics as dependent variables.

...read moreread less

171

•Proceedings Article•10.3115/V1/P15-1116

Discontinuous Incremental Shift-reduce Parsing

Wolfgang Maier

- 01 Jul 2015

TL;DR: This work presents an extension to incremental shift-reduce parsing that handles discontinuous constituents, using a linear classifier and beam search, and achieves very high parsing speeds and accurate results.

...read moreread less

46

•Proceedings Article

Information Retrieval Meta-Evaluation: Challenges and Opportunities in the Music Domain.

Julián Urbano

- 01 Jan 2011

TL;DR: A survey of past meta-evaluation work in the context of Text Information Retrieval argues that the music community still needs to address various issues concerning the evaluation of music systems and the IR cycle, pointing out directions for further research and proposals in this line.

...read moreread less

34

•Proceedings Article

Direct Parsing of Discontinuous Constituents in German

Wolfgang Maier

- 05 Jun 2010

TL;DR: This paper uses a parser for Probabilistic Linear Context-Free Rewriting Systems (PLCFRS), a formalism with high expressivity, to directly parse the German NeGra and TIGER treebanks, and shows that an output quality can be achieved which is comparable to the output quality of PCFG-based systems.

...read moreread less

29

...

Expand

References

Statistical learning theory

Vladimir Vapnik

- 01 Jan 1998

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

...read moreread less

30.4K

•Journal Article

Binary codes capable of correcting deletions, insertions, and reversals

V.I. Levenshtein

- 01 Jan 1966

- Soviet physics. Doklady

10.5K

•Journal Article

Binary codes capable of correcting deletions, insertions and reversals

V.I. Levenshtein

- 01 Jan 1965

- Proceedings of the USSR Academy of Scien...

9.1K

Binary Codes capable of currecting deletions, insertions, and reversals

I. V. Levenshtein

- 01 Jan 1966

7.7K

•Book

A Probabilistic Theory of Pattern Recognition

Luc Devroye, +2 more

- 01 Jan 1996

TL;DR: The Bayes Error and Vapnik-Chervonenkis theory are applied as guide for empirical classifier selection on the basis of explicit specification and explicit enforcement of the maximum likelihood principle.

...read moreread less

4.2K