Round robin classification

doi:10.1162/153244302320884605

Open Access • Journal Article • 10.1162/153244302320884605

Johannes Fürnkranz

- 01 Mar 2002

- Journal of Machine Learning Research

- Vol. 2, Iss: 4, pp 721-747

502

TL;DR: An empirical evaluation of round robin classification, implemented as a wrapper around the Ripper rule learning algorithm, on 20 multi-class datasets from the UCI database repository shows that the technique is very likely to improve Ripper's classification accuracy without having a high risk of decreasing it.

Abstract: In this paper, we discuss round robin classification (aka pairwise classification), a technique for handling multi-class problems with binary classifiers by learning one classifier for each pair of classes. We present an empirical evaluation of the method, implemented as a wrapper around the Ripper rule learning algorithm, on 20 multi-class datasets from the UCI database repository. Our results show that the technique is very likely to improve Ripper's classification accuracy without having a high risk of decreasing it. More importantly, we give a general theoretical analysis of the complexity of the approach and show that its run-time complexity is below that of the commonly used one-against-all technique. These theoretical results are not restricted to rule learning but are also of interest to other communities where pairwise classification has recently received some attention. Furthermore, we investigate its properties as a general ensemble technique and show that round robin classification with C5.0 may improve C5.0's performance on multi-class problems. However, this improvement does not reach the performance increase of boosting, and a combination of boosting and round robin classification does not produce any gain over conventional boosting. Finally, we show that the performance of round robin classification can be further improved by a straight-forward integration with bagging.
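The pairwise scheme described in the abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's implementation: `NearestCentroid` is a toy stand-in for the base learner (Ripper is not used here), `RoundRobin` and `examples_seen` are hypothetical names, and unweighted voting with ties broken by class order is an assumed decoding rule. The counter tracks how many training examples the binary learners see in total, which is (c - 1) * n for c classes and n examples, compared with c * n for one-against-all.

```python
from collections import Counter
from itertools import combinations


class NearestCentroid:
    """Toy base learner (assumption: a stand-in for Ripper)."""

    def fit(self, X, y):
        sums, counts = {}, Counter(y)
        for row, label in zip(X, y):
            acc = sums.setdefault(label, [0.0] * len(row))
            for j, value in enumerate(row):
                acc[j] += value
        # One mean vector per class label seen in this binary problem.
        self.centroids = {
            label: [s / counts[label] for s in acc] for label, acc in sums.items()
        }
        return self

    def predict_one(self, row):
        def sq_dist(centroid):
            return sum((a - b) ** 2 for a, b in zip(row, centroid))

        return min(self.centroids, key=lambda label: sq_dist(self.centroids[label]))


class RoundRobin:
    """Hypothetical wrapper: one binary classifier per pair of classes."""

    def __init__(self, make_learner=NearestCentroid):
        self.make_learner = make_learner

    def fit(self, X, y):
        self.classes = sorted(set(y))
        self.models = {}
        self.examples_seen = 0  # total training examples over all binary problems
        for a, b in combinations(self.classes, 2):
            # Each binary problem uses only the examples of its two classes.
            pair = [(row, label) for row, label in zip(X, y) if label in (a, b)]
            rows = [row for row, _ in pair]
            labels = [label for _, label in pair]
            self.models[(a, b)] = self.make_learner().fit(rows, labels)
            self.examples_seen += len(pair)
        return self

    def predict_one(self, row):
        # Every pairwise classifier casts one vote (assumed decoding rule).
        votes = Counter(model.predict_one(row) for model in self.models.values())
        # Ties go to the first class in sorted order, an arbitrary choice.
        return max(self.classes, key=lambda c: votes[c])


# Three well-separated classes with three 2-D examples each.
X = [(0, 0), (0, 1), (1, 0), (5, 5), (5, 6), (6, 5), (10, 0), (10, 1), (11, 0)]
y = ["a"] * 3 + ["b"] * 3 + ["c"] * 3

rr = RoundRobin().fit(X, y)
one_against_all_examples = len(set(y)) * len(X)  # c * n, for comparison
```

With c = 3 and n = 9, the pairwise learners see 18 examples in total while one-against-all would see 27, which is the intuition behind the training-cost comparison in the abstract when the base learner's cost grows at least linearly with the number of examples.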

Citations

•Journal Article•10.1007/S10994-011-5256-5

Classifier chains for multi-label classification

Jesse Read, +3 more

- 01 Dec 2011

- Machine Learning

TL;DR: This paper presents a novel classifier chains method that can model label correlations while maintaining acceptable computational complexity, and illustrates the competitiveness of the chaining method against related and state-of-the-art methods, both in terms of predictive performance and time complexity.

2.5K

•Journal Article

In Defense of One-Vs-All Classification

Ryan Rifkin, +1 more

- 01 Dec 2004

- Journal of Machine Learning Research

TL;DR: It is argued that a simple "one-vs-all" scheme is as accurate as any other approach, assuming that the underlying binary classifiers are well-tuned regularized classifiers such as support vector machines.

1.9K

•Book Chapter•10.1007/978-3-642-04174-7_17

Classifier Chains for Multi-label Classification

Jesse Read, +3 more

- 27 Aug 2009

TL;DR: Empirical evaluation over a broad range of multi-label datasets with a variety of evaluation metrics demonstrates the competitiveness of the chaining method against related and state-of-the-art methods, both in terms of predictive performance and time complexity.

1.8K

•Journal Article•10.1002/CEM.873

An introduction to decision tree modeling

Anthony J. Myles, +4 more

- 01 Jun 2004

- Journal of Chemometrics

TL;DR: In this tutorial, traditional decision tree construction and the current state of decision tree modeling are reviewed and emphasis is placed on techniques that make decision trees well suited to handle the complexities of chemical and biochemical applications.

1.2K

•Journal Article•10.1109/TGRS.2004.842481

Investigation of the random forest framework for classification of hyperspectral data

Jisoo Ham, +3 more

- 22 Feb 2005

- IEEE Transactions on Geoscience and Remo...

TL;DR: This work investigates two approaches based on the concept of random forests of classifiers implemented within a binary hierarchical multiclassifier system, with the goal of achieving improved generalization of the classifier in analysis of hyperspectral data, particularly when the quantity of training data is limited.

1.2K

References

•Journal Article•10.1023/A:1022627411411

Support-Vector Networks

Corinna Cortes, +1 more

- 15 Sep 1995

- Machine Learning

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support-vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

42K

•Book

C4.5: Programs for Machine Learning

J. Ross Quinlan

- 15 Oct 1992

TL;DR: A complete guide to the C4.5 system as implemented in C for the UNIX environment, which starts from simple core learning methods and shows how they can be elaborated and extended to deal with typical problems such as missing data and overfitting.

27.2K

•Book

Classification and regression trees

Leo Breiman

- 01 Jan 1983

TL;DR: The methodology used to construct tree-structured rules is the focus of this monograph, which covers the use of trees as a data analysis method and, in a more mathematical framework, proves some of their fundamental properties.

22.7K

•Journal Article•10.2307/2288003

Classification and Regression Trees.

John Van Ryzin, +4 more

- 01 Mar 1986

- Journal of the American Statistical Asso...

21.8K

•Journal Article•10.1006/JCSS.1997.1504

A Decision-Theoretic Generalization of On-Line Learning and an Application to Boosting

Yoav Freund, +1 more

- 01 Aug 1997

TL;DR: The model studied can be interpreted as a broad, abstract extension of the well-studied on-line prediction model to a general decision-theoretic setting, and it is shown that the multiplicative weight-update Littlestone–Warmuth rule can be adapted to this model, yielding bounds that are slightly weaker in some cases, but applicable to a considerably more general class of learning problems.

18.6K