Classification-based objective functions

doi:10.1007/S10994-006-6266-6

Open AccessJournal Article10.1007/S10994-006-6266-6

Classification-based objective functions

M.E. Rimer, +1 more

- 01 May 2006

- Machine Learning

- Vol. 63, Iss: 2, pp 183-205

27

TL;DR: CB1 is presented here as a novel objective function for learning classification problems that seeks to directly minimize classification error by backpropagating error only on misclassified patterns from culprit output nodes and achieves higher accuracy on classification problems than optimizing SSE or CE.

Abstract: Backpropagation, similar to most learning algorithms that can form complex decision surfaces, is prone to overfitting. This work presents classification-based objective functions, an approach to training artificial neural networks on classification problems. Classification-based learning attempts to guide the network directly to correct pattern classification rather than using common error minimization heuristics, such as sum-squared error (SSE) and cross-entropy (CE), that do not explicitly minimize classification error. CB1 is presented here as a novel objective function for learning classification problems. It seeks to directly minimize classification error by backpropagating error only on misclassified patterns from culprit output nodes. CB1 discourages weight saturation and overfitting and achieves higher accuracy on classification problems than optimizing SSE or CE. Experiments on a large OCR data set have shown CB1 to significantly increase generalization accuracy over SSE or CE optimization, from 97.86% and 98.10%, respectively, to 99.11%. Comparable results are achieved over several data sets from the UC Irvine Machine Learning Database Repository, with an average increase in accuracy from 90.7% and 91.3% using optimized SSE and CE networks, respectively, to 92.1% for CB1. Analysis indicates that CB1 performs a fundamentally different search of the feature space than optimizing SSE or CE and produces significantly different solutions.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1145/242224.242229

Machine learning

Thomas G. Dietterich

- 01 Dec 1996

- ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

14K

•Journal Article

Multiagent systems: a modern approach to distributed artificial intelligence

Guillermo Ricardo Simari

- 01 Jan 2000

- Journal of Computer Science and Technolo...

TL;DR: Multiagent Systems is the title of a collection of papers dedicated to surveying specific themes of Multiagent Systems (MAS) and Distributed Artificial Intelligence (DAI).

...read moreread less

789

Journal Article•10.1162/NECO.2007.04-07-508

Deterministic neural classification

Kar-Ann Toh

- 01 Jun 2008

- Neural Computation

TL;DR: By approximating the nonlinear counting step function using a quadratic function, the classification error rate is shown to be deterministically solvable and empirical results indicate SLFN's effectiveness on classification generalization.

...read moreread less

111

•Journal Article•10.3390/diagnostics12010135

A Novel Multistage Transfer Learning for Ultrasound Breast Cancer Image Classification

Gelan Ayana, +2 more

- 01 Jan 2022

- Diagnostics

TL;DR: It is argued that learning from both natural and medical datasets improves performance in ultrasound breast cancer image classification and could remarkably improve the early diagnosis of breast cancer in young women.

...read moreread less

94

•Journal Article•10.1109/TFUZZ.2008.928597

On Constructing Parsimonious Type-2 Fuzzy Logic Systems via Influential Rule Selection

Shang-Ming Zhou, +3 more

- 01 Jun 2009

- IEEE Transactions on Fuzzy Systems

TL;DR: Four novel indexes for ranking the relative contribution of type-2 fuzzy rules are proposed, and experiments are presented which demonstrate that by using the proposed methodology, the most influential type- 2 fuzzy rules can be effectively retained in order to construct parsimonious type-1 fuzzy models.

...read moreread less

73

...

Expand

References

UCI Machine Learning Repository

A. Asuncion

- 01 Jan 2007

24.3K

Book Chapter•10.1016/B978-1-4832-1446-7.50035-2

Learning internal representations by error propagation

David E. Rumelhart, +2 more

- 01 Jan 1988

TL;DR: This chapter contains sections titled: The Problem, The Generalized Delta Rule, Simulation Results, Some Further Generalizations, Conclusion.

...read moreread less

18.9K

•Book

Learning internal representations by error propagation

David E. Rumelhart, +2 more

- 03 Jan 1986

TL;DR: In this paper, the problem of the generalized delta rule is discussed and the Generalized Delta Rule is applied to the simulation results of simulation results in terms of the generalized delta rule.

...read moreread less

16K

Journal Article•10.1145/242224.242229

Machine learning

Thomas G. Dietterich

- 01 Dec 1996

- ACM Computing Surveys

TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.

...read moreread less

14K

•Journal Article•10.1023/A:1007379606734

Multitask Learning

Rich Caruana

- 01 Jul 1997

TL;DR: Multi-task Learning (MTL) as mentioned in this paper is an approach to inductive transfer that improves generalization by using the domain information contained in the training signals of related tasks as an inductive bias.

...read moreread less

8K