Classification-based objective functions
M.E. Rimer,Tony Martinez +1 more
TL;DR: CB1 is presented here as a novel objective function for learning classification problems that seeks to directly minimize classification error by backpropagating error only on misclassified patterns from culprit output nodes and achieves higher accuracy on classification problems than optimizing SSE or CE.
read more
Abstract: Backpropagation, similar to most learning algorithms that can form complex decision surfaces, is prone to overfitting. This work presents classification-based objective functions, an approach to training artificial neural networks on classification problems. Classification-based learning attempts to guide the network directly to correct pattern classification rather than using common error minimization heuristics, such as sum-squared error (SSE) and cross-entropy (CE), that do not explicitly minimize classification error. CB1 is presented here as a novel objective function for learning classification problems. It seeks to directly minimize classification error by backpropagating error only on misclassified patterns from culprit output nodes. CB1 discourages weight saturation and overfitting and achieves higher accuracy on classification problems than optimizing SSE or CE. Experiments on a large OCR data set have shown CB1 to significantly increase generalization accuracy over SSE or CE optimization, from 97.86% and 98.10%, respectively, to 99.11%. Comparable results are achieved over several data sets from the UC Irvine Machine Learning Database Repository, with an average increase in accuracy from 90.7% and 91.3% using optimized SSE and CE networks, respectively, to 92.1% for CB1. Analysis indicates that CB1 performs a fundamentally different search of the feature space than optimizing SSE or CE and produces significantly different solutions.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Machine learning
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
•Journal Article
Multiagent systems: a modern approach to distributed artificial intelligence
TL;DR: Multiagent Systems is the title of a collection of papers dedicated to surveying specific themes of Multiagent Systems (MAS) and Distributed Artificial Intelligence (DAI).
Deterministic neural classification
TL;DR: By approximating the nonlinear counting step function using a quadratic function, the classification error rate is shown to be deterministically solvable and empirical results indicate SLFN's effectiveness on classification generalization.
111
A Novel Multistage Transfer Learning for Ultrasound Breast Cancer Image Classification
TL;DR: It is argued that learning from both natural and medical datasets improves performance in ultrasound breast cancer image classification and could remarkably improve the early diagnosis of breast cancer in young women.
On Constructing Parsimonious Type-2 Fuzzy Logic Systems via Influential Rule Selection
TL;DR: Four novel indexes for ranking the relative contribution of type-2 fuzzy rules are proposed, and experiments are presented which demonstrate that by using the proposed methodology, the most influential type- 2 fuzzy rules can be effectively retained in order to construct parsimonious type-1 fuzzy models.
References
Learning internal representations by error propagation
David E. Rumelhart,Geoffrey E. Hinton,Ronald J. Williams +2 more
- 01 Jan 1988
TL;DR: This chapter contains sections titled: The Problem, The Generalized Delta Rule, Simulation Results, Some Further Generalizations, Conclusion.
•Book
Learning internal representations by error propagation
David E. Rumelhart,Geoffrey E. Hinton,Ronald J. Williams +2 more
- 03 Jan 1986
TL;DR: In this paper, the problem of the generalized delta rule is discussed and the Generalized Delta Rule is applied to the simulation results of simulation results in terms of the generalized delta rule.
16K
Machine learning
TL;DR: Machine learning addresses many of the same research questions as the fields of statistics, data mining, and psychology, but with differences of emphasis.
Multitask Learning
Rich Caruana
- 01 Jul 1997
TL;DR: Multi-task Learning (MTL) as mentioned in this paper is an approach to inductive transfer that improves generalization by using the domain information contained in the training signals of related tasks as an inductive bias.