Performance analysis for L\_2 kernel classification

Open AccessProceedings Article

Performance analysis for L\_2 kernel classification

- 08 Dec 2008

- Vol. 21, pp 833-840

3

TL;DR: A distribution free concentration inequality is proved for a cross-validation based estimate of the ISE, and this result is applied to deduce an oracle inequality and consistency of the classifier on the sense of both ISE and probability of error.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1080/10618600.2012.737296

Robust Parametric Classification and Variable Selection by a Minimum Distance Criterion

Eric C. Chi, +1 more

- 12 Feb 2014

- Journal of Computational and Graphical S...

TL;DR: In this paper, a robust penalized logistic regression algorithm based on a minimum distance criterion was proposed to avoid estimation implosion in the presence of many outliers in the important small n large p situation.

...read moreread less

30

•Dissertation

Parametric classification and variable selection by the minimum integrated squared error criterion

Eric C. Chi

- 01 Jan 2012

TL;DR: In this article, Parametric classification and variable selection by the Minimum Integrated Squared Error Criterion (MIQE) was used for parametric classification, variable selection and variable classification.

...read moreread less

5

Journal Article•10.1109/TPAMI.2009.188

L₂ Kernel Classification

JooSeuk Kim, +1 more

- 01 Oct 2010

- IEEE Transactions on Pattern Analysis an...

TL;DR: This work proposes a kernel classifier that optimizes the L2 or integrated squared error of a “difference of densities” of the Gaussian kernel and extends the method through the introduction of a natural regularization parameter, which allows it to remain competitive with the SVM in high dimensions.

...read moreread less

References

•Journal Article•10.1023/A:1022627411411

Support-Vector Networks

Corinna Cortes, +1 more

- 15 Sep 1995

- Machine Learning

TL;DR: High generalization ability of support-vector networks utilizing polynomial input transformations is demonstrated and the performance of the support- vector network is compared to various classical learning algorithms that all took part in a benchmark study of Optical Character Recognition.

...read moreread less

42K

Journal Article•10.2307/2982206

Testing Statistical Hypotheses

J. D. Biggins, +1 more

- 01 Jan 1988

- Journal of The Royal Statistical Society...

TL;DR: Lehmann as discussed by the authors, Testing Statistical Hypotheses (2nd ed.). By E. L. Lehmann, 1986. xx, 600p. £44.13.

...read moreread less

1K

•Journal Article•10.1109/TPAMI.2003.1233899

Probability density estimation from optimally condensed data samples

Mark Girolami, +1 more

- 01 Oct 2003

- IEEE Transactions on Pattern Analysis an...

TL;DR: The Reduced Set Density Estimator is presented, which provides a kernel-based density estimator which employs a small percentage of the available data sample and is optimal in the L/sub 2/ sense.

...read moreread less

253

Journal Article•10.1198/004017001316975880

Parametric Statistical Modeling by Minimum Integrated Square Error

David Scott

- 01 Aug 2001

- Technometrics

TL;DR: This article investigates the use of integrated square error, or L2 distance, as a theoretical and practical estimation tool for a variety of parametric statistical models and demonstrates by example the well-known result that minimum distance estimators, including L2E, are inherently robust.

...read moreread less

250

Journal Article•10.1109/TIT.1969.1054295

Asymptotically optimal discriminant functions for pattern classification

C. Wolverton, +1 more

- 01 Mar 1969

- IEEE Transactions on Information Theory

TL;DR: It is shown that as the number of labeled samples used to construct the approximations increases, the resulting sequence of discriminant functions is asymptotically optimal in the sense that the probability of misclassification when using the approxIMations in the decision procedure converges in probability or with probability 1, depending on the assumptions made.

...read moreread less

179