Estimating the Support of a High-Dimensional Distribution
TL;DR: In this paper, the authors propose a method to estimate a function f that is positive on S and negative on the complement of S. The functional form of f is given by a kernel expansion in terms of a potentially small subset of the training data; it is regularized by controlling the length of the weight vector in an associated feature space.
read more
Abstract: Suppose you are given some data set drawn from an underlying probability distribution P and you want to estimate a "simple" subset S of input space such that the probability that a test point drawn from P lies outside of S equals some a priori specified value between 0 and 1. We propose a method to approach this problem by trying to estimate a function f that is positive on S and negative on the complement. The functional form of f is given by a kernel expansion in terms of a potentially small subset of the training data; it is regularized by controlling the length of the weight vector in an associated feature space. The expansion coefficients are found by solving a quadratic programming problem, which we do by carrying out sequential optimization over pairs of input patterns. We also provide a theoretical analysis of the statistical performance of our algorithm. The algorithm is a natural extension of the support vector algorithm to the case of unlabeled data.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Learning Based Anomaly Detection for Industrial Arm Applications
Vedanth Narayanan,Rakesh B. Bobba +1 more
- 15 Jan 2018
TL;DR: An anomaly detection framework for robotic arms in a manufacturing pipeline and integrate it into Robot Operating System (ROS), a middleware framework whose variants are being considered for deployment in industrial environments for flexible automation.
53
Combining different biometric traits with one-class classification
TL;DR: It is shown that one-class classification could be considered as an alternative to biometric fusion, especially when the data are highly unbalanced or when data from only a single class are available.
52
Animal recognition in the Mojave Desert: Vision tools for field biologists
Michael J. Wilber,Walter J. Scheirer,P. Leitner,Brian Heflin,J. Zott,D. Reinke,David K. Delaney,Terrance E. Boult +7 more
- 15 Jan 2013
TL;DR: A novel algorithm for animal classification is introduced that addresses the open set nature of this problem and is suitable for implementation on a smartphone and a simple model for object recognition applied to the problem of individual species identification is looked at.
52
Random Partitioning Forest for Point-Wise and Collective Anomaly Detection -- Application to Intrusion Detection
TL;DR: The experiments show that DiFF-RF almost systematically outperforms the IF algorithm and one of its extended variant, but also challenges the one-class SVM baseline, deep learning variational auto-encoder and ensemble of auto- encoder architectures.
Isolation-based conditional anomaly detection on mixed-attribute data to uncover workers' compensation fraud
Eugen Stripling,Bart Baesens,Bart Baesens,Barak Chizi,Seppe vanden Broucke +4 more
- 01 Jul 2018
TL;DR: This work proposes the i Forest CAD approach that computes conditional anomaly scores, useful for fraud detection, and presents a case study in which the usefulness of the proposed approach is demonstrated on real-world workers' compensation claims received from a large European insurance organization.
References
•Book
Elements of information theory
Thomas M. Cover,Joy A. Thomas +1 more
- 01 Jan 1991
TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.
•Book
The Nature of Statistical Learning Theory
Vladimir Vapnik
- 01 Jan 1995
TL;DR: Setting of the learning problem consistency of learning processes bounds on the rate of convergence ofLearning processes controlling the generalization ability of learning process constructing learning algorithms what is important in learning theory?
46K
Statistical learning theory
Vladimir Vapnik
- 01 Jan 1998
TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.
30.4K
A training algorithm for optimal margin classifiers
Bernhard E. Boser,Isabelle Guyon,Vladimir Vapnik +2 more
- 01 Jul 1992
TL;DR: A training algorithm that maximizes the margin between the training patterns and the decision boundary is presented, applicable to a wide variety of the classification functions, including Perceptrons, polynomials, and Radial Basis Functions.
A tutorial on support vector regression
TL;DR: This tutorial gives an overview of the basic ideas underlying Support Vector (SV) machines for function estimation, and includes a summary of currently used algorithms for training SV machines, covering both the quadratic programming part and advanced methods for dealing with large datasets.
Related Papers (5)
Vladimir Vapnik
- 01 Jan 1995
Vladimir Vapnik
- 01 Jan 1998