Learning Minimum Volume Sets

Open AccessProceedings Article

Learning Minimum Volume Sets

- 05 Dec 2005

- Vol. 18, pp 1209-1216

146

TL;DR: In this article, the problem of estimating minimum volume sets based on independent samples distributed according to a probability measure P and a reference measure μ is addressed, where no other information is available regarding P, but the reference measure is assumed to be known.

Abstract: Given a probability measure P and a reference measure μ, one is often interested in the minimum μ-measure set with P-measure at least α. Minimum volume sets of this type summarize the regions of greatest probability mass of P, and are useful for detecting anomalies and constructing confidence regions. This paper addresses the problem of estimating minimum volume sets based on independent samples distributed according to P. Other than these samples, no other information is available regarding P, but the reference measure μ is assumed to be known. We introduce rules for estimating minimum volume sets that parallel the empirical risk minimization and structural risk minimization principles in classification. As in classification, we show that the performances of our estimators are controlled by the rate of uniform convergence of empirical to true probabilities over the class from which the estimator is drawn. Thus we obtain finite sample size performance bounds in terms of VC dimension and related quantities. We also demonstrate strong universal consistency and an oracle inequality. Estimators based on histograms and dyadic partitions illustrate the proposed rules.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1109/JPROC.2021.3052449

A Unifying Review of Deep and Shallow Anomaly Detection

Lukas Ruff, +7 more

- 04 Feb 2021

TL;DR: Deep learning approaches to anomaly detection (AD) have recently improved the state of the art in detection performance on complex data sets, such as large collections of images or text as mentioned in this paper, and led to the introduction of a great variety of new methods.

...read moreread less

650

•Journal Article•10.1109/JPROC.2021.3052449

A Unifying Review of Deep and Shallow Anomaly Detection

Lukas Ruff, +7 more

- 24 Sep 2020

- arXiv: Learning

TL;DR: This review aims to identify the common underlying principles and the assumptions that are often made implicitly by various methods in deep learning, and draws connections between classic “shallow” and novel deep approaches and shows how this relation might cross-fertilize or extend both directions.

...read moreread less

582

•Posted Content

Deep Semi-Supervised Anomaly Detection

Lukas Ruff, +6 more

- 06 Jun 2019

- arXiv: Learning

TL;DR: This work presents Deep SAD, an end-to-end deep methodology for general semi-supervised anomaly detection, and introduces an information-theoretic framework for deep anomaly detection based on the idea that the entropy of the latent distribution for normal data should be lower than the entropy the anomalous distribution, which can serve as a theoretical interpretation for the method.

...read moreread less

388

•Journal Article•10.1080/01621459.2012.751873

Distribution-Free Prediction Sets

Jing Lei, +2 more

- 15 Mar 2013

- Journal of the American Statistical Asso...

TL;DR: This article considers the problem of constructing nonparametric tolerance/prediction sets by starting from the general conformal prediction approach, and uses a kernel density estimator as a measure of agreement between a sample point and the underlying distribution.

...read moreread less

210

Journal Article•10.1109/TPAMI.2009.24

A Small Sphere and Large Margin Approach for Novelty Detection Using Training Data with Outliers

Mingrui Wu, +1 more

- 01 Nov 2009

- IEEE Transactions on Pattern Analysis an...

TL;DR: A small sphere and large margin approach for novelty detection problems, where the majority of training data are normal examples and the training data also contain a small number of abnormal examples or outliers.

...read moreread less

195

...

Expand

References

•Book

Elements of information theory

Thomas M. Cover, +1 more

- 01 Jan 1991

TL;DR: The author examines the role of entropy, inequality, and randomness in the design of codes and the construction of codes in the rapidly changing environment.

...read moreread less

52.2K

Statistical learning theory

Vladimir Vapnik

- 01 Jan 1998

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

...read moreread less

30.4K

•Book

Classification and regression trees

Leo Breiman

- 01 Jan 1983

TL;DR: The methodology used to construct tree structured rules is the focus of a monograph as mentioned in this paper, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.

...read moreread less

22.7K

Journal Article•10.2307/2288003

Classification and Regression Trees.

John Van Ryzin, +4 more

- 01 Mar 1986

- Journal of the American Statistical Asso...

21.8K

•Journal Article•10.1162/089976601750264965

Estimating the Support of a High-Dimensional Distribution

Bernhard Schölkopf, +4 more

- 01 Jul 2001

- Neural Computation

TL;DR: In this paper, the authors propose a method to estimate a function f that is positive on S and negative on the complement of S. The functional form of f is given by a kernel expansion in terms of a potentially small subset of the training data; it is regularized by controlling the length of the weight vector in an associated feature space.

...read moreread less

5.4K

...

Expand

Learning Minimum Volume Sets

Chat with Paper

AI Agents for this Paper

Citations

A Unifying Review of Deep and Shallow Anomaly Detection

A Unifying Review of Deep and Shallow Anomaly Detection

Deep Semi-Supervised Anomaly Detection

Distribution-Free Prediction Sets

A Small Sphere and Large Margin Approach for Novelty Detection Using Training Data with Outliers

References

Elements of information theory

Statistical learning theory

Classification and regression trees

Classification and Regression Trees.

Estimating the Support of a High-Dimensional Distribution

Related Papers (5)

Estimating the intensity of a random measure by histogram type estimators

Finite sample properties of linear model identification

Optimal testing of discrete distributions with high probability

On the Value of Partial Information for Learning from Examples

Effective two-stage estimation for a linear function of high-dimensional Gaussian means