Active Property Testing

doi:10.1109/FOCS.2012.64

Open Access•Proceedings Article•10.1109/FOCS.2012.64

Maria-Florina Balcan, +3 more

- 20 Oct 2012

- Vol. 2012, pp 21-30

69

TL;DR: The authors show that testing unions of d intervals can be done with O(1) label requests in the active testing setting, whereas learning is known to require Omega(d) labeled examples (and passive testing, where the algorithm must pay for every example drawn from D, requires Omega(sqrt{d}) [KR00]).

Abstract: One motivation for property testing of boolean functions is the idea that testing can provide a fast preprocessing step before learning. However, in most machine learning applications, it is not possible to request labels of arbitrary examples constructed by an algorithm. Instead, the dominant query paradigm in applied machine learning, called *active learning*, is one where the algorithm may request labels, but only for points in a given (polynomial-sized) unlabeled sample, drawn from some underlying distribution D. In this work, we bring this well-studied model to the domain of testing. We develop both general results for this *active testing* model and efficient testing algorithms for several properties that are important for learning, demonstrating that testing can still yield substantial benefits in this restricted setting. For example, we show that testing unions of d intervals can be done with O(1) label requests in our setting, whereas it is known to require Omega(d) labeled examples for learning (and Omega(sqrt{d}) for passive testing [KR00], where the algorithm must pay for every example drawn from D). In fact, our results for testing unions of intervals also yield improvements on prior work in both the classic query model (where any point in the domain can be queried) and the passive testing model. For the problem of testing linear separators in R^n over the Gaussian distribution, we show that both active and passive testing can be done with O(sqrt{n}) queries, substantially fewer than the Omega(n) needed for learning, with near-matching lower bounds. We also present a general combination result in this model for building testable properties out of others, which we then use to provide testers for a number of assumptions used in semi-supervised learning.
In addition to the above results, we also develop a general notion of the *testing dimension* of a given property with respect to a given distribution, that we show characterizes (up to constant factors) the intrinsic number of label requests needed to test that property. We develop such notions for both the active and passive testing models. We then use these dimensions to prove a number of lower bounds, including for linear separators and the class of dictator functions.
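
The query model described in the abstract can be made concrete with a small sketch. This is a toy illustration only, not the paper's tester: the class `ActiveOracle`, the function `neighbor_disagreement`, and the idea of comparing labels of adjacent pool points are assumptions introduced here to show how label requests are restricted to an unlabeled pool and how they are counted. No acceptance threshold is given, since the abstract does not state one.

```python
import random
from typing import Callable, Sequence


class ActiveOracle:
    """Label oracle restricted to a fixed unlabeled pool (hypothetical helper)."""

    def __init__(self, pool: Sequence[float], f: Callable[[float], int]):
        self.pool = list(pool)   # unlabeled sample drawn from D; free to inspect
        self._f = f              # unknown function; only reachable via label()
        self.label_requests = 0  # the cost measure in the active model

    def label(self, index: int) -> int:
        # Only pool members can be queried, addressed by their pool index.
        if not 0 <= index < len(self.pool):
            raise IndexError("can only query points in the unlabeled pool")
        self.label_requests += 1
        return self._f(self.pool[index])


def neighbor_disagreement(oracle: ActiveOracle, num_pairs: int,
                          rng: random.Random) -> float:
    """Fraction of sampled adjacent pool pairs whose labels differ.

    Sorting uses only the unlabeled points, so it costs no label requests.
    """
    order = sorted(range(len(oracle.pool)), key=lambda i: oracle.pool[i])
    disagreements = 0
    for _ in range(num_pairs):
        k = rng.randrange(len(order) - 1)  # random adjacent pair in sorted order
        if oracle.label(order[k]) != oracle.label(order[k + 1]):
            disagreements += 1
    return disagreements / num_pairs
```

In this sketch the number of label requests is exactly `2 * num_pairs`, independent of the pool size, which mirrors the distinction the abstract draws between cheap unlabeled examples and costly labels. A function with few label changes along the line (such as a union of a few intervals) would tend to produce a low disagreement rate, while a rapidly alternating function would produce a high one.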

Citations

•Book

Introduction to Property Testing

Oded Goldreich

- 01 Nov 2017

TL;DR: In this article, a wide range of algorithmic techniques for the design and analysis of tests for algebraic properties, properties of Boolean functions, graph properties, and properties of distributions are presented.

440

Journal Article•10.1145/2898355

On Sample-Based Testers

Oded Goldreich, +1 more

- 25 Apr 2016

- ACM Transactions on Computation Theory

TL;DR: This work advances the study of sample-based property testers by providing several general positive results as well as by revealing relations between variants of this testing model, and shows that certain types of query-based testers yield sample-based testers of sublinear sample complexity.

49

•Journal Article•10.1137/16M1075661

Erasure-Resilient Property Testing

Kashyap Dixit, +3 more

- 06 Mar 2018

- SIAM Journal on Computing

TL;DR: This work begins a study of property testers that are resilient to the presence of adversarially erased function values and identifies an $\alpha$-erasure-resilient $\var...$ that is resistant to being erased by an adversary.

32

•Proceedings Article•10.5555/2634074.2634163

Testing surface area

Pravesh K. Kothari, +3 more

- 05 Jan 2014

TL;DR: The problem of testing the surface area of an unknown n-dimensional set F given membership oracle access is considered, and the algorithm completely evades the "curse of dimensionality" for any n and any κ > 4/π ≈ 1.27, where κ is the "approximation factor" of the testing algorithm.

30

References

Statistical learning theory

Vladimir Vapnik

- 01 Jan 1998

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of learning process, the author covers function estimates from small data pools, applying these estimations to real-life problems, and much more.

30.4K

Journal Article•10.1126/SCIENCE.290.5500.2323

Nonlinear dimensionality reduction by locally linear embedding.

Sam T. Roweis, +1 more

- 22 Dec 2000

- Science

TL;DR: Locally linear embedding (LLE) is introduced, an unsupervised learning algorithm that computes low-dimensional, neighborhood-preserving embeddings of high-dimensional inputs that learns the global structure of nonlinear manifolds.

17.4K

Journal Article•10.1126/SCIENCE.290.5500.2319

A global geometric framework for nonlinear dimensionality reduction.

Joshua B. Tenenbaum, +2 more

- 22 Dec 2000

- Science

TL;DR: An approach to solving dimensionality reduction problems that uses easily measured local metric information to learn the underlying global geometry of a data set and efficiently computes a globally optimal solution, and is guaranteed to converge asymptotically to the true structure.

15.9K

Book Chapter•10.1007/978-3-319-21852-6_3

On the Uniform Convergence of Relative Frequencies of Events to Their Probabilities

Vladimir Vapnik, +1 more

- 01 Jan 1971

- Theory of Probability and Its Applicatio...

TL;DR: This chapter reproduces the English translation by B. Seckler of the paper by Vapnik and Chervonenkis in which they gave proofs for the innovative results they had obtained in a draft form in July 1966 and announced in 1968 in their note in Soviet Mathematics Doklady.

4.3K

•Book•10.7551/MITPRESS/9780262033589.001.0001

Semi-Supervised Learning

Olivier Chapelle, +2 more

- 31 Mar 2010

TL;DR: Semi-supervised learning (SSL) is the middle ground between supervised learning (in which all training examples are labeled) and unsupervised learning (in which no labels are given).

4.2K

Related Papers (5)

Property testing and its connection to learning and approximation

Robust Characterizations of Polynomials with Applications to Program Testing

Property Testing Lower Bounds via Communication Complexity

Testing juntas nearly optimally

Self-testing/correcting with applications to numerical problems