From average case complexity to improper learning complexity

doi:10.1145/2591796.2591820

Open Access•Proceedings Article•10.1145/2591796.2591820

Amit Daniely, +2 more

- 31 May 2014

- pp 441-448

147

TL;DR: The authors introduce a new technique for proving hardness of improper learning, based on reductions from problems that are hard on average, and put forward a generalization of Feige's assumption about the complexity of refuting random constraint satisfaction problems.

Citations

•Proceedings Article

In Search of the Real Inductive Bias: On the Role of Implicit Regularization in Deep Learning

Behnam Neyshabur, +2 more

- 20 Dec 2014

TL;DR: In this paper, the authors show that, without any regularization and even with zero training error (and zero approximation error), increasing the number of hidden units reduces estimation error.

563

•Proceedings Article•10.1109/FOCS.2017.16

Statistical Query Lower Bounds for Robust Estimation of High-Dimensional Gaussians and Gaussian Mixtures

Ilias Diakonikolas, +2 more

- 01 Oct 2017

TL;DR: This paper shows that the complexity of learning a Gaussian mixture model is exponential in the dimension of the latent space, and that statistical query algorithms can be implemented in polynomial time.

282

•Proceedings Article

Globally optimal gradient descent for a ConvNet with Gaussian inputs

Alon Brutzkus, +1 more

- 06 Aug 2017

TL;DR: This work provides the first global optimality guarantee of gradient descent on a convolutional neural network with ReLU activations, and shows that learning is NP-complete in the general case, but that when the input distribution is Gaussian, gradient descent converges to the global optimum in polynomial time.

235

•Posted Content

Geometry of Optimization and Implicit Regularization in Deep Learning.

Behnam Neyshabur, +3 more

- 08 May 2017

- arXiv: Learning

TL;DR: This work argues that optimization plays a crucial role in the generalization of deep learning models through implicit regularization, and demonstrates how changing the empirical optimization procedure can improve generalization even if the actual optimization quality is not affected.

150

•Proceedings Article

Noisy Tensor Completion via the Sum-of-Squares Hierarchy

Boaz Barak, +1 more

- 06 Jun 2016

TL;DR: The main technical result is in characterizing the Rademacher complexity of the sequence of norms that arise in the sum-of-squares relaxations to the tensor nuclear norm.

123

References

Statistical learning theory

Vladimir Vapnik

- 01 Jan 1998

TL;DR: Presenting a method for determining the necessary and sufficient conditions for consistency of the learning process, the author covers function estimation from small data pools, the application of these estimates to real-life problems, and much more.

30.4K

•Book

Neural networks for pattern recognition

Christopher M. Bishop

- 01 Jan 1995

TL;DR: This is the first comprehensive treatment of feed-forward neural networks from the perspective of statistical pattern recognition, and is designed as a text, with over 100 exercises, to benefit anyone involved in the fields of neural computation and pattern recognition.

19.9K

•Book Chapter•10.1016/S0065-2458(08)60404-0

Neural Networks for Pattern Recognition

Suresh Kothari, +1 more

- 01 Jan 1993

- Advances in Computers

TL;DR: The chapter discusses two important directions of research to improve learning algorithms: the dynamic node generation, which is used by the cascade correlation algorithm; and designing learning algorithms where the choice of parameters is not an issue.

14.5K

•Journal Article•10.1037/H0042519

The perceptron: a probabilistic model for information storage and organization in the brain.

Frank Rosenblatt

- 01 Nov 1958

- Psychological Review

TL;DR: This article will be concerned primarily with the second and third questions, which are still subject to a vast amount of speculation, and where the few relevant facts currently supplied by neurophysiology have not yet been integrated into an acceptable theory.

10.6K

•Book

The perceptron: a probabilistic model for information storage and organization in the brain

F. Rosenblatt

- 01 Jan 1988

TL;DR: The second and third questions are still subject to a vast amount of speculation, and the few relevant facts currently supplied by neurophysiology have not yet been integrated into an acceptable theory.

9.3K