Sparse Coding and Autoencoders

doi:10.1109/ISIT.2018.8437533

Open AccessProceedings Article10.1109/ISIT.2018.8437533

Sparse Coding and Autoencoders

Akshay Rangamani, +6 more

- 17 Jun 2018

- pp 36-40

16

TL;DR: It is proved that a layer of ReLU gates can be set up to automatically recover the support of the sparse codes when the data generative model is that of “Sparse Coding”/“Dictionary Learning”.

Abstract: In this work we study the landscape of squared loss of an Autoencoder when the data generative model is that of “Sparse Coding”/“Dictionary Learning”. The neural net considered is an $\mathbb{R}^{n}\rightarrow \mathbb{R}^{n}$ mapping and has a single ReLU activation layer of size $h > n$ . The net has access to vectors $y\in \mathbb{R}^{n}$ obtained as $y=A^{\ast}x^{\ast}$ where $x^{\ast}\in \mathbb{R}^{h}$ are sparse high dimensional vectors and $A^{\ast}\in \mathbb{R}^{n\times h}$ is an overcomplete incoherent matrix. Under very mild distributional assumptions on $x^{\ast}$ , we prove that the norm of the expected gradient of the squared loss function is asymptotically (in sparse code dimension) negligible for all points in a small neighborhood of $A^{\ast}$ . This is supported with experimental evidence using synthetic data. We conduct experiments to suggest that $A^{\ast}$ sits at the bottom of a well in the landscape and we also give experiments showing that gradient descent on this loss function gets columnwise very close to the original dictionary even with far enough initialization. Along the way we prove that a layer of ReLU gates can be set up to automatically recover the support of the sparse codes. Since this property holds independent of the loss function we believe that it could be of independent interest. A full version of this paper is accessible at: https://arxiv.org/abs/1708.03735

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1109/ACCESS.2020.3021943

Deep Learning Based Systems Developed for Fall Detection: A Review

Md. Milon Islam, +6 more

- 04 Sep 2020

- IEEE Access

TL;DR: Among the reviewed systems, three dimensional (3D) CNN, CNN with 10-fold cross-validation, LSTM with CNN based systems performed the best in terms of accuracy, sensitivity, specificity, etc.

...read moreread less

118

•Posted Content

Convergence Guarantees for RMSProp and ADAM in Non-Convex Optimization and an Empirical Comparison to Nesterov Acceleration

Soham De, +2 more

- 18 Jul 2018

- arXiv: Learning

TL;DR: This work provides proofs that these adaptive gradient algorithms are guaranteed to reach criticality for smooth non-convex objectives, and gives bounds on the running time of these algorithms.

...read moreread less

112

•Proceedings Article

On Random Deep Weight-Tied Autoencoders: Exact Asymptotic Analysis, Phase Transitions, and Implications to Training.

Ping Li, +1 more

- 27 Sep 2018

TL;DR: It is demonstrated experimentally that it is possible to train a deep autoencoder, even with the tanh activation and a depth as large as 200 layers, without resorting to techniques such as layer-wise pre-training or batch normalization.

...read moreread less

35

Journal Article•10.1016/J.ANUCENE.2020.107307

A mixed intelligent condition monitoring method for nuclear power plant

Binsen Peng, +5 more

- 01 Jun 2020

- Annals of Nuclear Energy

TL;DR: It can be known that sparse autoencoder can extract the nature of operating data, and monitoring accuracy of 100% and 98% can be achieved under one operating condition and two operating conditions by isolation forest method, respectively.

...read moreread less

26

•Journal Article•10.1016/J.ULTRAS.2021.106637

Autoencoder-based detection of near-surface defects in ultrasonic testing.

Jong Moon Ha, +3 more

- 01 Feb 2022

- Ultrasonics

TL;DR: In this paper, an adaptive autoencoder was proposed to predict the normal behavior of ultrasonic signals including disturbances, thus enabling the identification of even subtle deviations made by defects.

...read moreread less

23

...

Expand

References

•Posted Content

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

Martín Abadi, +39 more

- 01 Jan 2015

- arXiv: Distributed, Parallel, and Cluste...

TL;DR: The TensorFlow interface and an implementation of that interface that is built at Google are described, which has been used for conducting research and for deploying machine learning systems into production across more than a dozen areas of computer science and other fields.

...read moreread less

13.6K

Journal Article•10.1109/TSP.2006.881199

$rm K$ -SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation

Michal Aharon, +2 more

- 01 Nov 2006

- IEEE Transactions on Signal Processing

TL;DR: A novel algorithm for adapting dictionaries in order to achieve sparse signal representations, the K-SVD algorithm, an iterative method that alternates between sparse coding of the examples based on the current dictionary and a process of updating the dictionary atoms to better fit the data.

...read moreread less

10K

•Proceedings Article•10.1145/1390156.1390294

Extracting and composing robust features with denoising autoencoders

Pascal Vincent, +3 more

- 05 Jul 2008

TL;DR: This work introduces and motivate a new training principle for unsupervised learning of a representation based on the idea of making the learned representations robust to partial corruption of the input pattern.

...read moreread less

9K

Journal Article•10.1038/381607A0

Emergence of simple-cell receptive field properties by learning a sparse code for natural images

Bruno A. Olshausen, +2 more

- 13 Jun 1996

- Nature

TL;DR: It is shown that a learning algorithm that attempts to find sparse linear codes for natural scenes will develop a complete family of localized, oriented, bandpass receptive fields, similar to those found in the primary visual cortex.

...read moreread less

6.5K

•Journal Article•10.1016/S0042-6989(97)00169-7

Sparse Coding with an Overcomplete Basis Set: A Strategy Employed by V1 ?

Bruno A. Olshausen, +1 more

- 01 Dec 1997

- Vision Research

TL;DR: These deviations from linearity provide a potential explanation for the weak forms of non-linearity observed in the response properties of cortical simple cells, and they further make predictions about the expected interactions among units in response to naturalistic stimuli.

...read moreread less

4.2K

...

Expand

Sparse Coding and Autoencoders

Chat with Paper

AI Agents for this Paper

Citations

Deep Learning Based Systems Developed for Fall Detection: A Review

Convergence Guarantees for RMSProp and ADAM in Non-Convex Optimization and an Empirical Comparison to Nesterov Acceleration

On Random Deep Weight-Tied Autoencoders: Exact Asymptotic Analysis, Phase Transitions, and Implications to Training.

A mixed intelligent condition monitoring method for nuclear power plant

Autoencoder-based detection of near-surface defects in ultrasonic testing.

References

TensorFlow: Large-Scale Machine Learning on Heterogeneous Distributed Systems

$rm K$ -SVD: An Algorithm for Designing Overcomplete Dictionaries for Sparse Representation

Extracting and composing robust features with denoising autoencoders

Emergence of simple-cell receptive field properties by learning a sparse code for natural images

Sparse Coding with an Overcomplete Basis Set: A Strategy Employed by V1 ?

Related Papers (5)

New Algorithms for Learning Incoherent and Overcomplete Dictionaries

Learning Mixtures of Sparse Linear Regressions Using Sparse Graph Codes

On the Sample Complexity of Predictive Sparse Coding

Simple Bounds for Recovering Low-complexity Models

Compressed Sensing with Adversarial Sparse Noise via L1 Regression.