PixelVAE: A Latent Variable Model for Natural Images

Open Access • Posted Content

- 15 Nov 2016

255

TL;DR: PixelVAE is a VAE with an autoregressive decoder based on PixelCNN. It achieves state-of-the-art performance on binarized MNIST, competitive performance on 64x64 ImageNet, and high-quality samples on the LSUN bedrooms dataset.

Citations

•Posted Content

Decision-Making with Auto-Encoding Variational Bayes

Romain Lopez, +4 more

- 17 Feb 2020

- arXiv: Machine Learning

TL;DR: This work describes the error of importance sampling as a function of posterior variance and shows that proposal distributions learned with evidence upper bounds are better than the current state of the art.

7.5K

•Journal Article•10.1561/2200000056

An Introduction to Variational Autoencoders.

Diederik P. Kingma, +1 more

- 06 Jun 2019

- arXiv: Learning

TL;DR: This work provides an introduction to variational autoencoders and some important extensions, which provide a principled framework for learning deep latent-variable models and corresponding inference models.

2.4K

•Posted Content

NVAE: A Deep Hierarchical Variational Autoencoder

Arash Vahdat, +1 more

- 08 Jul 2020

- arXiv: Machine Learning

TL;DR: NVAE is the first successful VAE applied to natural images as large as 256$\times$256 pixels and achieves state-of-the-art results among non-autoregressive likelihood-based models on the MNIST, CIFAR-10, CelebA 64, and CelebA HQ datasets and it provides a strong baseline on FFHQ.

773

•Posted Content

InfoVAE: Information Maximizing Variational Autoencoders

Shengjia Zhao, +2 more

- 07 Jun 2017

- arXiv: Learning

TL;DR: It is shown that this model can significantly improve the quality of the variational posterior and can make effective use of the latent features regardless of the flexibility of the decoding distribution, and it is demonstrated that the models outperform competing approaches on multiple performance metrics.

560

•Posted Content

Neural Audio Synthesis of Musical Notes with WaveNet Autoencoders

Jesse Engel, +6 more

- 05 Apr 2017

- arXiv: Learning

TL;DR: A powerful new WaveNet-style autoencoder model is detailed that conditions an autoregressive decoder on temporal codes learned from the raw audio waveform, and NSynth, a large-scale and high-quality dataset of musical notes that is an order of magnitude larger than comparable public datasets is introduced.

506

References

•Journal Article•10.1109/5.726791

Gradient-based learning applied to document recognition

Yann LeCun, +6 more

- 01 Jan 1998

TL;DR: In this article, a graph transformer network (GTN) is proposed for handwritten character recognition, which can be used to synthesize a complex decision surface that can classify high-dimensional patterns, such as handwritten characters.

53.5K

•Journal Article•10.3156/JSOFT.29.5_177_2

Generative Adversarial Nets

Ian Goodfellow, +7 more

- 08 Dec 2014

TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are trained simultaneously: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.

48.6K

•Proceedings Article

Auto-Encoding Variational Bayes

Diederik P. Kingma, +1 more

- 01 Jan 2014

TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.

28.9K

•Proceedings Article

Unsupervised Representation Learning with Deep Convolutional Generative Adversarial Networks

Alec Radford, +2 more

- 01 Jan 2016

TL;DR: Deep convolutional generative adversarial networks (DCGANs) learn a hierarchy of representations, from object parts to scenes, in both the generator and the discriminator for unsupervised learning.

7.3K

•Proceedings Article

Variational Inference with Normalizing Flows

Danilo Jimenez Rezende, +1 more

- 06 Jul 2015

TL;DR: It is demonstrated that the theoretical advantages of having posteriors that better match the true posterior, combined with the scalability of amortized variational approaches, provide a clear improvement in the performance and applicability of variational inference.

3.4K

Related Papers (5)

Auto-Encoding Variational Bayes

Generative Adversarial Nets

beta-VAE: Learning Basic Visual Concepts with a Constrained Variational Framework

Adam: A Method for Stochastic Optimization

Deep Learning Face Attributes in the Wild