Open AccessPosted Content
Hybrid VAE: Improving Deep Generative Models using Partial Observations.
TL;DR: It is shown that such a combination is beneficial because the unlabeled data acts as a data-driven form of regularization, allowing generative models trained on few labeled samples to reach the performance of fully-supervised generative model trained on much larger datasets.
read more
Abstract: Deep neural network models trained on large labeled datasets are the state-of-the-art in a large variety of computer vision tasks. In many applications, however, labeled data is expensive to obtain or requires a time consuming manual annotation process. In contrast, unlabeled data is often abundant and available in large quantities. We present a principled framework to capitalize on unlabeled data by training deep generative models on both labeled and unlabeled data. We show that such a combination is beneficial because the unlabeled data acts as a data-driven form of regularization, allowing generative models trained on few labeled samples to reach the performance of fully-supervised generative models trained on much larger datasets. We call our method Hybrid VAE (H-VAE) as it contains both the generative and the discriminative parts. We validate H-VAE on three large-scale datasets of different modalities: two face datasets: (MultiPIE, CelebA) and a hand pose dataset (NYU Hand Pose). Our qualitative visualizations further support improvements achieved by using partial observations.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
MoCoGAN: Decomposing Motion and Content for Video Generation
Sergey Tulyakov,Ming-Yu Liu,Xiaodong Yang,Jan Kautz +3 more
- 18 Jun 2018
TL;DR: MoCoGAN as discussed by the authors decomposes the visual signals in a video into content and motion, and learns motion and content decomposition in an unsupervised manner using both image and video discriminators.
•Posted Content
MoCoGAN: Decomposing Motion and Content for Video Generation
TL;DR: MoCoGAN as mentioned in this paper decomposes the visual signals in a video into content and motion, and learns motion and content decomposition in an unsupervised manner using both image and video discriminators.
419
•Posted Content
Learning to Decompose and Disentangle Representations for Video Prediction
TL;DR: The Decompositional Disentangled Predictive Auto-Encoder (DDPAE) is proposed, a framework that combines structured probabilistic models and deep networks to automatically decompose the high-dimensional video that the authors aim to predict into components, and disentangle each component to have low-dimensional temporal dynamics that are easier to predict.
241
•Proceedings Article
Learning to Decompose and Disentangle Representations for Video Prediction
Jun-Ting Hsieh,Bingbin Liu,De-An Huang,Li Fei-Fei,Juan Carlos Niebles +4 more
- 01 Jan 2018
TL;DR: The Decompositional Disentangled Predictive AutoEncoder (DDPAE) as mentioned in this paper combines structured probabilistic models and deep networks to automatically decompose the high-dimensional video into components, and disentangle each component to have low-dimensional temporal dynamics that are easier to predict.
•Proceedings Article
Hybrid Models with Deep and Invertible Features
Eric Nalisnick,Akihiro Matsukawa,Yee Whye Teh,Dilan Gorur,Balaji Lakshminarayanan +4 more
- 24 May 2019
TL;DR: The hybrid model, despite the invertibility constraints, achieves similar accuracy to purely predictive models, and the generative component remains a good model of the input features despite the hybrid optimization objective.
References
Generative Adversarial Nets
Ian Goodfellow,Jean Pouget-Abadie,Mehdi Mirza,Bing Xu,David Warde-Farley,Sherjil Ozair,Aaron Courville,Yoshua Bengio +7 more
- 08 Dec 2014
TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
•Proceedings Article
Auto-Encoding Variational Bayes
Diederik P. Kingma,Max Welling +1 more
- 01 Jan 2014
TL;DR: A stochastic variational inference and learning algorithm that scales to large datasets and, under some mild differentiability conditions, even works in the intractable case is introduced.
•Proceedings Article
Rectified Linear Units Improve Restricted Boltzmann Machines
Vinod Nair,Geoffrey E. Hinton +1 more
- 21 Jun 2010
TL;DR: Restricted Boltzmann machines were developed using binary stochastic hidden units that learn features that are better for object recognition on the NORB dataset and face verification on the Labeled Faces in the Wild dataset.
Deep Learning Face Attributes in the Wild
Ziwei Liu,Ping Luo,Xiaogang Wang,Xiaoou Tang +3 more
- 07 Dec 2015
TL;DR: A novel deep learning framework for attribute prediction in the wild that cascades two CNNs, LNet and ANet, which are fine-tuned jointly with attribute tags, but pre-trained differently.
•Posted Content
Improved Techniques for Training GANs
TL;DR: In this article, the authors present a variety of new architectural features and training procedures that apply to the generative adversarial networks (GANs) framework and achieve state-of-the-art results in semi-supervised classification on MNIST, CIFAR-10 and SVHN.
7.4K
Related Papers (5)
Diederik P. Kingma,Max Welling +1 more
- 01 Jan 2014
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
Shuangjun Liu,Sarah Ostadabbas +1 more
- 08 Sep 2018