Open Access • Posted Content
HoloGAN: Unsupervised learning of 3D representations from natural images
TL;DR: HoloGAN is the first generative model that learns 3D representations from natural images in an entirely unsupervised manner and is shown to be able to generate images with similar or higher visual quality than other generative models.
Abstract: We propose a novel generative adversarial network (GAN) for the task of unsupervised learning of 3D representations from natural images. Most generative models rely on 2D kernels to generate images and make few assumptions about the 3D world. These models therefore tend to create blurry images or artefacts in tasks that require a strong 3D understanding, such as novel-view synthesis. HoloGAN instead learns a 3D representation of the world and learns to render this representation in a realistic manner. Unlike other GANs, HoloGAN provides explicit control over the pose of generated objects through rigid-body transformations of the learnt 3D features. Our experiments show that using explicit 3D features enables HoloGAN to disentangle 3D pose and identity, which is further decomposed into shape and appearance, while still being able to generate images with similar or higher visual quality than other generative models. HoloGAN can be trained end-to-end from unlabelled 2D images only. In particular, we do not require pose labels, 3D shapes, or multiple views of the same objects. This shows that HoloGAN is the first generative model that learns 3D representations from natural images in an entirely unsupervised manner.
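The pose control described in the abstract, a rigid-body transformation applied to a grid of 3D features, can be illustrated with a minimal sketch. This is not the authors' implementation: the function name `rotate_voxels_y`, the scalar-per-voxel grid layout `grid[z][y][x]`, and the nearest-neighbour resampling are all assumptions chosen to keep the example short. A real model would hold a feature vector per voxel and would more likely resample by interpolation.

```python
import math


def rotate_voxels_y(grid, angle_deg, fill=0.0):
    """Rotate a cubic voxel grid, indexed grid[z][y][x], about the
    vertical (y) axis through the grid centre.

    Hypothetical helper for illustration only. It uses inverse mapping:
    for every output voxel it looks up which input voxel lands there,
    then copies the nearest one. Voxels rotated in from outside the
    grid receive `fill`.
    """
    n = len(grid)
    centre = (n - 1) / 2.0
    angle = math.radians(angle_deg)
    cos_a, sin_a = math.cos(angle), math.sin(angle)

    out = [[[fill for _ in range(n)] for _ in range(n)] for _ in range(n)]
    for z in range(n):
        for y in range(n):
            for x in range(n):
                # Output coordinates relative to the centre.
                dx, dz = x - centre, z - centre
                # Inverse rotation gives the source position.
                src_x = cos_a * dx + sin_a * dz + centre
                src_z = -sin_a * dx + cos_a * dz + centre
                ix, iz = round(src_x), round(src_z)
                if 0 <= ix < n and 0 <= iz < n:
                    # y is the rotation axis, so it is unchanged.
                    out[z][y][x] = grid[iz][y][ix]
    return out


# A 3x3x3 grid with a single active feature at (z=1, y=1, x=2).
features = [[[0.0] * 3 for _ in range(3)] for _ in range(3)]
features[1][1][2] = 1.0

rotated = rotate_voxels_y(features, 90)
# The feature moves within the horizontal plane; its height y stays 1.
print(rotated[2][1][1])
```

Because the transformation acts on the 3D grid before any projection to 2D, changing `angle_deg` changes only where features sit in space, not what they are, which is the intuition behind separating pose from identity.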
Citations
GRM: Large Gaussian Reconstruction Model for Efficient 3D Reconstruction and Generation
Yinghao Xu, Zifan Shi, Yifan Wang, Hansheng Chen, Ceyuan Yang, Sida Peng, Yujun Shen, Gordon Wetzstein
3
A Survey on 3D-aware Image Synthesis
25 Oct 2022
TL;DR: 3D-aware generative models have shown that introducing 3D information can lead to more controllable image generation, thus bridging the gap between 2D imagery and 3D reality.
3
•Posted Content
3DMaterialGAN: Learning 3D Shape Representation from Latent Space for Materials Science Applications.
Devendra K. Jangid, Neal R. Brodnik, Amil Khan, McLean P. Echlin, Tresa M. Pollock, Sam Daly, B.S. Manjunath
TL;DR: A 3DMaterialGAN network is proposed that is capable of recognizing and synthesizing individual grains whose morphology conforms to a given 3D polycrystalline material microstructure, which lays the foundation for the recognition and synthesis of polycrystalline microstructures used in additive manufacturing, aerospace, and structural design applications.
3
•Posted Content
CUDA-GHR: Controllable Unsupervised Domain Adaptation for Gaze and Head Redirection
Swati Jindal, Xin Eric Wang
TL;DR: In this paper, an unsupervised domain adaptation framework is proposed to enable fine-grained control over gaze and head pose directions while preserving the appearance-related factors of the person.
3
OrthoPlanes: A Novel Representation for Better 3D-Awareness of GANs
Honglin He, Zhuoqian Yang, Shikai Li, B. Z. Dai, Wayne Wu
TL;DR: This work proposes a hybrid explicit-implicit representation called OrthoPlanes, which encodes fine-grained 3D information in feature maps that can be efficiently generated by modifying 2D StyleGANs.
References
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma, Jimmy Ba
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
•Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma, Jimmy Ba
TL;DR: Adam is an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments.
82.5K
Going deeper with convolutions
Christian Szegedy, Wei Liu, Yangqing Jia, Pierre Sermanet, Scott Reed, Dragomir Anguelov, Dumitru Erhan, Vincent Vanhoucke, Andrew Rabinovich
- 07 Jun 2015
TL;DR: Inception is a deep convolutional neural network architecture that achieved a new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Generative Adversarial Nets
Ian Goodfellow, Jean Pouget-Abadie, Mehdi Mirza, Bing Xu, David Warde-Farley, Sherjil Ozair, Aaron Courville, Yoshua Bengio
- 08 Dec 2014
TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously trained: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola, Jun-Yan Zhu, Tinghui Zhou, Alexei A. Efros
- 21 Jul 2017
TL;DR: Conditional adversarial networks are investigated as a general-purpose solution to image-to-image translation problems and it is demonstrated that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.