Open AccessPosted Content
HoloGAN: Unsupervised learning of 3D representations from natural images
TL;DR: HoloGAN is the first generative model that learns 3D representations from natural images in an entirely unsupervised manner and is shown to be able to generate images with similar or higher visual quality than other generative models.
read more
Abstract: We propose a novel generative adversarial network (GAN) for the task of unsupervised learning of 3D representations from natural images. Most generative models rely on 2D kernels to generate images and make few assumptions about the 3D world. These models therefore tend to create blurry images or artefacts in tasks that require a strong 3D understanding, such as novel-view synthesis. HoloGAN instead learns a 3D representation of the world, and to render this representation in a realistic manner. Unlike other GANs, HoloGAN provides explicit control over the pose of generated objects through rigid-body transformations of the learnt 3D features. Our experiments show that using explicit 3D features enables HoloGAN to disentangle 3D pose and identity, which is further decomposed into shape and appearance, while still being able to generate images with similar or higher visual quality than other generative models. HoloGAN can be trained end-to-end from unlabelled 2D images only. Particularly, we do not require pose labels, 3D shapes, or multiple views of the same objects. This shows that HoloGAN is the first generative model that learns 3D representations from natural images in an entirely unsupervised manner.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Make Your Brief Stroke Real and Stereoscopic: 3D-Aware Simplified Sketch to Portrait Generation
Yasheng Sun,Qianyi Wu,Hang Zhou,Kaisiyuan Wang,Tianshu Hu,Chenxi Liao,Dongliang He,Jingtuo Liu,Errui Ding,Jingdong Wang,Shio Miyafuji,Ziwei Liu,Hideki Koike +12 more
TL;DR: Wang et al. as mentioned in this paper presented Stereoscopic Simplified Sketch-to-Portrait (SSSP), which explores the possibility of creating stereoscopic 3D-aware portraits from simple contour sketches by involving 3D generative models.
Removing Objects From Neural Radiance Fields
Silvan Weder,Guillermo Garcia-Hernando,Aron Monszpart,Marc Pollefeys,Gabriel J. Brostow,Michael Firman,Sara Vicente +6 more
TL;DR: In this paper , a confidence-based view selection procedure is used to select the individual 2D inpainted images to use in the creation of the NeRF, so that the resulting inpainted NeRF is 3D consistent.
SinGRAV: Learning a Generative Radiance Volume from a Single Natural Scene
TL;DR: The ability of SinGRAV in generating plausible and diverse variations from a single scene, the merits of Sin GRAV over state-of-the-art generative neural scene methods, as well as the versatility of the method by its use in a variety of applications, spanning 3D scene editing, composition, and animation are demonstrated.
DeepFaceEditing: deep face generation and editing with disentangled geometry and appearance control
TL;DR: DeepFaceEditing as discussed by the authors proposes a structured disentanglement framework specifically designed for face images to support face generation and editing with disentangled control of geometry and appearance, which can achieve fine control of facial details such as wrinkles.
Decomposing NeRF for Editing via Feature Field Distillation
S. Kobayashi,Eiichi Matsumoto,Vincent Sitzmann +2 more
- 31 May 2022
TL;DR: The authors propose to distill the knowledge of off-the-shelf, self-supervised 2D image feature extractors such as CLIP-LSeg or DINO into a 3D feature field optimized in parallel to the radiance field.
References
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
•Posted Content
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.
82.5K
Going deeper with convolutions
Christian Szegedy,Wei Liu,Yangqing Jia,Pierre Sermanet,Scott Reed,Dragomir Anguelov,Dumitru Erhan,Vincent Vanhoucke,Andrew Rabinovich +8 more
- 07 Jun 2015
TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).
Generative Adversarial Nets
Ian Goodfellow,Jean Pouget-Abadie,Mehdi Mirza,Bing Xu,David Warde-Farley,Sherjil Ozair,Aaron Courville,Yoshua Bengio +7 more
- 08 Dec 2014
TL;DR: A new framework for estimating generative models via an adversarial process, in which two models are simultaneously train: a generative model G that captures the data distribution and a discriminative model D that estimates the probability that a sample came from the training data rather than G.
Image-to-Image Translation with Conditional Adversarial Networks
Phillip Isola,Jun-Yan Zhu,Tinghui Zhou,Alexei A. Efros +3 more
- 21 Jul 2017
TL;DR: Conditional adversarial networks are investigated as a general-purpose solution to image-to-image translation problems and it is demonstrated that this approach is effective at synthesizing photos from label maps, reconstructing objects from edge maps, and colorizing images, among other tasks.