Journal Article10.1109/cvpr52729.2023.00421
DiffRF: Rendering-Guided 3D Radiance Field Diffusion
Norman Müller,Yawar Siddiqui,Lorenzo Porzi,Samuel Rota Bulò,Peter Kontschieder,Matthias Nießner +5 more
- 01 Jun 2023
63
TL;DR: DiffRF is a novel approach for 3D radiance field synthesis based on denoising diffusion probabilistic models. It directly generates volumetric radiance fields from posed images, learns multi-view consistent priors, and enables free-view synthesis and accurate shape generation.
read more
Abstract: We introduce DiffRF, a novel approach for 3D radiance field synthesis based on denoising diffusion probabilistic models. While existing diffusion-based methods operate on images, latent codes, or point cloud data, we are the first to directly generate volumetric radiance fields. To this end, we propose a 3D denoising model which directly operates on an explicit voxel grid representation. However, as radiance fields generated from a set of posed images can be ambiguous and contain artifacts, obtaining ground truth radiance field samples is non-trivial. We address this challenge by pairing the denoising formulation with a rendering loss, enabling our model to learn a deviated prior that favours good image quality instead of trying to replicate fitting errors like floating artifacts. In contrast to 2D-diffusion models, our model learns multi-view consistent priors, enabling free-view synthesis and accurate shape generation. Compared to 3D GANs, our diffusion-based approach naturally enables conditional generation such as masked completion or single-view 3D synthesis at inference time.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Wonder3D: Single Image to 3D Using Cross-Domain Diffusion
Xiaoxiao Long,Yuan–Chen Guo,Lin Cheng,Бо Лю,Zhiyang Dou,Lingjie Liu,Yuexin Ma,Song-Hai Zhang,Marc Habermann,Christian Theobalt,Wenping Wang +10 more
- 16 Jun 2024
72
RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation
Titas Anciukevičius,Zexiang Xu,M. Fisher,Paul Henderson,Hakan Bilen,Niloy J. Mitra,Paul Guerrero +6 more
- 01 Jun 2023
TL;DR: RenderDiffusion is the first diffusion model for 3D generation and inference, trained using only monocular 2D supervision. It generates and renders an intermediate 3D representation of a scene in each denoising step.
60
Text2Tex: Text-driven Texture Synthesis via Diffusion Models
Dave Zhenyu Chen,Yawar Siddiqui,Hsin-Ying Lee,Sergey Tulyakov,Matthias Nießner +4 more
- 01 Oct 2023
TL;DR: Text2Tex generates high-quality textures for 3D meshes from text prompts using a diffusion model and dynamic segmentation.
53
Single-Stage Diffusion NeRF: A Unified Approach to 3D Generation and Reconstruction
Hansheng Chen,Jiatao Gu,Anpei Chen,Wei Tian,Zhuowen Tu,Lingjie Liu,Hao Su +6 more
- 01 Oct 2023
TL;DR: Single-Stage Diffusion NeRF is a unified approach for 3D generation and reconstruction from multi-view images, enabling simultaneous optimization of NeRF auto-decoder and latent diffusion model.
49
ScanNet++: A High-Fidelity Dataset of 3D Indoor Scenes
Chandan Yeshwanth,Yueh-Cheng Liu,Matthias Nießner,Angela Dai +3 more
- 01 Oct 2023
TL;DR: High-fidelity dataset of 3D indoor scenes with high-quality geometry, color, and semantics. Enables new benchmarks for view synthesis and 3D semantic scene understanding.
36
References
•Posted Content
Denoising Diffusion Probabilistic Models
TL;DR: High quality image synthesis results are presented using diffusion probabilistic models, a class of latent variable models inspired by considerations from nonequilibrium thermodynamics, which naturally admit a progressive lossy decompression scheme that can be interpreted as a generalization of autoregressive decoding.
High-Resolution Image Synthesis with Latent Diffusion Models
01 Jun 2022
TL;DR: This article decompose the image formation process into a sequential application of denoising autoencoders, and apply them in the latent space of powerful pretrained autoencoder.
ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes
Angela Dai,Angel X. Chang,Manolis Savva,Maciej Halber,Thomas Funkhouser,Matthias NieBner +5 more
- 21 Jul 2017
TL;DR: This work introduces ScanNet, an RGB-D video dataset containing 2.5M views in 1513 scenes annotated with 3D camera poses, surface reconstructions, and semantic segmentations, and shows that using this data helps achieve state-of-the-art performance on several 3D scene understanding tasks.
•Posted Content
Deep Unsupervised Learning using Nonequilibrium Thermodynamics
TL;DR: This work develops an approach to systematically and slowly destroy structure in a data distribution through an iterative forward diffusion process, then learns a reverse diffusion process that restores structure in data, yielding a highly flexible and tractable generative model of the data.
•Posted Content
Score-Based Generative Modeling through Stochastic Differential Equations
TL;DR: This work presents a stochastic differential equation (SDE) that smoothly transforms a complex data distribution to a known prior distribution by slowly injecting noise, and a corresponding reverse-time SDE that transforms the prior distribution back into the data distribution by Slowly removing the noise.
3.9K