Open AccessPosted Content
Neural Radiance Flow for 4D View Synthesis and Video Processing
TL;DR: This work uses a neural implicit representation that learns to capture the 3D occupancy, radiance, and dynamics of the scene, and demonstrates that the learned representation can serve as an implicit scene prior, enabling video processing tasks such as image super-resolution and de-noising without any additional supervision.
read more
Abstract: We present a method, Neural Radiance Flow (NeRFlow),to learn a 4D spatial-temporal representation of a dynamic scene from a set of RGB images. Key to our approach is the use of a neural implicit representation that learns to capture the 3D occupancy, radiance, and dynamics of the scene. By enforcing consistency across different modalities, our representation enables multi-view rendering in diverse dynamic scenes, including water pouring, robotic interaction, and real images, outperforming state-of-the-art methods for spatial-temporal view synthesis. Our approach works even when inputs images are captured with only one camera. We further demonstrate that the learned representation can serve as an implicit scene prior, enabling video processing tasks such as image super-resolution and de-noising without any additional supervision.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Posted Content
GANcraft: Unsupervised 3D Neural Rendering of Minecraft Worlds
TL;DR: GANcraft as discussed by the authors presents an unsupervised neural rendering framework for generating photorealistic images of large 3D block worlds such as those created in Minecraft, where each block is assigned a semantic label such as dirt, grass or water.
74
Spacetime Gaussian Feature Splatting for Real-Time Dynamic View Synthesis
Zhan Li,Zhang Chen,Zhong Li,Yinghao Xu +3 more
TL;DR: This work proposes Spacetime Gaussian Feature Splatting as a novel dynamic scene representation, composed of three pivotal components, and introduces splatted feature rendering, which replaces spherical harmonics with neural features.
Monocular Dynamic View Synthesis: A Reality Check
Han Gao,Ruilong Li,Shubham Tulsiani,Bryan Russell,Angjoo Kanazawa +4 more
- 24 Oct 2022
TL;DR: This work proposes a new iPhone dataset that includes more diverse real-life deformation sequences and introduces two new metrics: co-visibility masked image metrics and correspondence accuracy, which overcome the issue in existing protocols on dynamic view synthesis from monocular video.
56
Instant Volumetric Head Avatars
TL;DR: Instant Volumetric Head Avatars (INSTA) as mentioned in this paper reconstructs photo-realistic digital avatars instantaneously by modeling a dynamic neural radiance field based on neural graphics primitives embedded around a parametric face model.
56
SUDS: Scalable Urban Dynamic Scenes
TL;DR: In this article , the authors propose to factorize the scene into three separate hash table data structures to encode static, dynamic, and far-field radiance fields, and make use of unlabeled target signals consisting of RGB images, sparse LiDAR, off-the-shelf self-supervised 2D descriptors, and most importantly, 2D optical flow.
53
References
•Proceedings Article
Adam: A Method for Stochastic Optimization
Diederik P. Kingma,Jimmy Ba +1 more
- 01 Jan 2015
TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
138.5K
Image quality assessment: from error visibility to structural similarity
TL;DR: In this article, a structural similarity index is proposed for image quality assessment based on the degradation of structural information, which can be applied to both subjective ratings and objective methods on a database of images compressed with JPEG and JPEG2000.
•Proceedings Article
PyTorch: An Imperative Style, High-Performance Deep Learning Library
Adam Paszke,Sam Gross,Francisco Massa,Adam Lerer,James Bradbury,Gregory Chanan,Trevor Killeen,Zeming Lin,Natalia Gimelshein,Luca Antiga,Alban Desmaison,Andreas Kopf,Edward Z. Yang,Zachary DeVito,Martin Raison,Alykhan Tejani,Sasank Chilamkurthy,Benoit Steiner,Lu Fang,Junjie Bai,Soumith Chintala +20 more
- 01 Jan 2019
TL;DR: This paper details the principles that drove the implementation of PyTorch and how they are reflected in its architecture, and explains how the careful and pragmatic implementation of the key components of its runtime enables them to work together to achieve compelling performance.
A non-local algorithm for image denoising
Antoni Buades,Bartomeu Coll,Jean-Michel Morel +2 more
- 20 Jun 2005
TL;DR: A new measure, the method noise, is proposed, to evaluate and compare the performance of digital image denoising methods, and a new algorithm, the nonlocal means (NL-means), based on a nonlocal averaging of all pixels in the image is proposed.
•Posted Content
PointNet++: Deep Hierarchical Feature Learning on Point Sets in a Metric Space
TL;DR: A hierarchical neural network that applies PointNet recursively on a nested partitioning of the input point set and proposes novel set learning layers to adaptively combine features from multiple scales to learn deep point set features efficiently and robustly.