KinectFusion: Real-time dense surface mapping and tracking

doi:10.1109/ISMAR.2011.6092378

Open AccessProceedings Article10.1109/ISMAR.2011.6092378

KinectFusion: Real-time dense surface mapping and tracking

Richard Newcombe, +9 more

- 26 Oct 2011

- pp 127-136

4.8K

TL;DR: A system for accurate real-time mapping of complex and arbitrary indoor scenes in variable lighting conditions, using only a moving low-cost depth camera and commodity graphics hardware, which fuse all of the depth data streamed from a Kinect sensor into a single global implicit surface model of the observed scene in real- time.

Abstract: We present a system for accurate real-time mapping of complex and arbitrary indoor scenes in variable lighting conditions, using only a moving low-cost depth camera and commodity graphics hardware. We fuse all of the depth data streamed from a Kinect sensor into a single global implicit surface model of the observed scene in real-time. The current sensor pose is simultaneously obtained by tracking the live depth frame relative to the global model using a coarse-to-fine iterative closest point (ICP) algorithm, which uses all of the observed depth data available. We demonstrate the advantages of tracking against the growing full surface model compared with frame-to-frame tracking, obtaining tracking and mapping results in constant time within room sized scenes with limited drift and high accuracy. We also show both qualitative and quantitative results relating to various aspects of our tracking and mapping system. Modelling of natural scenes, in real-time with only commodity sensor and GPU hardware, promises an exciting step forward in augmented reality (AR), in particular, it allows dense surfaces to be reconstructed in real-time, with a level of detail and robustness beyond any solution yet presented using passive computer vision.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception as mentioned in this paper is a deep convolutional neural network architecture that achieves the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

...read moreread less

56.6K

•Journal Article•10.1109/TRO.2017.2705103

ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras

Raul Mur-Artal, +1 more

- 12 Jun 2017

- IEEE Transactions on Robotics

TL;DR: ORB-SLAM2, a complete simultaneous localization and mapping (SLAM) system for monocular, stereo and RGB-D cameras, including map reuse, loop closing, and relocalization capabilities, is presented, being in most cases the most accurate SLAM solution.

...read moreread less

5.4K

•Proceedings Article•10.1109/CVPR.2017.261

ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes

Angela Dai, +5 more

- 21 Jul 2017

TL;DR: This work introduces ScanNet, an RGB-D video dataset containing 2.5M views in 1513 scenes annotated with 3D camera poses, surface reconstructions, and semantic segmentations, and shows that using this data helps achieve state-of-the-art performance on several 3D scene understanding tasks.

...read moreread less

4.7K

•Journal Article•10.1109/TRO.2017.2705103

ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras

Raul Mur-Artal, +1 more

- 20 Oct 2016

- arXiv: Robotics

TL;DR: ORB-SLAM2 as mentioned in this paper is a complete SLAM system for monocular, stereo and RGB-D cameras, including map reuse, loop closing and relocalization capabilities.

...read moreread less

4.3K

•Proceedings Article•10.1109/IROS.2012.6385773

A benchmark for the evaluation of RGB-D SLAM systems

Jrgen Sturm, +4 more

- 24 Dec 2012

TL;DR: A large set of image sequences from a Microsoft Kinect with highly accurate and time-synchronized ground truth camera poses from a motion capture system is recorded for the evaluation of RGB-D SLAM systems.

...read moreread less

4.3K

...

Expand

References

Journal Article•10.1109/34.121791

A method for registration of 3-D shapes

Paul J. Besl, +1 more

- 01 Feb 1992

- IEEE Transactions on Pattern Analysis an...

TL;DR: In this paper, the authors describe a general-purpose representation-independent method for the accurate and computationally efficient registration of 3D shapes including free-form curves and surfaces, based on the iterative closest point (ICP) algorithm, which requires only a procedure to find the closest point on a geometric entity to a given point.

...read moreread less

20.6K

•Proceedings Article•10.1145/37401.37422

Marching cubes: A high resolution 3D surface construction algorithm

William E. Lorensen, +1 more

- 01 Aug 1987

TL;DR: In this paper, a divide-and-conquer approach is used to generate inter-slice connectivity, and then a case table is created to define triangle topology using linear interpolation.

...read moreread less

14.5K

Proceedings Article•10.1109/ICCV.1998.710815

Bilateral filtering for gray and color images

Carlo Tomasi, +1 more

- 04 Jan 1998

TL;DR: In contrast with filters that operate on the three bands of a color image separately, a bilateral filter can enforce the perceptual metric underlying the CIE-Lab color space, and smooth colors and preserve edges in a way that is tuned to human perception.

...read moreread less

9.7K

Proceedings Article•10.1109/ISMAR.2007.4538852

Parallel Tracking and Mapping for Small AR Workspaces

Georg Klein, +1 more

- 13 Nov 2007

TL;DR: A system specifically designed to track a hand-held camera in a small AR workspace, processed in parallel threads on a dual-core computer, that produces detailed maps with thousands of landmarks which can be tracked at frame-rate with accuracy and robustness rivalling that of state-of-the-art model-based systems.

...read moreread less

4.5K

•Proceedings Article•10.1145/237170.237269

A volumetric method for building complex models from range images

Brian Curless, +1 more

- 01 Aug 1996

TL;DR: This paper presents a volumetric method for integrating range images that is able to integrate a large number of range images yielding seamless, high-detail models of up to 2.6 million triangles.

...read moreread less

3.7K

...

Expand

KinectFusion: Real-time dense surface mapping and tracking

Chat with Paper

AI Agents for this Paper

Citations

Going deeper with convolutions

ORB-SLAM2: An Open-Source SLAM System for Monocular, Stereo, and RGB-D Cameras

ScanNet: Richly-Annotated 3D Reconstructions of Indoor Scenes

ORB-SLAM2: an Open-Source SLAM System for Monocular, Stereo and RGB-D Cameras

A benchmark for the evaluation of RGB-D SLAM systems

References

A method for registration of 3-D shapes

Marching cubes: A high resolution 3D surface construction algorithm

Bilateral filtering for gray and color images

Parallel Tracking and Mapping for Small AR Workspaces

A volumetric method for building complex models from range images

Related Papers (5)

A method for registration of 3-D shapes

Parallel Tracking and Mapping for Small AR Workspaces

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Marching cubes: A high resolution 3D surface construction algorithm

LSD-SLAM: Large-Scale Direct Monocular SLAM