Hybrid Scene Compression for Visual Localization

Open AccessPosted Content

Hybrid Scene Compression for Visual Localization

- 19 Jul 2018

- arXiv: Computer Vision and Pattern Recog...

2

TL;DR: In this article, a hybrid compression algorithm is proposed to obtain a more complete scene representation without increasing the memory requirements, leading to a superior performance compared to previous compression schemes. But, it does not handle ambiguous matches arising from point compression during RANSAC.

Abstract: Localizing an image wrt. a 3D scene model represents a core task for many computer vision applications. An increasing number of real-world applications of visual localization on mobile devices, e.g., Augmented Reality or autonomous robots such as drones or self-driving cars, demand localization approaches to minimize storage and bandwidth requirements. Compressing the 3D models used for localization thus becomes a practical necessity. In this work, we introduce a new hybrid compression algorithm that uses a given memory limit in a more effective way. Rather than treating all 3D points equally, it represents a small set of points with full appearance information and an additional, larger set of points with compressed information. This enables our approach to obtain a more complete scene representation without increasing the memory requirements, leading to a superior performance compared to previous compression schemes. As part of our contribution, we show how to handle ambiguous matches arising from point compression during RANSAC. Besides outperforming previous compression techniques in terms of pose accuracy under the same memory constraints, our compression scheme itself is also more efficient. Furthermore, the localization rates and accuracy obtained with our approach are comparable to state-of-the-art feature-based methods, while using a small fraction of the memory.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Book Chapter•10.1007/978-3-030-17795-9_10

Deep Learning vs. Traditional Computer Vision

Niall O'Mahony, +7 more

- 25 Apr 2019

TL;DR: The aim of this paper is to promote a discussion on whether knowledge of classical computer vision techniques should be maintained and how the two sides of computer vision can be combined.

...read moreread less

823

•Posted Content

Large-scale, real-time visual-inertial localization revisited

Simon Lynen, +10 more

- 30 Jun 2019

- arXiv: Computer Vision and Pattern Recog...

TL;DR: In this article, the authors propose an approach that combines server-side localization with real-time visual-inertial-based camera pose tracking to achieve low-latency localization queries and efficient fusion run in realtime on mobile platforms.

...read moreread less

References

Journal Article•10.1023/B:VISI.0000029664.99615.94

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004

- International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

59.3K

Journal Article•10.1145/358669.358692

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Martin A. Fischler, +1 more

- 01 Jun 1981

- Communications of The ACM

TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.

...read moreread less

27.9K

Proceedings Article•10.1109/CVPR.2016.445

Structure-from-Motion Revisited

Johannes L. Schonberger, +1 more

- 27 Jun 2016

TL;DR: This work proposes a new SfM technique that improves upon the state of the art to make a further step towards building a truly general-purpose pipeline.

...read moreread less

6.1K

•Journal Article•10.1109/TPAMI.2007.1049

MonoSLAM: Real-Time Single Camera SLAM

Andrew J. Davison, +3 more

- 01 Jun 2007

- IEEE Transactions on Pattern Analysis an...

TL;DR: The first successful application of the SLAM methodology from mobile robotics to the "pure vision" domain of a single uncontrolled camera, achieving real time but drift-free performance inaccessible to structure from motion approaches is presented.

...read moreread less

4.4K

Proceedings Article•10.1109/CVPR.2007.383172

Object retrieval with large vocabularies and fast spatial matching

James Philbin, +4 more

- 17 Jun 2007

TL;DR: To improve query performance, this work adds an efficient spatial verification stage to re-rank the results returned from the bag-of-words model and shows that this consistently improves search quality, though by less of a margin when the visual vocabulary is large.

...read moreread less

3.6K

...

Expand

Hybrid Scene Compression for Visual Localization

Chat with Paper

AI Agents for this Paper

Citations

Deep Learning vs. Traditional Computer Vision

Large-scale, real-time visual-inertial localization revisited

References

Distinctive Image Features from Scale-Invariant Keypoints

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Structure-from-Motion Revisited

MonoSLAM: Real-Time Single Camera SLAM

Object retrieval with large vocabularies and fast spatial matching

Related Papers (5)

Attention-based 3D Object Reconstruction from a Single Image

Compression of 3D models with NURBS

Inter frame compression of 3D dynamic point clouds

Efficient 3D Objects Recognition Using Multifoveated Point Clouds.

Slimmer: Accelerating 3D Semantic Segmentation for Mobile Augmented Reality