Open AccessPosted Content
Hybrid Scene Compression for Visual Localization
TL;DR: In this article, a hybrid compression algorithm is proposed to obtain a more complete scene representation without increasing the memory requirements, leading to a superior performance compared to previous compression schemes. But, it does not handle ambiguous matches arising from point compression during RANSAC.
read more
Abstract: Localizing an image wrt. a 3D scene model represents a core task for many computer vision applications. An increasing number of real-world applications of visual localization on mobile devices, e.g., Augmented Reality or autonomous robots such as drones or self-driving cars, demand localization approaches to minimize storage and bandwidth requirements. Compressing the 3D models used for localization thus becomes a practical necessity. In this work, we introduce a new hybrid compression algorithm that uses a given memory limit in a more effective way. Rather than treating all 3D points equally, it represents a small set of points with full appearance information and an additional, larger set of points with compressed information. This enables our approach to obtain a more complete scene representation without increasing the memory requirements, leading to a superior performance compared to previous compression schemes. As part of our contribution, we show how to handle ambiguous matches arising from point compression during RANSAC. Besides outperforming previous compression techniques in terms of pose accuracy under the same memory constraints, our compression scheme itself is also more efficient. Furthermore, the localization rates and accuracy obtained with our approach are comparable to state-of-the-art feature-based methods, while using a small fraction of the memory.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Deep Learning vs. Traditional Computer Vision
Niall O'Mahony,Sean Campbell,Anderson Carvalho,Suman Harapanahalli,Gustavo Velasco Hernandez,Lenka Krpalkova,Daniel Riordan,Joseph Walsh +7 more
- 25 Apr 2019
TL;DR: The aim of this paper is to promote a discussion on whether knowledge of classical computer vision techniques should be maintained and how the two sides of computer vision can be combined.
823
•Posted Content
Large-scale, real-time visual-inertial localization revisited
Simon Lynen,Bernhard Zeisl,Dror Aiger,Michael Bosse,Michael Bosse,Joel A. Hesch,Joel A. Hesch,Marc Pollefeys,Marc Pollefeys,Roland Siegwart,Torsten Sattler +10 more
TL;DR: In this article, the authors propose an approach that combines server-side localization with real-time visual-inertial-based camera pose tracking to achieve low-latency localization queries and efficient fusion run in realtime on mobile platforms.
References
Distinctive Image Features from Scale-Invariant Keypoints
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography
TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.
Structure-from-Motion Revisited
Johannes L. Schonberger,Jan-Michael Frahm +1 more
- 27 Jun 2016
TL;DR: This work proposes a new SfM technique that improves upon the state of the art to make a further step towards building a truly general-purpose pipeline.
MonoSLAM: Real-Time Single Camera SLAM
TL;DR: The first successful application of the SLAM methodology from mobile robotics to the "pure vision" domain of a single uncontrolled camera, achieving real time but drift-free performance inaccessible to structure from motion approaches is presented.
Object retrieval with large vocabularies and fast spatial matching
James Philbin,Ondrej Chum,Michael Isard,Josef Sivic,Andrew Zisserman +4 more
- 17 Jun 2007
TL;DR: To improve query performance, this work adds an efficient spatial verification stage to re-rank the results returned from the bag-of-words model and shows that this consistently improves search quality, though by less of a margin when the visual vocabulary is large.