From Coarse to Fine: Robust Hierarchical Localization at Large Scale

Open AccessPosted Content

From Coarse to Fine: Robust Hierarchical Localization at Large Scale

- 09 Dec 2018

- arXiv: Computer Vision and Pattern Recog...

528

TL;DR: HF-Net as discussed by the authors proposes a hierarchical approach based on a monolithic CNN that simultaneously predicts local features and global descriptors for accurate 6-DoF localization, which achieves remarkable localization robustness across large variations of appearance and sets a new state-of-theart on two challenging benchmarks for large-scale localization.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.5194/isprs-archives-xlviii-1-w2-2023-1059-2023

View graph construction for large-scale uav images: an evaluation of state-of-the-art methods

J. Liu, +5 more

- 13 Dec 2023

- The international archives of the photog...

TL;DR: The test results demonstrate that the optimal method VLAD with HNSW can speed up about 100 times in finding matching candidate subset and a view graph that guides scene partition and sub-scene reconstruction in parallel SfM can be created by the optimal method.

...read moreread less

Journal Article•10.3390/app14062267

Survey of Deep Learning-Based Methods for FMCW Radar Odometry and Ego-Localization

Marvin Brune, +2 more

- 08 Mar 2024

- Applied Sciences

TL;DR: Survey of deep learning-based methods for FMCW radar odometry and ego-localization focuses on the challenges of odometry and loop closure detection using FMCW radar sensors. The paper emphasizes the importance of these tasks in autonomous driving and introduces deep learning approaches to address them.

...read moreread less

Journal Article•10.48550/arxiv.2410.10799

Towards Foundation Models for 3D Vision: How Close Are We?

Yiming Zuo, +5 more

- 14 Oct 2024

TL;DR: This study evaluates 3D visual understanding in foundation models, revealing VLMs perform poorly, while specialized models are accurate but not robust, highlighting the need for improved 3D vision capabilities in AI systems.

...read moreread less

•Proceedings Article•10.5220/0010834700003124

Evaluation of Long-term Deep Visual Place Recognition

Farid Alijani, +5 more

- 01 Jan 2022

TL;DR: This paper studies recent visual place recognition and image retrieval methods and utilizes them to conduct extensive and comprehensive experiments on two diverse and large long-term indoor and outdoor robot navigation datasets, e.g., COLD and Oxford Radar RobotCar.

...read moreread less

Journal Article•10.1109/icra57147.2024.10611102

Enhancing Visual Place Recognition with Multi-modal Features and Time-constrained Graph Attention Aggregation

Zhuo Wang, +5 more

- 13 May 2024

TL;DR: This paper proposes a multi-modal visual place recognition method that incorporates depth information and a time-constrained graph attention aggregation to enhance robustness against appearance and perspective changes in autonomous driving and robotic navigation.

...read moreread less

...

Expand

References

Proceedings Article•10.1109/CVPR.2009.5206848

ImageNet: A large-scale hierarchical image database

Jia Deng, +5 more

- 20 Jun 2009

TL;DR: A new database called “ImageNet” is introduced, a large-scale ontology of images built upon the backbone of the WordNet structure, much larger in scale and diversity and much more accurate than the current image datasets.

...read moreread less

75.9K

Journal Article•10.1023/B:VISI.0000029664.99615.94

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004

- International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

...read moreread less

59.3K

Journal Article•10.1145/358669.358692

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Martin A. Fischler, +1 more

- 01 Jun 1981

- Communications of The ACM

TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.

...read moreread less

27.9K

•Posted Content

Distilling the Knowledge in a Neural Network

Geoffrey E. Hinton, +2 more

- 09 Mar 2015

- arXiv: Machine Learning

TL;DR: This work shows that it can significantly improve the acoustic model of a heavily used commercial system by distilling the knowledge in an ensemble of models into a single model and introduces a new type of ensemble composed of one or more full models and many specialist models which learn to distinguish fine-grained classes that the full models confuse.

...read moreread less

21.2K

•Proceedings Article•10.1109/CVPR.2018.00474

MobileNetV2: Inverted Residuals and Linear Bottlenecks

Mark Sandler, +4 more

- 18 Jun 2018

TL;DR: MobileNetV2 as mentioned in this paper is based on an inverted residual structure where the shortcut connections are between the thin bottleneck layers and intermediate expansion layer uses lightweight depthwise convolutions to filter features as a source of non-linearity.

...read moreread less

19.4K