Image Matching Across Wide Baselines: From Paper to Practice
Yuhe Jin,Dmytro Mishkin,Anastasiia Mishchuk,Jiri Matas,Pascal Fua,Kwang Moo Yi,Eduard Trulls +6 more
275
TL;DR: The Image Matching Challenge as mentioned in this paper provides a comprehensive benchmark for local features and robust estimation algorithms, focusing on the downstream task, the accuracy of the reconstructed camera pose, as the primary metric.
read more
Abstract: We introduce a comprehensive benchmark for local features and robust estimation algorithms, focusing on the downstream task—the accuracy of the reconstructed camera pose—as our primary metric. Our pipeline’s modular structure allows easy integration, configuration, and combination of different methods and heuristics. This is demonstrated by embedding dozens of popular algorithms and evaluating them, from seminal works to the cutting edge of machine learning research. We show that with proper settings, classical solutions may still outperform the perceived state of the art. Besides establishing the actual state of the art, the conducted experiments reveal unexpected properties of structure from motion pipelines that can help improve their performance, for both algorithmic and learned methods. Data and code are online (
https://github.com/ubc-vision/image-matching-benchmark
), providing an easy-to-use and flexible framework for the benchmarking of local features and robust estimation methods, both alongside and against top-performing methods. This work provides a basis for the Image Matching Challenge (
https://image-matching-challenge.github.io
).
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Learning to Detect Geometric Structures from Images for 3D Parsing
Yichao Zhou
- 01 Jan 2020
TL;DR: Zhou et al. as mentioned in this paper proposed a method to extract high-level geometric structures from images and use them for 3D parsing, such as lines, junctions, planes, vanishing points, and symmetry.
Exploring Progressive Hybrid-Degraded Image Processing for Homography Estimation
Yijun Lin,Xingzhe Su,Feng Wu,Junsuo Zhao +3 more
- 04 Jun 2023
TL;DR: Zhang et al. as discussed by the authors proposed an environmental epistemic model (EEM) to build task-specific prior knowledge of uncertain environments, which can be updated online and used to guide the agent's exploration and exploitation.
Stacked local feature detector for hyperspectral image
Z. Yan,Zhaocong Wu +1 more
TL;DR: This method, which is named stacked local feature detector (HSI-SFD), stack all local feature points detected from every single spectral band to lead to more reliable and robust local features.
Study on Elimination Algorithms for Line Segment Mismatches
TL;DR: Zhang et al. as mentioned in this paper systematically studied elimination algorithms of line segment mismatches by combining two transformation models (i.e., affine and homography) with 2 M-estimators or 2 sample consensus methods (i., random sample consensus, RANSAC, and least median of squares, LMedS).
A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion *
Feng Yu,Teng Zhang,Gilad Lerman +2 more
- 16 Jun 2024
References
•Journal Article
Scikit-learn: Machine Learning in Python
Fabian Pedregosa,Gaël Varoquaux,Alexandre Gramfort,Vincent Michel,Bertrand Thirion,Olivier Grisel,Mathieu Blondel,Peter Prettenhofer,Ron Weiss,Vincent Dubourg,Jake Vanderplas,Alexandre Passos,David Cournapeau,Matthieu Brucher,Matthieu Perrot,Edouard Duchesnay +15 more
TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
Distinctive Image Features from Scale-Invariant Keypoints
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography
TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.
•Book
Multiple view geometry in computer vision
Richard Hartley,Andrew Zisserman +1 more
- 01 Jan 2000
TL;DR: In this article, the authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly in a unified framework, including geometric principles and how to represent objects algebraically so they can be computed and applied.
20.1K
Are we ready for autonomous driving? The KITTI vision benchmark suite
Andreas Geiger,Philip Lenz,Raquel Urtasun +2 more
- 16 Jun 2012
TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.
Related Papers (5)
Ethan Rublee,Vincent Rabaud,Kurt Konolige,Gary Bradski +3 more
- 06 Nov 2011
Johannes L. Schonberger,Jan-Michael Frahm +1 more
- 27 Jun 2016