Image Matching Across Wide Baselines: From Paper to Practice
Yuhe Jin,Dmytro Mishkin,Anastasiia Mishchuk,Jiri Matas,Pascal Fua,Kwang Moo Yi,Eduard Trulls +6 more
275
TL;DR: The Image Matching Challenge as mentioned in this paper provides a comprehensive benchmark for local features and robust estimation algorithms, focusing on the downstream task, the accuracy of the reconstructed camera pose, as the primary metric.
read more
Abstract: We introduce a comprehensive benchmark for local features and robust estimation algorithms, focusing on the downstream task—the accuracy of the reconstructed camera pose—as our primary metric. Our pipeline’s modular structure allows easy integration, configuration, and combination of different methods and heuristics. This is demonstrated by embedding dozens of popular algorithms and evaluating them, from seminal works to the cutting edge of machine learning research. We show that with proper settings, classical solutions may still outperform the perceived state of the art. Besides establishing the actual state of the art, the conducted experiments reveal unexpected properties of structure from motion pipelines that can help improve their performance, for both algorithmic and learned methods. Data and code are online (
https://github.com/ubc-vision/image-matching-benchmark
), providing an easy-to-use and flexible framework for the benchmarking of local features and robust estimation methods, both alongside and against top-performing methods. This work provides a basis for the Image Matching Challenge (
https://image-matching-challenge.github.io
).
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
GMM-IKRS: Gaussian Mixture Models for Interpretable Keypoint Refinement and Scoring
Emanuele Santellani,Martin Zach,Christian Sormann,Mattia Rossi,Andreas Kühn,Friedrich Fraundorfer +5 more
StereoGlue: Robust Estimation with Single-Point Solvers
Dániel Baráth,Dmytro Mishkin,Luca Cavalli,Paul-Edouard Sarlin,Petr Hruby,Marc Pollefeys +5 more
Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance
TL;DR: In this paper , an illumination-insensitive binary (IIB) descriptor was proposed by leveraging the local inter-patch invariance exhibited in multiple spatial granularities to deal with unfavorable illumination variations.
EMDQ: Removal of Image Feature Mismatches in Real-time
TL;DR: Zhang et al. as mentioned in this paper proposed an algorithm based on the re-weighting and 1-point RANSAC strategy (R1P-RNSC), which operates under the assumption that a non-rigid deformation can be approximately represented by multiple rigid transformations.
AFSRNet: learning local descriptors with adaptive multi-scale feature fusion and symmetric regularization
Dong Li,Haowen Liang,Kin-Man Lam +2 more
- 20 Apr 2024
References
•Journal Article
Scikit-learn: Machine Learning in Python
Fabian Pedregosa,Gaël Varoquaux,Alexandre Gramfort,Vincent Michel,Bertrand Thirion,Olivier Grisel,Mathieu Blondel,Peter Prettenhofer,Ron Weiss,Vincent Dubourg,Jake Vanderplas,Alexandre Passos,David Cournapeau,Matthieu Brucher,Matthieu Perrot,Edouard Duchesnay +15 more
TL;DR: Scikit-learn is a Python module integrating a wide range of state-of-the-art machine learning algorithms for medium-scale supervised and unsupervised problems, focusing on bringing machine learning to non-specialists using a general-purpose high-level language.
Distinctive Image Features from Scale-Invariant Keypoints
TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.
Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography
TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.
•Book
Multiple view geometry in computer vision
Richard Hartley,Andrew Zisserman +1 more
- 01 Jan 2000
TL;DR: In this article, the authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly in a unified framework, including geometric principles and how to represent objects algebraically so they can be computed and applied.
20.1K
Are we ready for autonomous driving? The KITTI vision benchmark suite
Andreas Geiger,Philip Lenz,Raquel Urtasun +2 more
- 16 Jun 2012
TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.
Related Papers (5)
Ethan Rublee,Vincent Rabaud,Kurt Konolige,Gary Bradski +3 more
- 06 Nov 2011
Johannes L. Schonberger,Jan-Michael Frahm +1 more
- 27 Jun 2016