Image Matching Across Wide Baselines: From Paper to Practice

doi:10.1007/S11263-020-01385-0

Open Access•Journal Article•10.1007/S11263-020-01385-0

Yuhe Jin, +6 more

- 01 Feb 2021

- International Journal of Computer Vision

- Vol. 129, Iss: 2, pp 517-547

275

TL;DR: This paper introduces a comprehensive benchmark for local features and robust estimation algorithms, using the accuracy of the reconstructed camera pose (the downstream task) as the primary metric, and provides the basis for the Image Matching Challenge.

Abstract: We introduce a comprehensive benchmark for local features and robust estimation algorithms, focusing on the downstream task—the accuracy of the reconstructed camera pose—as our primary metric. Our pipeline’s modular structure allows easy integration, configuration, and combination of different methods and heuristics. This is demonstrated by embedding dozens of popular algorithms and evaluating them, from seminal works to the cutting edge of machine learning research. We show that with proper settings, classical solutions may still outperform the perceived state of the art. Besides establishing the actual state of the art, the conducted experiments reveal unexpected properties of structure from motion pipelines that can help improve their performance, for both algorithmic and learned methods. Data and code are online ( https://github.com/ubc-vision/image-matching-benchmark ), providing an easy-to-use and flexible framework for the benchmarking of local features and robust estimation methods, both alongside and against top-performing methods. This work provides a basis for the Image Matching Challenge ( https://image-matching-challenge.github.io ).
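The abstract names the accuracy of the reconstructed camera pose as the primary metric but does not define it on this page. As a rough illustration only, the sketch below computes one common form of pose error: the angle of the residual rotation and the angle between translation directions (scale cannot be recovered from two views, so only direction is compared). The function names, the choice to take the maximum of the two angles, and the toy poses are assumptions made for this example, not the benchmark's actual implementation.

```python
import numpy as np


def rotation_error_deg(R_est, R_gt):
    """Angle in degrees of the residual rotation R_est^T @ R_gt."""
    cos_angle = (np.trace(R_est.T @ R_gt) - 1.0) / 2.0
    # Clip guards against values slightly outside [-1, 1] from rounding.
    return np.degrees(np.arccos(np.clip(cos_angle, -1.0, 1.0)))


def translation_error_deg(t_est, t_gt):
    """Angle in degrees between translation directions (scale is ignored)."""
    t_est = t_est / np.linalg.norm(t_est)
    t_gt = t_gt / np.linalg.norm(t_gt)
    cos_angle = np.clip(np.dot(t_est, t_gt), -1.0, 1.0)
    return np.degrees(np.arccos(cos_angle))


def pose_error_deg(R_est, t_est, R_gt, t_gt):
    """Assumed summary: the larger of the rotation and translation errors."""
    return max(rotation_error_deg(R_est, R_gt),
               translation_error_deg(t_est, t_gt))


def rot_z(deg):
    """Rotation about the z axis by `deg` degrees (toy-data helper)."""
    a = np.radians(deg)
    return np.array([[np.cos(a), -np.sin(a), 0.0],
                     [np.sin(a),  np.cos(a), 0.0],
                     [0.0,        0.0,       1.0]])


# Hypothetical ground-truth and estimated poses.
R_gt = np.eye(3)
t_gt = np.array([1.0, 0.0, 0.0])
R_est = rot_z(5.0)                   # off by 5 degrees in rotation
t_est = np.array([2.0, 0.0, 0.0])    # same direction, different scale

err = pose_error_deg(R_est, t_est, R_gt, t_gt)
print(f"pose error: {err:.2f} deg")  # prints: pose error: 5.00 deg
```

A benchmark would typically aggregate such per-pair errors over many image pairs, for instance as the fraction of pairs below an angular threshold; the exact aggregation used by the authors is not stated here.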

Citations

Journal Article•10.1007/978-3-031-72980-5_5

GMM-IKRS: Gaussian Mixture Models for Interpretable Keypoint Refinement and Scoring

Emanuele Santellani, +5 more

- 01 Jan 2024

- Lecture Notes in Computer Science

Journal Article•10.1007/978-3-031-72998-0_24

StereoGlue: Robust Estimation with Single-Point Solvers

Dániel Baráth, +5 more

- 29 Sep 2024

- Lecture Notes in Computer Science

Journal Article•10.1109/tim.2023.3273211

Illumination-insensitive Binary Descriptor for Visual Measurement Based on Local Inter-patch Invariance

- 01 Jan 2023

- IEEE Transactions on Instrumentation and...

TL;DR: This paper proposes an illumination-insensitive binary (IIB) descriptor that leverages the local inter-patch invariance exhibited at multiple spatial granularities to cope with unfavorable illumination variations.

Journal Article•10.1109/tip.2021.3134456

EMDQ: Removal of Image Feature Mismatches in Real-time

- 01 Jan 2022

- IEEE Transactions on Image Processing

TL;DR: This paper proposes an algorithm based on a re-weighting and 1-point RANSAC strategy (R1P-RNSC), which operates under the assumption that a non-rigid deformation can be approximately represented by multiple rigid transformations.

Journal Article•10.1007/s10489-024-05418-w

AFSRNet: learning local descriptors with adaptive multi-scale feature fusion and symmetric regularization

Dong Li, +2 more

- 20 Apr 2024
