Image Matching Across Wide Baselines: From Paper to Practice

doi:10.1007/S11263-020-01385-0

Open Access•Journal Article•10.1007/S11263-020-01385-0

Yuhe Jin, +6 more

- 01 Feb 2021

- International Journal of Computer Vision

- Vol. 129, Iss: 2, pp 517-547

275

TL;DR: This paper introduces a comprehensive benchmark for local features and robust estimation algorithms that uses the accuracy of the reconstructed camera pose, the downstream task, as its primary metric, and it provides the basis for the Image Matching Challenge.

Abstract: We introduce a comprehensive benchmark for local features and robust estimation algorithms, focusing on the downstream task—the accuracy of the reconstructed camera pose—as our primary metric. Our pipeline’s modular structure allows easy integration, configuration, and combination of different methods and heuristics. This is demonstrated by embedding dozens of popular algorithms and evaluating them, from seminal works to the cutting edge of machine learning research. We show that with proper settings, classical solutions may still outperform the perceived state of the art. Besides establishing the actual state of the art, the conducted experiments reveal unexpected properties of structure from motion pipelines that can help improve their performance, for both algorithmic and learned methods. Data and code are online ( https://github.com/ubc-vision/image-matching-benchmark ), providing an easy-to-use and flexible framework for the benchmarking of local features and robust estimation methods, both alongside and against top-performing methods. This work provides a basis for the Image Matching Challenge ( https://image-matching-challenge.github.io ).
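To make the primary metric concrete, the sketch below shows one common way to score the accuracy of an estimated relative camera pose: the angular error of the rotation and of the translation direction, followed by the fraction of image pairs that fall under a set of error thresholds. This is a minimal illustration, not the benchmark's own code. The function names, the choice of taking the larger of the two angles, and the thresholds in the usage lines are all assumptions made for this example.

```python
import math


def rot_z(angle_deg):
    """Hypothetical helper: rotation about the z axis as a 3x3 nested list."""
    a = math.radians(angle_deg)
    return [[math.cos(a), -math.sin(a), 0.0],
            [math.sin(a), math.cos(a), 0.0],
            [0.0, 0.0, 1.0]]


def rotation_angle_deg(R_est, R_gt):
    """Angle in degrees of the relative rotation R_est^T R_gt."""
    # trace(R_est^T R_gt) equals the sum of element-wise products.
    trace = sum(R_est[i][j] * R_gt[i][j] for i in range(3) for j in range(3))
    # Clamp to [-1, 1] to guard against floating-point drift before acos.
    cos_angle = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_angle))


def translation_angle_deg(t_est, t_gt):
    """Angle in degrees between translation directions (scale is ignored)."""
    dot = sum(a * b for a, b in zip(t_est, t_gt))
    norm = math.sqrt(sum(a * a for a in t_est)) * math.sqrt(sum(b * b for b in t_gt))
    if norm == 0.0:
        raise ValueError("translation vectors must be non-zero")
    cos_angle = max(-1.0, min(1.0, dot / norm))
    return math.degrees(math.acos(cos_angle))


def pose_error_deg(R_est, t_est, R_gt, t_gt):
    """Assumed convention: a pose is as wrong as its worse component."""
    return max(rotation_angle_deg(R_est, R_gt),
               translation_angle_deg(t_est, t_gt))


def mean_accuracy(errors, thresholds):
    """Average, over thresholds, of the fraction of errors at or below each one."""
    per_threshold = [sum(e <= th for e in errors) / len(errors) for th in thresholds]
    return sum(per_threshold) / len(per_threshold)


# Usage: one pair with a 10 degree rotation error and a 45 degree translation error.
identity = rot_z(0.0)
err = pose_error_deg(rot_z(10.0), (1.0, 0.0, 0.0), identity, (1.0, 1.0, 0.0))
score = mean_accuracy([1.0, 7.0, 12.0], thresholds=[5.0, 10.0])
```

Only the direction of the translation is compared because the scale of the baseline cannot be recovered from two views alone.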

Citations

Learning to Detect Geometric Structures from Images for 3D Parsing

Yichao Zhou

- 01 Jan 2020

TL;DR: This work proposes a method to extract high-level geometric structures from images, such as lines, junctions, planes, vanishing points, and symmetry, and to use them for 3D parsing.

•Proceedings Article•10.1109/icassp49357.2023.10096730

Exploring Progressive Hybrid-Degraded Image Processing for Homography Estimation

Yijun Lin, +3 more

- 04 Jun 2023

TL;DR: Zhang et al. as discussed by the authors proposed an environmental epistemic model (EEM) to build task-specific prior knowledge of uncertain environments, which can be updated online and used to guide the agent's exploration and exploitation.

•Journal Article•10.5194/ISPRS-ARCHIVES-XLIII-B3-2020-495-2020

Stacked local feature detector for hyperspectral image

Z. Yan, +1 more

- 21 Aug 2020

- The International Archives of the Photog...

TL;DR: This method, named the stacked local feature detector (HSI-SFD), stacks the local feature points detected in every single spectral band to obtain more reliable and robust local features.

Journal Article•10.1080/07038992.2022.2052032

Study on Elimination Algorithms for Line Segment Mismatches

fgfxgvbf fdgvgv

- 01 Apr 2022

- Canadian Journal of Remote Sensing

TL;DR: This paper systematically studies elimination algorithms for line segment mismatches by combining two transformation models (i.e., affine and homography) with two M-estimators or two sample consensus methods (i.e., random sample consensus, RANSAC, and least median of squares, LMedS).

Journal Article•10.1109/cvpr52733.2024.01381

A Subspace-Constrained Tyler's Estimator and its Applications to Structure from Motion

Feng Yu, +2 more

- 16 Jun 2024

References

Journal Article•10.1023/B:VISI.0000029664.99615.94

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004

- International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

59.3K

Journal Article•10.1145/358669.358692

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

Martin A. Fischler, +1 more

- 01 Jun 1981

- Communications of The ACM

TL;DR: New results are derived on the minimum number of landmarks needed to obtain a solution, and algorithms are presented for computing these minimum-landmark solutions in closed form that provide the basis for an automatic system that can solve the Location Determination Problem under difficult viewing.

27.9K

•Book

Multiple view geometry in computer vision

Richard Hartley, +1 more

- 01 Jan 2000

TL;DR: In this article, the authors provide comprehensive background material and explain how to apply the methods and implement the algorithms directly in a unified framework, including geometric principles and how to represent objects algebraically so they can be computed and applied.

20.1K

Proceedings Article•10.1109/CVPR.2012.6248074

Are we ready for autonomous driving? The KITTI vision benchmark suite

Andreas Geiger, +2 more

- 16 Jun 2012

TL;DR: The autonomous driving platform is used to develop novel challenging benchmarks for the tasks of stereo, optical flow, visual odometry/SLAM and 3D object detection, revealing that methods ranking high on established datasets such as Middlebury perform below average when being moved outside the laboratory to the real world.

16.3K

Related Papers (5)

SuperPoint: Self-Supervised Interest Point Detection and Description

Distinctive Image Features from Scale-Invariant Keypoints

Random sample consensus: a paradigm for model fitting with applications to image analysis and automated cartography

ORB: An efficient alternative to SIFT or SURF

Structure-from-Motion Revisited