LoFTR: Detector-Free Local Feature Matching with Transformers

doi:10.1109/CVPR46437.2021.00881

Open Access•Proceedings Article•10.1109/CVPR46437.2021.00881

Jiaming Sun, +4 more

- 01 Apr 2021

- pp 8922-8931

1.3K

TL;DR: LoFTR uses self- and cross-attention layers in a Transformer to obtain feature descriptors that are conditioned on both images, which enables the method to produce dense matches in low-texture areas.
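
To illustrate what "conditioned on both images" means in the summary above, here is a minimal sketch of one self-attention step followed by one cross-attention step over two small feature sets. It is an illustration under stated assumptions, not the paper's implementation: it uses plain softmax attention with no learned projections, and the names `attend`, `feats_a`, and `feats_b` are hypothetical.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def attend(queries, sources):
    """Scaled dot-product attention with a residual connection.

    Assumption: no learned query/key/value projections, so each source
    vector serves as both key and value.
    """
    d = len(queries[0])
    out = []
    for q in queries:
        scores = [sum(a * b for a, b in zip(q, s)) / math.sqrt(d) for s in sources]
        weights = softmax(scores)
        mixed = [sum(w * s[c] for w, s in zip(weights, sources)) for c in range(d)]
        out.append([qc + mc for qc, mc in zip(q, mixed)])
    return out

# Toy per-location features for two images (hypothetical values).
feats_a = [[1.0, 0.0], [0.0, 1.0]]
feats_b = [[0.9, 0.1], [0.2, 0.8], [0.5, 0.5]]

# Self-attention: each image attends to its own features.
self_a = attend(feats_a, feats_a)
self_b = attend(feats_b, feats_b)

# Cross-attention: each image attends to the other image's features,
# so the resulting descriptors depend on both images.
cond_a = attend(self_a, self_b)
cond_b = attend(self_b, self_a)
```

After the cross-attention step, `cond_a` changes whenever `feats_b` changes, which is the property the summary refers to.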

Figures

Figure 1: Comparison between the proposed method LoFTR and the detector-based method SuperGlue [37]. This example demonstrates that LoFTR is capable of finding correspondences on the texture-less wall and the floor with repetitive patterns, where detector-based methods struggle to find repeatable interest points.

Table 3: Evaluation on MegaDepth [21] for outdoor pose estimation. Matching with LoFTR results in better performance in the outdoor pose estimation task.

Table 1: Homography estimation on HPatches [7]. The AUC of the corner error in percentage is reported. The suffix DS indicates the differentiable matching with dual-softmax.

Table 2: Evaluation on ScanNet [7] for indoor pose estimation. The AUC of the pose error in percentage is reported. LoFTR improves the state-of-the-art methods by a large margin. † indicates models trained on MegaDepth. The suffixes OT and DS indicate differentiable matching with optimal transport and dual-softmax, respectively.

Table 4: Visual localization evaluation on the Aachen Day-Night [54] benchmark v1.1. The evaluation results on both the local feature evaluation track and the full visual localization track are reported.

Table 5: Visual localization evaluation on the InLoc [41] benchmark.
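
The DS suffix in Tables 1 and 2 refers to dual-softmax matching. The sketch below shows one common formulation, given here as an assumption rather than the paper's exact code: a similarity matrix is normalized with a softmax along both rows and columns, the two are multiplied into a confidence matrix, and mutual nearest neighbours above a threshold are kept. The function name and the `temperature` and `threshold` values are hypothetical.

```python
import math

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def dual_softmax_matches(desc_a, desc_b, temperature=0.1, threshold=0.2):
    """Return (i, j, confidence) for mutual best matches above threshold.

    Assumptions: dot-product similarity, and illustrative values for
    temperature and threshold.
    """
    n_a, n_b = len(desc_a), len(desc_b)
    sim = [[sum(x * y for x, y in zip(a, b)) / temperature for b in desc_b]
           for a in desc_a]
    # Softmax over each row (A -> B) and over each column (B -> A).
    row_sm = [softmax(row) for row in sim]
    col_sm = [softmax([sim[i][j] for i in range(n_a)]) for j in range(n_b)]
    conf = [[row_sm[i][j] * col_sm[j][i] for j in range(n_b)] for i in range(n_a)]

    matches = []
    for i in range(n_a):
        j = max(range(n_b), key=lambda c: conf[i][c])       # best column for row i
        best_i = max(range(n_a), key=lambda r: conf[r][j])  # best row for that column
        if best_i == i and conf[i][j] > threshold:
            matches.append((i, j, conf[i][j]))
    return matches

# Toy descriptors (hypothetical values).
desc_a = [[1.0, 0.0], [0.0, 1.0]]
desc_b = [[0.0, 1.0], [1.0, 0.0], [0.7, 0.7]]
matches = dual_softmax_matches(desc_a, desc_b)
pairs = [(i, j) for i, j, _ in matches]  # [(0, 1), (1, 0)]
```

Because every step is a softmax or a product, the confidence matrix is differentiable with respect to the descriptors; only the final selection of mutual nearest neighbours is discrete.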

Citations

Proceedings Article•10.1109/CVPR52688.2022.00116

TransFusion: Robust LiDAR-Camera Fusion for 3D Object Detection with Transformers

Xuyang Bai, +6 more

- 22 Mar 2022

TL;DR: TransFusion is a robust LiDAR-camera fusion method whose soft-association mechanism handles inferior image conditions; it achieves state-of-the-art performance on large-scale datasets and is extended to the 3D tracking task.

398

•Proceedings Article•10.1109/cvpr52688.2022.01086

Geometric Transformer for Fast and Robust Point Cloud Registration

01 Jun 2022

TL;DR: GeoTransformer learns geometric features for robust superpoint matching by encoding pair-wise distances and triplet-wise angles, which makes it robust in low-overlap cases and invariant to rigid transformation.

240

Journal Article•10.48550/arXiv.2306.16928

One-2-3-45: Any Single Image to 3D Mesh in 45 Seconds without Per-Shape Optimization

Minghua Liu, +6 more

- 29 Jun 2023

- arXiv.org

TL;DR: One-2-3-45 uses a view-conditioned 2D diffusion model, Zero123, to generate multi-view images for the input view, and then lifts them to 3D space.

223

•Posted Content

GMFlow: Learning Optical Flow via Global Matching

Haofei Xu, +4 more

- 26 Nov 2021

- arXiv: Computer Vision and Pattern Recog...

TL;DR: GMFlow consists of three main components: a customized Transformer for feature enhancement, a correlation and softmax layer for global feature matching, and a self-attention layer for flow propagation.

194

References

•Proceedings Article

COTR: Correspondence Transformer for Matching Across Images

Wei Jiang, +4 more

- 25 Mar 2021

TL;DR: COTR is a deep-neural-network framework for finding correspondences in images: given two images and a query point in one of them, it finds the corresponding point in the other.

184

Preprint•10.48550/arxiv.1810.10510

Neighbourhood Consensus Networks

Ignacio Rocco, +5 more

- 01 Jan 2018

TL;DR: This paper proposes an end-to-end trainable convolutional neural network that finds reliable dense correspondences between a pair of images based on neighbourhood consensus patterns.

164

•Posted Content

ContextDesc: Local Descriptor Augmentation with Cross-Modality Context

Zixin Luo, +7 more

- 08 Apr 2019

- arXiv: Computer Vision and Pattern Recog...

TL;DR: ContextDesc is a unified learning framework that aggregates cross-modality contextual information, namely visual context from high-level image representations and geometric context from the 2D keypoint distribution; it also introduces an N-pair loss that avoids empirical hyper-parameter search and improves convergence.

163

•Proceedings Article•10.1109/CVPR46437.2021.00566

Learning Accurate Dense Correspondences and When to Trust Them

Prune Truong, +3 more

- 05 Jan 2021

TL;DR: PDCNet is a probabilistic approach that estimates a dense flow field relating two images, together with a robust pixel-wise confidence map indicating the reliability and accuracy of the prediction.

160

•Proceedings Article•10.1109/CVPR42600.2020.00629

GLU-Net: Global-Local Universal Network for Dense Flow and Correspondences

Prune Truong, +2 more

- 14 Jun 2020

TL;DR: GLU-Net is a universal network architecture that is directly applicable to several dense correspondence problems, achieving both high accuracy and robustness to large displacements through the combined use of global and local correlation layers.

154