PatchmatchNet: Learned Multi-View Patchmatch Stereo

Open AccessPosted Content

PatchmatchNet: Learned Multi-View Patchmatch Stereo

- 02 Dec 2020

- arXiv: Computer Vision and Pattern Recog...

234

TL;DR: For the first time, an iterative multi-scale Patchmatch in an end-to-end trainable architecture is introduced and the Patchmatch core algorithm is improved with a novel and learned adaptive propagation and evaluation scheme for each iteration.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.48550/arXiv.2206.10092

BEVDepth: Acquisition of Reliable Depth for Multi-view 3D Object Detection

Yinhao Li, +7 more

- 21 Jun 2022

TL;DR: Without any bells and whistles, BEVDepth achieves the new state-of-the-art 60.0% NDS on the challenging nuScenes test set while maintaining high efﬁciency and for the first time, the performance gap between the camera and LiDAR is largely reduced within 10% N DS.

...read moreread less

366

Proceedings Article•10.1109/CVPR52688.2022.01264

Attention Concatenation Volume for Accurate and Efficient Stereo Matching

Gangwei Xu, +3 more

- 04 Mar 2022

TL;DR: A novel cost volume construction method which generates attention weights from correlation clues to suppress redundant information and enhance matching-related information in the concatenation volume is presented.

...read moreread less

122

•Journal Article•10.1609/aaai.v37i2.25234

BEVStereo: Enhancing Depth Estimation in Multi-View 3D Object Detection with Temporal Stereo

Yinhao Li, +4 more

- 26 Jun 2023

- Proceedings of the ... AAAI Conference o...

TL;DR: In this paper , the authors propose an effective method for creating temporal stereo by dynamically determining the center and range of the temporal stereo, the most confident center is found using the EM algorithm.

...read moreread less

116

•Proceedings Article•10.1109/cvpr52688.2022.00840

RayMVSNet: Learning Ray-based 1D Implicit Fields for Accurate Multi-View Stereo

01 Jun 2022

TL;DR: RayNet as discussed by the authors directly optimizes the depth value along each camera ray, mimicking the range (depth) finding of a laser scanner, which reduces the MVS problem to ray-based depth optimization which is much more light-weight than full cost volume optimization.

...read moreread less

113

Journal Article•10.1007/s11263-022-01697-3

Vis-MVSNet: Visibility-Aware Multi-view Stereo Network

Jingyang Zhang, +4 more

- 14 Oct 2022

- International Journal of Computer Vision

TL;DR: This paper explicitly infer and integrate the pixel-wise occlusion information in the MVS network via the matching uncertainty estimation, and jointly inferred with the pair-wise depth map, which is further used as weighting guidance during the multi-view cost volume fusion.

...read moreread less

107

...

Expand

References

•Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 01 Jan 2015

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

138.5K

•Proceedings Article•10.1109/CVPR.2017.106

Feature Pyramid Networks for Object Detection

Tsung-Yi Lin, +5 more

- 21 Jul 2017

TL;DR: This paper exploits the inherent multi-scale, pyramidal hierarchy of deep convolutional networks to construct feature pyramids with marginal extra cost and achieves state-of-the-art single-model results on the COCO detection benchmark without bells and whistles.

...read moreread less

29.5K

•Proceedings Article•10.3115/V1/D14-1179

Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation

Kyunghyun Cho, +8 more

- 01 Jan 2014

TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.

...read moreread less

28.6K

•Proceedings Article

PyTorch: An Imperative Style, High-Performance Deep Learning Library

Adam Paszke, +20 more

- 01 Jan 2019

TL;DR: This paper details the principles that drove the implementation of PyTorch and how they are reflected in its architecture, and explains how the careful and pragmatic implementation of the key components of its runtime enables them to work together to achieve compelling performance.

...read moreread less

10.3K

Proceedings Article•10.1109/CVPR.2016.445

Structure-from-Motion Revisited

Johannes L. Schonberger, +1 more

- 27 Jun 2016

TL;DR: This work proposes a new SfM technique that improves upon the state of the art to make a further step towards building a truly general-purpose pipeline.

...read moreread less

6.1K