Learning Modality-Specific Representations for Visible-Infrared Person Re-Identification

doi:10.1109/TIP.2019.2928126

Journal Article10.1109/TIP.2019.2928126

Learning Modality-Specific Representations for Visible-Infrared Person Re-Identification

Zhanxiang Feng, +2 more

- 01 Jan 2020

- IEEE Transactions on Image Processing

- Vol. 29, pp 579-590

278

TL;DR: This paper proposes a novel framework that employs modality-specific networks to tackle with the heterogeneous matching problem and demonstrates that the MSR effectively improves the performance of deep networks on VI-REID and remarkably outperforms the state-of-the-art methods.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Posted Content

Deep Learning for Person Re-identification: A Survey and Outlook

Mang Ye, +5 more

- 13 Jan 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: A powerful AGW baseline is designed, achieving state-of-the-art or at least comparable performance on twelve datasets for four different Re-ID tasks, and a new evaluation metric (mINP) is introduced, indicating the cost for finding all the correct matches, which provides an additional criteria to evaluate the Re- ID system for real applications.

...read moreread less

1.5K

•Book Chapter•10.1007/978-3-030-58520-4_14

Dynamic Dual-Attentive Aggregation Learning for Visible-Infrared Person Re-identification

Mang Ye, +4 more

- 23 Aug 2020

TL;DR: An intra-modality weighted-part attention module to extract discriminative part-aggregated features, by imposing the domain knowledge on the part relationship mining, and a parameter-free dynamic dual aggregation learning strategy to adaptively integrate the two components in a progressive joint training manner.

...read moreread less

443

•Journal Article•10.1609/AAAI.V34I04.5891

Infrared-Visible Cross-Modal Person Re-Identification with an X Modality

Diangang Li, +3 more

- 03 Apr 2020

TL;DR: An X-Infrared-Visible (XIV) ReID cross-modal learning framework is proposed, which achieves an absolute gain of over 7% in terms of rank 1 and mAP even compared with the latest state-of-the-art methods.

...read moreread less

413

•Proceedings Article•10.1109/CVPR42600.2020.01339

Cross-Modality Person Re-Identification With Shared-Specific Feature Transfer

Yan Lu, +6 more

- 14 Jun 2020

TL;DR: Wang et al. as mentioned in this paper proposed a cross-modality shared-specific feature transfer algorithm (termed cm-SSFT) to explore the potential of both the modality-shared information and the modal-specific characteristics to boost the reID performance.

...read moreread less

399

Journal Article•10.1016/J.INFFUS.2021.02.012

A review of multimodal image matching: Methods and applications

Xingyu Jiang, +4 more

- 01 Sep 2021

- Information Fusion

TL;DR: This survey provides a comprehensive review of multimodal image matching methods from handcrafted to deep methods for each research field according to their imaging nature, including medical, remote sensing and computer vision.

...read moreread less

372

...

Expand

References

•Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: In this article, the authors proposed a residual learning framework to ease the training of networks that are substantially deeper than those used previously, which won the 1st place on the ILSVRC 2015 classification task.

...read moreread less

198.7K

•Posted Content

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 10 Dec 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work presents a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and provides comprehensive empirical evidence showing that these residual networks are easier to optimize, and can gain accuracy from considerably increased depth.

...read moreread less

117.9K

•Proceedings Article•10.1109/CVPR.2005.177

Histograms of oriented gradients for human detection

Navneet Dalal, +1 more

- 20 Jun 2005

TL;DR: It is shown experimentally that grids of histograms of oriented gradient (HOG) descriptors significantly outperform existing feature sets for human detection, and the influence of each stage of the computation on performance is studied.

...read moreread less

36.7K

Proceedings Article•10.1145/2647868.2654889

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 03 Nov 2014

TL;DR: Caffe provides multimedia scientists and practitioners with a clean and modifiable framework for state-of-the-art deep learning algorithms and a collection of reference models for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

...read moreread less

14.9K

•Posted Content

Caffe: Convolutional Architecture for Fast Feature Embedding

Yangqing Jia, +7 more

- 20 Jun 2014

- arXiv: Computer Vision and Pattern Recog...

TL;DR: Caffe as discussed by the authors is a BSD-licensed C++ library with Python and MATLAB bindings for training and deploying general-purpose convolutional neural networks and other deep models efficiently on commodity architectures.

...read moreread less

13.1K