Fine-Tuning CNN Image Retrieval with No Human Annotation

doi:10.1109/TPAMI.2018.2846566

Open Access•Journal Article•10.1109/TPAMI.2018.2846566

Filip Radenovic, +2 more

- 01 Jul 2019

- IEEE Transactions on Pattern Analysis an...

- Vol. 41, Iss: 7, pp 1655-1668

1.1K

TL;DR: It is shown that both hard-positive and hard-negative examples, selected by exploiting the geometry and the camera positions available from the 3D models, enhance the performance of particular-object retrieval.

Citations

•Posted Content

Emerging Properties in Self-Supervised Vision Transformers

Mathilde Caron, +7 more

- 29 Apr 2021

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper asks whether self-supervised learning gives the Vision Transformer (ViT) new properties that stand out compared to convolutional networks (convnets). Beyond the fact that adapting self-supervised methods to this architecture works particularly well, the authors observe first that self-supervised ViT features contain explicit information about the semantic segmentation of an image, which does not emerge as clearly with supervised ViTs or with convnets.

2.5K

•Posted Content

Deep Learning for Person Re-identification: A Survey and Outlook

Mang Ye, +5 more

- 13 Jan 2020

- arXiv: Computer Vision and Pattern Recog...

TL;DR: A powerful AGW baseline is designed, achieving state-of-the-art or at least comparable performance on twelve datasets for four different Re-ID tasks, and a new evaluation metric (mINP) is introduced, indicating the cost of finding all the correct matches, which provides an additional criterion for evaluating a Re-ID system in real applications.

1.5K

•Proceedings Article•10.1109/CVPR.2019.00828

D2-Net: A Trainable CNN for Joint Description and Detection of Local Features

Mihai Dusmanu, +6 more

- 15 Jun 2019

TL;DR: This work proposes an approach where a single convolutional neural network plays a dual role: It is simultaneously a dense feature descriptor and a feature detector, and shows that this model can be trained using pixel correspondences extracted from readily available large-scale SfM reconstructions, without any further annotations.

1.1K

•Proceedings Article•10.1109/CVPR.2019.00521

Label Propagation for Deep Semi-Supervised Learning

Ahmet Iscen, +3 more

- 15 Jun 2019

TL;DR: This work employs a transductive label propagation method that is based on the manifold assumption to make predictions on the entire dataset and use these predictions to generate pseudo-labels for the unlabeled data and train a deep neural network.

609

•Proceedings Article•10.1109/ICCV.2019.00521

Learning With Average Precision: Training Image Retrieval With a Listwise Loss

Jerome Revaud, +3 more

- 01 Oct 2019

TL;DR: The authors propose to directly optimize the global mAP by leveraging recent advances in listwise loss formulations, using a histogram binning approximation that is differentiable and can therefore be used for end-to-end learning.

496

References

10.48550/arxiv.1409.1556

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

TL;DR: This study investigates the effect of convolutional network depth on image recognition accuracy, achieving significant improvements with 16-19 weight layers, and securing top places in the ImageNet Challenge 2014, with publicly available models for further research.

10.48550/arxiv.1509.06033

Deep Convolutional Features for Image Based Retrieval and Scene Categorization

Arsalan Mousavian, +1 more

TL;DR: This paper proposes an image retrieval and scene categorization approach using deep convolutional features from an earlier layer of a CNN, demonstrating superior performance on INRIA Holidays and SUN397 datasets with reduced computational cost and memory requirements.

•Book Chapter•10.1007/978-3-319-10599-4_25

Orientation Covariant Aggregation of Local Descriptors with Embeddings

Giorgos Tolias, +2 more

- 06 Sep 2014

TL;DR: Image search systems based on local descriptors typically achieve orientation invariance by aligning the patches on their dominant orientations, but this choice introduces too much invariance because it does not guarantee that the patches are rotated consistently.

•Proceedings Article•10.1109/CVPR.2011.5995373

Hello neighbor: Accurate object retrieval with k-reciprocal nearest neighbors

Danfeng Qin, +4 more

- 20 Jun 2011

TL;DR: This paper introduces a simple yet effective method to improve visual word based image retrieval based on an analysis of the k-reciprocal nearest neighbor structure in the image space and demonstrates a significant improvement over standard bag-of-words retrieval.

10.48550/arxiv.1310.1531

DeCAF: A Deep Convolutional Activation Feature for Generic Visual Recognition.

Jeff Donahue, +6 more

TL;DR: Researchers propose DeCAF, a deep convolutional activation feature, for generic visual recognition tasks, achieving state-of-the-art results on various challenges, including scene recognition, domain adaptation, and fine-grained recognition, with an open-source implementation and associated network parameters.

Related Papers (5)

Deep Residual Learning for Image Recognition

Object retrieval with large vocabularies and fast spatial matching

Distinctive Image Features from Scale-Invariant Keypoints

NetVLAD: CNN Architecture for Weakly Supervised Place Recognition

ImageNet Classification with Deep Convolutional Neural Networks