Learning Deep Binary Descriptor with Multi-Quantization

doi:10.1109/TPAMI.2018.2858760

Journal Article•10.1109/TPAMI.2018.2858760

Yueqi Duan, +4 more

- 01 Aug 2019

- IEEE Transactions on Pattern Analysis an...

- Vol. 41, Iss: 8, pp 1924-1938

50

TL;DR: A K-Autoencoders (KAEs) network jointly learns the parameters of the feature extractor and the binarization functions under a deep learning framework, so that discriminative binary descriptors can be obtained with a fine-grained multi-quantization.

Abstract: In this paper, we propose an unsupervised feature learning method called deep binary descriptor with multi-quantization (DBD-MQ) for visual analysis. Existing learning-based binary descriptors such as the compact binary face descriptor (CBFD) and DeepBit apply the rigid sign function for binarization regardless of the data distribution, and therefore usually suffer from severe quantization loss. To address this limitation, we propose a deep multi-quantization network that learns a data-dependent binarization in an unsupervised manner. More specifically, we design a K-Autoencoders (KAEs) network to jointly learn the parameters of the feature extractor and the binarization functions under a deep learning framework, so that discriminative binary descriptors can be obtained with a fine-grained multi-quantization. Because DBD-MQ simply allocates the same number of quantizers to each real-valued feature dimension, ignoring the elementwise diversity of informativeness, we further propose a deep competitive binary descriptor with multi-quantization (DCBD-MQ) method that learns the optimal allocation of bits under a fixed binary length in a competitive manner, where informative dimensions gain more bits for a more complete representation. Moreover, we present a similarity-aware binary encoding strategy based on the earth mover's distance of Autoencoders, so that elements quantized into similar Autoencoders have smaller Hamming distances. Extensive experimental results on six widely used datasets show that DBD-MQ and DCBD-MQ outperform most state-of-the-art unsupervised binary descriptors.
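The contrast the abstract draws between a rigid sign function and a data-dependent multi-quantization can be illustrated with a small sketch. This is not the paper's K-Autoencoders network: as a loudly stated assumption, scalar k-means centroids stand in for the K quantizers, and a thermometer (unary) code stands in for the similarity-aware encoding, chosen only because its Hamming distance equals the gap between quantizer indices. All function names are hypothetical.

```python
# Illustrative sketch only. Scalar k-means centroids are an ASSUMED stand-in
# for the paper's K autoencoders, and the thermometer code is an ASSUMED
# stand-in for its earth-mover's-distance-based encoding.


def sign_binarize(values):
    """Rigid sign-function binarization: one bit per value, threshold at 0."""
    return [1 if v > 0 else 0 for v in values]


def nearest(value, centroids):
    """Index of the centroid closest to value."""
    return min(range(len(centroids)), key=lambda i: abs(value - centroids[i]))


def fit_quantizers(values, k, iters=20):
    """Fit k scalar centroids to the data with Lloyd's algorithm."""
    ordered = sorted(values)
    n = len(ordered)
    # Initialise the centroids at evenly spaced quantiles of the data.
    centroids = [ordered[(2 * i + 1) * n // (2 * k)] for i in range(k)]
    for _ in range(iters):
        buckets = [[] for _ in range(k)]
        for v in values:
            buckets[nearest(v, centroids)].append(v)
        # Keep the old centroid if a bucket ends up empty.
        centroids = [sum(b) / len(b) if b else c
                     for b, c in zip(buckets, centroids)]
    return sorted(centroids)


def thermometer_code(index, k):
    """Unary code with k - 1 bits: `index` ones followed by zeros."""
    return [1] * index + [0] * (k - 1 - index)


def multi_quantize(values, centroids):
    """Encode each value by the code of its nearest quantizer."""
    k = len(centroids)
    return [thermometer_code(nearest(v, centroids), k) for v in values]


def hamming(a, b):
    """Number of positions at which two equal-length codes differ."""
    return sum(x != y for x, y in zip(a, b))


# All-positive toy features: the sign function maps every one of them to 1.
values = [0.1, 0.2, 0.3, 1.0, 1.1, 1.2, 2.0, 2.1, 2.2, 3.0, 3.1, 3.2]
centroids = fit_quantizers(values, k=4)
codes = multi_quantize(values, centroids)
```

On this toy data the sign function collapses all twelve values to the same bit, while the fitted quantizers separate the four clusters and the code distance grows with the distance between quantizer indices. The thermometer code spends K - 1 bits per element, so it ignores the fixed-length bit budget that the abstract's DCBD-MQ variant is concerned with.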

Citations

Journal Article•10.1109/TIM.2021.3053991

Deep Distillation Hashing for Unconstrained Palmprint Recognition

Huikai Shao, +2 more

- 25 Jan 2021

- IEEE Transactions on Instrumentation and...

TL;DR: This paper proposes a novel deep distillation hashing (DDH) algorithm as a benchmark for efficient deep palmprint recognition, which outperforms other baselines and achieves state-of-the-art performance.

51

Journal Article•10.1109/TIFS.2019.2946938

LS-CNN: Characterizing Local Patches at Multiple Scales for Face Recognition

Qiangchang Wang, +1 more

- 01 Jan 2020

- IEEE Transactions on Information Forensi...

TL;DR: This work proposes a new model, called Local and multi-Scale Convolutional Neural Networks (LS-CNN), developed by incorporating DFA into the HSNet model, and presented as the first effort to employ attention for the general face recognition task.

50

Journal Article•10.1109/TMM.2020.3016122

Deep Unsupervised Binary Descriptor Learning Through Locality Consistency and Self Distinctiveness

Bin Fan, +5 more

- 01 Jan 2021

- IEEE Transactions on Multimedia

TL;DR: The core idea of the proposed unsupervised deep learning method for binary descriptor learning is to exploit locality consistency in the descriptor space while distinguishing different patches and preserving the ability to match a patch with its geometrically transformed versions.

43

Journal Article•10.1109/tpami.2023.3272925

Diverse Sample Generation: Pushing the Limit of Generative Data-Free Quantization

01 Jan 2023

- IEEE Transactions on Pattern Analysis an...

TL;DR: This article proposes a generic Diverse Sample Generation (DSG) scheme for generative data-free quantization. It first slackens the statistics alignment for features in the BN layer to relax the distribution constraint, then strengthens the loss impact of specific BN layers for different samples and inhibits the correlation among samples in the generation process, diversifying samples from the statistical and spatial perspectives, respectively.

28

Journal Article•10.1109/tip.2021.3136710

Fast ORB-SLAM Without Keypoint Descriptors

01 Jan 2022

- IEEE Transactions on Image Processing

TL;DR: FastORB-SLAM uses a two-stage descriptor-independent keypoint matching method based on sparse optical flow to track keypoints between adjacent frames without computing descriptors.

26

References

Proceedings Article•10.1109/CVPR.2016.90

Deep Residual Learning for Image Recognition

Kaiming He, +3 more

- 27 Jun 2016

TL;DR: This paper proposes a residual learning framework to ease the training of networks that are substantially deeper than those used previously, and which won 1st place on the ILSVRC 2015 classification task.

198.7K

Proceedings Article

Very Deep Convolutional Networks for Large-Scale Image Recognition

Karen Simonyan, +1 more

- 04 Sep 2014

TL;DR: This work investigates the effect of the convolutional network depth on its accuracy in the large-scale image recognition setting using an architecture with very small convolution filters, which shows that a significant improvement on the prior-art configurations can be achieved by pushing the depth to 16-19 weight layers.

102.6K

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

- 03 Dec 2012

TL;DR: A deep convolutional neural network consisting of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax achieved state-of-the-art performance on ImageNet classification.

88.4K

Journal Article•10.1023/B:VISI.0000029664.99615.94

Distinctive Image Features from Scale-Invariant Keypoints

David G. Lowe

- 01 Nov 2004

- International Journal of Computer Vision

TL;DR: This paper presents a method for extracting distinctive invariant features from images that can be used to perform reliable matching between different views of an object or scene and can robustly identify objects among clutter and occlusion while achieving near real-time performance.

59.3K

Proceedings Article•10.1109/CVPR.2015.7298594

Going deeper with convolutions

Christian Szegedy, +8 more

- 07 Jun 2015

TL;DR: Inception is a deep convolutional neural network architecture that achieved the new state of the art for classification and detection in the ImageNet Large-Scale Visual Recognition Challenge 2014 (ILSVRC14).

56.6K

Related Papers (5)

Distinctive Image Features from Scale-Invariant Keypoints

ORB: An efficient alternative to SIFT or SURF

Learning Compact Binary Face Descriptor for Face Recognition

Discriminative Learning of Local Image Descriptors

BRIEF: binary robust independent elementary features