Learning Deep Binary Descriptor with Multi-Quantization

doi:10.1109/TPAMI.2018.2858760

Journal Article10.1109/TPAMI.2018.2858760

Learning Deep Binary Descriptor with Multi-Quantization

Yueqi Duan, +4 more

- 01 Aug 2019

- IEEE Transactions on Pattern Analysis an...

- Vol. 41, Iss: 8, pp 1924-1938

50

TL;DR: A K-Autoencoders (KAEs) network is designed to jointly learn the parameters of feature extractor and the binarization functions under a deep learning framework, so that discriminative binary descriptors can be obtained with a fine-grained multi-quantization.

Abstract: In this paper, we propose an unsupervised feature learning method called deep binary descriptor with multi-quantization (DBD-MQ) for visual analysis. Existing learning-based binary descriptors such as compact binary face descriptor (CBFD) and DeepBit utilize the rigid sign function for binarization despite of data distributions, which usually suffer from severe quantization loss. In order to address the limitation, we propose a deep multi-quantization network to learn a data-dependent binarization in an unsupervised manner. More specifically, we design a K-Autoencoders (KAEs) network to jointly learn the parameters of feature extractor and the binarization functions under a deep learning framework, so that discriminative binary descriptors can be obtained with a fine-grained multi-quantization. As DBD-MQ simply allocates the same number of quantizers to each real-valued feature dimension ignoring the elementwise diversity of informativeness, we further propose a deep competitive binary descriptor with multi-quantization (DCBD-MQ) method to learn optimal allocation of bits with the fixed binary length in a competitive manner, where informative dimensions gain more bits for complete representation. Moreover, we present a similarity-aware binary encoding strategy based on the earth mover's distance of Autoencoders, so that elements that are quantized into similar Autoencoders will have smaller Hamming distances. Extensive experimental results on six widely-used datasets show that our DBD-MQ and DCBD-MQ outperform most state-of-the-art unsupervised binary descriptors.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Book Chapter•10.1201/9781003162810-13

A Survey of Quantization Methods for Efficient Neural Network Inference

12 Jan 2022

TL;DR: In this paper , the authors provide approaches to quantizing the numerical values in deep neural network computations, covering the advantages/disadvantages of current methods, and provide a survey of the history of quantization up to 1998.

...read moreread less

612

•Journal Article•10.1109/tcsvt.2021.3080920

A Decade Survey of Content Based Image Retrieval Using Deep Learning

Dana Sukau

- 01 May 2022

- IEEE Transactions on Circuits and System...

TL;DR: A comprehensive survey of deep learning based developments in the past decade for content-based image retrieval is presented in this paper , where a performance analysis is also performed using the state-of-the-art methods.

...read moreread less

207

•Journal Article•10.1109/TIP.2019.2921877

A Deep Learning Approach for Multi-Frame In-Loop Filter of HEVC

Tianyi Li, +5 more

- 14 Jun 2019

- IEEE Transactions on Image Processing

TL;DR: A multi-frame in-loop filter (MIF) for HEVC, which enhances the visual quality of each encoded frame by leveraging its adjacent frames and utilizing the spatial information of this frame and the temporal information of its neighboring higher-quality frames.

...read moreread less

173

Proceedings Article•10.1109/CVPR.2017.516

Learning Deep Binary Descriptor with Multi-quantization

Yueqi Duan, +4 more

- 01 Jul 2017

TL;DR: An unsupervised feature learning method called deep binary descriptor with multi-quantization (DBD-MQ) for visual matching that applies a K-AutoEncoders (KAEs) network to jointly learn the parameters and the binarization functions under a deep learning framework so that discriminative binary descriptors can be obtained with a fine-grained multi- quantization.

...read moreread less

110

Journal Article•10.1109/TIP.2020.2981879

Similarity-Preserving Linkage Hashing for Online Image Retrieval

Mingbao Lin, +4 more

- 24 Mar 2020

- IEEE Transactions on Image Processing

TL;DR: A novel online hashing method, termed Similarity Preserving Linkage Hashing (SPLH), which not only utilizes pairwise similarity to learn the intra- class relationships, but also fully exploits a latent linkage space to capture the inter-class relationships and the common characteristics between label vectors and to-be-learned hash codes.

...read moreread less

63

...

Expand

References

Proceedings Article•10.1109/ICCV.2015.169

Fast R-CNN

Ross Girshick

- 07 Dec 2015

TL;DR: Fast R-CNN as discussed by the authors proposes a Fast Region-based Convolutional Network method for object detection, which employs several innovations to improve training and testing speed while also increasing detection accuracy and achieves a higher mAP on PASCAL VOC 2012.

...read moreread less

24.1K

•Dissertation

Learning Multiple Layers of Features from Tiny Images

Alex Krizhevsky

- 01 Jan 2009

TL;DR: In this paper, the authors describe how to train a multi-layer generative model of natural images, using a dataset of millions of tiny colour images, described in the next section.

...read moreread less

23.7K

•Posted Content

Fast R-CNN

Ross Girshick

- 30 Apr 2015

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This paper proposes a Fast Region-based Convolutional Network method (Fast R-CNN) for object detection that builds on previous work to efficiently classify object proposals using deep convolutional networks.

...read moreread less

20.3K

•Posted Content

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

Andrew Howard, +7 more

- 17 Apr 2017

- arXiv: Computer Vision and Pattern Recog...

TL;DR: This work introduces two simple global hyper-parameters that efficiently trade off between latency and accuracy and demonstrates the effectiveness of MobileNets across a wide range of applications and use cases including object detection, finegrain classification, face attributes and large scale geo-localization.

...read moreread less

18.5K

Journal Article•10.1109/TPAMI.2002.1017623

Multiresolution gray-scale and rotation invariant texture classification with local binary patterns

Timo Ojala, +2 more

- 01 Jul 2002

- IEEE Transactions on Pattern Analysis an...

TL;DR: A generalized gray-scale and rotation invariant operator presentation that allows for detecting the "uniform" patterns for any quantization of the angular space and for any spatial resolution and presents a method for combining multiple operators for multiresolution analysis.

...read moreread less

16.4K

...

Expand

Learning Deep Binary Descriptor with Multi-Quantization

Chat with Paper

AI Agents for this Paper

Citations

A Survey of Quantization Methods for Efficient Neural Network Inference

A Decade Survey of Content Based Image Retrieval Using Deep Learning

A Deep Learning Approach for Multi-Frame In-Loop Filter of HEVC

Learning Deep Binary Descriptor with Multi-quantization

Similarity-Preserving Linkage Hashing for Online Image Retrieval

References

Fast R-CNN

Learning Multiple Layers of Features from Tiny Images

Fast R-CNN

MobileNets: Efficient Convolutional Neural Networks for Mobile Vision Applications

Multiresolution gray-scale and rotation invariant texture classification with local binary patterns

Related Papers (5)

Distinctive Image Features from Scale-Invariant Keypoints

ORB: An efficient alternative to SIFT or SURF

Learning Compact Binary Face Descriptor for Face Recognition

Discriminative Learning of Local Image Descriptors

BRIEF: binary robust independent elementary features