Journal Article10.1109/TPAMI.2018.2858760
Learning Deep Binary Descriptor with Multi-Quantization
50
TL;DR: A K-Autoencoders (KAEs) network is designed to jointly learn the parameters of feature extractor and the binarization functions under a deep learning framework, so that discriminative binary descriptors can be obtained with a fine-grained multi-quantization.
read more
Abstract: In this paper, we propose an unsupervised feature learning method called deep binary descriptor with multi-quantization (DBD-MQ) for visual analysis. Existing learning-based binary descriptors such as compact binary face descriptor (CBFD) and DeepBit utilize the rigid sign function for binarization despite of data distributions, which usually suffer from severe quantization loss. In order to address the limitation, we propose a deep multi-quantization network to learn a data-dependent binarization in an unsupervised manner. More specifically, we design a K-Autoencoders (KAEs) network to jointly learn the parameters of feature extractor and the binarization functions under a deep learning framework, so that discriminative binary descriptors can be obtained with a fine-grained multi-quantization. As DBD-MQ simply allocates the same number of quantizers to each real-valued feature dimension ignoring the elementwise diversity of informativeness, we further propose a deep competitive binary descriptor with multi-quantization (DCBD-MQ) method to learn optimal allocation of bits with the fixed binary length in a competitive manner, where informative dimensions gain more bits for complete representation. Moreover, we present a similarity-aware binary encoding strategy based on the earth mover's distance of Autoencoders, so that elements that are quantized into similar Autoencoders will have smaller Hamming distances. Extensive experimental results on six widely-used datasets show that our DBD-MQ and DCBD-MQ outperform most state-of-the-art unsupervised binary descriptors.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A Survey of Quantization Methods for Efficient Neural Network Inference
12 Jan 2022
TL;DR: In this paper , the authors provide approaches to quantizing the numerical values in deep neural network computations, covering the advantages/disadvantages of current methods, and provide a survey of the history of quantization up to 1998.
612
A Decade Survey of Content Based Image Retrieval Using Deep Learning
TL;DR: A comprehensive survey of deep learning based developments in the past decade for content-based image retrieval is presented in this paper , where a performance analysis is also performed using the state-of-the-art methods.
A Deep Learning Approach for Multi-Frame In-Loop Filter of HEVC
TL;DR: A multi-frame in-loop filter (MIF) for HEVC, which enhances the visual quality of each encoded frame by leveraging its adjacent frames and utilizing the spatial information of this frame and the temporal information of its neighboring higher-quality frames.
173
Learning Deep Binary Descriptor with Multi-quantization
Yueqi Duan,Jiwen Lu,Ziwei Wang,Jianjiang Feng,Jie Zhou +4 more
- 01 Jul 2017
TL;DR: An unsupervised feature learning method called deep binary descriptor with multi-quantization (DBD-MQ) for visual matching that applies a K-AutoEncoders (KAEs) network to jointly learn the parameters and the binarization functions under a deep learning framework so that discriminative binary descriptors can be obtained with a fine-grained multi- quantization.
Similarity-Preserving Linkage Hashing for Online Image Retrieval
TL;DR: A novel online hashing method, termed Similarity Preserving Linkage Hashing (SPLH), which not only utilizes pairwise similarity to learn the intra- class relationships, but also fully exploits a latent linkage space to capture the inter-class relationships and the common characteristics between label vectors and to-be-learned hash codes.
63
References
Data clustering: 50 years beyond K-means
Anil K. Jain
- 01 Jun 2010
TL;DR: A brief overview of clustering is provided, well known clustering methods are summarized, the major challenges and key issues in designing clustering algorithms are discussed, and some of the emerging and useful research directions are pointed out.
8.4K
•Proceedings Article
Two-Stream Convolutional Networks for Action Recognition in Videos
Karen Simonyan,Andrew Zisserman +1 more
- 08 Dec 2014
TL;DR: This work proposes a two-stream ConvNet architecture which incorporates spatial and temporal networks and demonstrates that a ConvNet trained on multi-frame dense optical flow is able to achieve very good performance in spite of limited training data.
8.3K
Data Clustering: 50 Years Beyond K-means
Anil K. Jain
- 15 Sep 2008
TL;DR: Cluster analysis as mentioned in this paper is the formal study of algorithms and methods for grouping objects according to measured or perceived intrinsic characteristics, which is one of the most fundamental modes of understanding and learning.
Deep face recognition
Omkar M. Parkhi,Andrea Vedaldi,Andrew Zisserman +2 more
- 01 Jan 2015
TL;DR: It is shown how a very large scale dataset can be assembled by a combination of automation and human in the loop, and the trade off between data purity and time is discussed.
Face Description with Local Binary Patterns: Application to Face Recognition
TL;DR: This paper presents a novel and efficient facial image representation based on local binary pattern (LBP) texture features that is assessed in the face recognition problem under different challenges.
6.2K