Hamming space

Topic Tools

Papers published on a yearly basis

Papers

Journal Article•10.1109/TPAMI.2012.193•

Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval

[...]

Yunchao Gong¹, Svetlana Lazebnik², Albert Gordo³, Florent Perronnin⁴•Institutions (4)

University of North Carolina at Chapel Hill¹, University of Illinois at Urbana–Champaign², Autonomous University of Barcelona³, Xerox⁴

01 Dec 2013-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This paper addresses the problem of learning similarity-preserving binary codes for efficient similarity search in large-scale image collections by proposing a simple and efficient alternating minimization algorithm, dubbed iterative quantization (ITQ), and demonstrating an application of ITQ to learning binary attributes or "classemes" on the ImageNet data set.

...read moreread less

Abstract: This paper addresses the problem of learning similarity-preserving binary codes for efficient similarity search in large-scale image collections. We formulate this problem in terms of finding a rotation of zero-centered data so as to minimize the quantization error of mapping this data to the vertices of a zero-centered binary hypercube, and propose a simple and efficient alternating minimization algorithm to accomplish this task. This algorithm, dubbed iterative quantization (ITQ), has connections to multiclass spectral clustering and to the orthogonal Procrustes problem, and it can be used both with unsupervised data embeddings such as PCA and supervised embeddings such as canonical correlation analysis (CCA). The resulting binary codes significantly outperform several other state-of-the-art methods. We also show that further performance improvements can result from transforming the data with a nonlinear kernel mapping prior to PCA or CCA. Finally, we demonstrate an application of ITQ to learning binary attributes or "classemes" on the ImageNet data set.

...read moreread less

2,110 citations

Proceedings Article•10.1109/CVPR.2012.6247912•

Supervised hashing with kernels

[...]

Wei Liu¹, Jun Wang², Rongrong Ji¹, Yu-Gang Jiang³, Shih-Fu Chang¹ - Show less +1 more•Institutions (3)

Columbia University¹, IBM², Fudan University³

16 Jun 2012

TL;DR: A novel kernel-based supervised hashing model which requires a limited amount of supervised information, i.e., similar and dissimilar data pairs, and a feasible training cost in achieving high quality hashing, and significantly outperforms the state-of-the-arts in searching both metric distance neighbors and semantically similar neighbors is proposed.

...read moreread less

Abstract: Recent years have witnessed the growing popularity of hashing in large-scale vision problems. It has been shown that the hashing quality could be boosted by leveraging supervised information into hash function learning. However, the existing supervised methods either lack adequate performance or often incur cumbersome model training. In this paper, we propose a novel kernel-based supervised hashing model which requires a limited amount of supervised information, i.e., similar and dissimilar data pairs, and a feasible training cost in achieving high quality hashing. The idea is to map the data to compact binary codes whose Hamming distances are minimized on similar pairs and simultaneously maximized on dissimilar pairs. Our approach is distinct from prior works by utilizing the equivalence between optimizing the code inner products and the Hamming distances. This enables us to sequentially and efficiently train the hash functions one bit at a time, yielding very short yet discriminative codes. We carry out extensive experiments on two image benchmarks with up to one million samples, demonstrating that our approach significantly outperforms the state-of-the-arts in searching both metric distance neighbors and semantically similar neighbors, with accuracy gains ranging from 13% to 46%.

...read moreread less

1,770 citations

Journal Article•10.1109/TPAMI.2011.103•

LDAHash: Improved Matching with Smaller Descriptors

[...]

Christoph Strecha¹, Alexander M. Bronstein², Michael M. Bronstein, Pascal Fua¹•Institutions (2)

École Polytechnique Fédérale de Lausanne¹, Technion – Israel Institute of Technology²

01 Jan 2012-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: This work reduces the size of the descriptors by representing them as short binary strings and learn descriptor invariance from examples, and shows extensive experimental validation, demonstrating the advantage of the proposed approach.

...read moreread less

Abstract: SIFT-like local feature descriptors are ubiquitously employed in computer vision applications such as content-based retrieval, video analysis, copy detection, object recognition, photo tourism, and 3D reconstruction. Feature descriptors can be designed to be invariant to certain classes of photometric and geometric transformations, in particular, affine and intensity scale transformations. However, real transformations that an image can undergo can only be approximately modeled in this way, and thus most descriptors are only approximately invariant in practice. Second, descriptors are usually high dimensional (e.g., SIFT is represented as a 128-dimensional vector). In large-scale retrieval and matching problems, this can pose challenges in storing and retrieving descriptor data. We map the descriptor vectors into the Hamming space in which the Hamming metric is used to compare the resulting representations. This way, we reduce the size of the descriptors by representing them as short binary strings and learn descriptor invariance from examples. We show extensive experimental validation, demonstrating the advantage of the proposed approach.

...read moreread less

744 citations

Proceedings Article•10.1109/CVPR.2015.7299011•

Semantics-preserving hashing for cross-view retrieval

[...]

Zijia Lin¹, Guiguang Ding¹, Mingqing Hu², Jianmin Wang¹•Institutions (2)

Tsinghua University¹, Chinese Academy of Sciences²

7 Jun 2015

TL;DR: This paper proposes an effective Semantics-Preserving Hashing method, termed SePH, which transforms semantic affinities of training data as supervised information into a probability distribution and approximates it with to-be-learnt hash codes in Hamming space via minimizing the Kullback-Leibler divergence.

...read moreread less

Abstract: With benefits of low storage costs and high query speeds, hashing methods are widely researched for efficiently retrieving large-scale data, which commonly contains multiple views, e.g. a news report with images, videos and texts. In this paper, we study the problem of cross-view retrieval and propose an effective Semantics-Preserving Hashing method, termed SePH. Given semantic affinities of training data as supervised information, SePH transforms them into a probability distribution and approximates it with to-be-learnt hash codes in Hamming space via minimizing the Kullback-Leibler divergence. Then kernel logistic regression with a sampling strategy is utilized to learn the nonlinear projections from features in each view to the learnt hash codes. And for any unseen instance, predicted hash codes and their corresponding output probabilities from observed views are utilized to determine its unified hash code, using a novel probabilistic approach. Extensive experiments conducted on three benchmark datasets well demonstrate the effectiveness and reasonableness of SePH.

...read moreread less

630 citations

Journal Article•10.1109/TPAMI.2018.2847335•

Binary Multi-View Clustering

[...]

Zheng Zhang¹, Li Liu, Fumin Shen², Heng Tao Shen², Ling Shao - Show less +1 more•Institutions (2)

University of Queensland¹, University of Electronic Science and Technology of China²

01 Jul 2019-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: A novel Binary Multi-View Clustering (BMVC) framework, which can dexterously manipulate multi-view image data and easily scale to large data, and is formulated by two key components: compact collaborative discrete representation learning and binary clustering structure learning, in a joint learning framework.

...read moreread less

Abstract: Clustering is a long-standing important research problem, however, remains challenging when handling large-scale image data from diverse sources. In this paper, we present a novel Binary Multi-View Clustering (BMVC) framework, which can dexterously manipulate multi-view image data and easily scale to large data. To achieve this goal, we formulate BMVC by two key components: compact collaborative discrete representation learning and binary clustering structure learning, in a joint learning framework. Specifically, BMVC collaboratively encodes the multi-view image descriptors into a compact common binary code space by considering their complementary information; the collaborative binary representations are meanwhile clustered by a binary matrix factorization model, such that the cluster structures are optimized in the Hamming space by pure, extremely fast bit-operations. For efficiency, the code balance constraints are imposed on both binary data representations and cluster centroids. Finally, the resulting optimization problem is solved by an alternating optimization scheme with guaranteed fast convergence. Extensive experiments on four large-scale multi-view image datasets demonstrate that the proposed method enjoys the significant reduction in both computation and memory footprint, while observing superior (in most cases) or very competitive performance, in comparison with state-of-the-art clustering methods.

...read moreread less

588 citations

...

Expand

Year	Papers
2025	2
2024	6
2023	21
2022	51
2021	47
2020	45

Topic Tools

Papers published on a yearly basis

Papers

Iterative Quantization: A Procrustean Approach to Learning Binary Codes for Large-Scale Image Retrieval

Supervised hashing with kernels

LDAHash: Improved Matching with Smaller Descriptors

Semantics-preserving hashing for cross-view retrieval

Binary Multi-View Clustering

Related Topics (5)

Performance Metrics