Top 10 papers published in the topic of Open addressing in 2018

Journal Article•10.1016/J.PATCOG.2017.03.021•

Quantization-based hashing

[...]

Jingkuan Song¹, Lianli Gao¹, Li Liu², Xiaofeng Zhu³, Nicu Sebe⁴ - Show less +1 more•Institutions (4)

University of Electronic Science and Technology of China¹, University of East Anglia², Guangxi Normal University³, University of Trento⁴

01 Mar 2018-Pattern Recognition

TL;DR: Quantization-based Hashing (QBH) is a generic framework which incorporates the advantages of quantization error reduction methods into conventional property preserving hashing methods and can be applied to both unsupervised and supervised hashing methods.

...read moreread less

216 citations

Journal Article•10.1016/J.NEUCOM.2017.09.042•

Bagging–boosting-based semi-supervised multi-hashing with query-adaptive re-ranking

[...]

Wing W. Y. Ng¹, Xiancheng Zhou¹, Xing Tian¹, Xizhao Wang², Daniel S. Yeung¹ - Show less +1 more•Institutions (2)

South China University of Technology¹, Shenzhen University²

31 Jan 2018-Neurocomputing

TL;DR: A bagging–boosting-based semi-supervised multi-hashing with query-adaptive re-ranking (BBSHR) is proposed, which yields better precision and recall rates for given numbers of hash tables and bits.

...read moreread less

25 citations

Journal Article•10.1109/TIP.2017.2759250•

Binary Multidimensional Scaling for Hashing

[...]

Yameng Huang¹, Zhouchen Lin¹•Institutions (1)

Peking University¹

01 Jan 2018-IEEE Transactions on Image Processing

TL;DR: Empirical results show that the proposed unified and concise unsupervised hashing framework, called binary multidimensional scaling, outperforms state-of-the-art methods by a large margin in terms of distance preservation, which is practical for real-world applications.

...read moreread less

Abstract: Hashing is a useful technique for fast nearest neighbor search due to its low storage cost and fast query speed. Unsupervised hashing aims at learning binary hash codes for the original features so that the pairwise distances can be best preserved. While several works have targeted on this task, the results are not satisfactory mainly due to the over-simplified model. In this paper, we propose a unified and concise unsupervised hashing framework, called binary multidimensional scaling , which is able to learn the hash code for distance preservation in both batch and online mode. In the batch mode, unlike most existing hashing methods, we do not need to simplify the model by predefining the form of hash map. Instead, we learn the binary codes directly based on the pairwise distances among the normalized original features by alternating minimization. This enables a stronger expressive power of the hash map. In the online mode, we consider the holistic distance relationship between current query example and those we have already learned, rather than only focusing on current data chunk. It is useful when the data come in a streaming fashion. Empirical results show that while being efficient for training, our algorithm outperforms state-of-the-art methods by a large margin in terms of distance preservation, which is practical for real-world applications.

...read moreread less

12 citations

Journal Article•10.1016/J.CAG.2017.08.012•

Exclusive grouped spatial hashing

[...]

Weiwei Duan, Jianxin Luo, Guiqiang Ni, Bin Tang, Qi Hu, Yi Gao - Show less +2 more

01 Feb 2018-Computers & Graphics

TL;DR: Here a full use of these collisions is obtained and therefore the spatial data compression rate is improved, and the performance of exclusive grouped spatial hashing is presented in 2D and 3D graphic examples.

...read moreread less

10 citations

Book Chapter•10.1007/978-3-319-98398-1_8•

SIMD Vectorized Hashing for Grouped Aggregation

[...]

Bala Gurumurthy¹, David Broneske¹, Marcus Pinnecke¹, Gabriel Campero¹, Gunter Saake¹ - Show less +1 more•Institutions (1)

Otto-von-Guericke University Magdeburg¹

2 Sep 2018

TL;DR: Overall, this work provides a basic structure of a dedicated SIMD accelerated grouped aggregation framework that can be adapted with different hashing techniques and observes different impacts of vectorization on these techniques.

...read moreread less

Abstract: Grouped aggregation is a commonly used analytical function. The common implementation of the function using hashing techniques suffers lower throughput rate due to the collision of the insert keys in the hashing techniques. During collision, the underlying technique searches for an alternative location to insert keys. Searching an alternative location increases the processing time for an individual key thereby degrading the overall throughput. In this work, we use Single Instruction Multiple Data (SIMD) vectorization to search multiple slots at an instant followed by direct aggregation of results. We provide our experimental results of our vectorized grouped aggregation with various open-addressing hashing techniques using several dataset distributions and our inferences on them. Among our findings, we observe different impacts of vectorization on these techniques. Namely, linear probing and two-choice hashing improve their performance with vectorization, whereas cuckoo and hopscotch hashing show a negative impact. Overall, we provide in this work a basic structure of a dedicated SIMD accelerated grouped aggregation framework that can be adapted with different hashing techniques.

...read moreread less

9 citations

Proceedings Article•10.1109/CONFLUENCE.2018.8442607•

Matrix Hashing with Two Level of Collision Resolution

[...]

Anand Agrawal¹, Sriram Bhyravarapu¹, Nuthalapati Venkata Krishna Chaitanya¹•Institutions (1)

Vignan University¹

1 Jan 2018

TL;DR: This paper presents a new and innovative technique for collision resolution based on two-dimensional array based on a unique way of evaluating and implementing algorithms to resolve collisions in hash tables.

...read moreread less

Abstract: Hashing is a well-known heuristic used for indexing and retrieving items from database as it uses a shorter hashed key, for finding the element, which is more efficient. In Data Structures, we use a hash table for looking up data rapidly. Hash functions enable rapid lookup of tables or databases by detecting duplicated records in a large file. Hash function should be properly designed to avoid collisions. However collisions are inevitable [1]. This paper presents a new and innovative technique for collision resolution based on two-dimensional array. The proposed strategy followed a unique way of evaluating and implementing algorithms to resolve collisions in hash tables. Analytical modelling and software simulations are quantifiable measures for the effectiveness of our algorithm. Efficient implementations that are easily realizable and productive in modern technologies are discussed. The performance benefits are significant and machines with moderate memory and speed specifications are prerequisites.

...read moreread less

4 citations

Journal Article•10.1007/S11042-017-4625-X•

Robust image authentication via locality sensitive hashing with core alignment

[...]

Qiang Ma¹, Qiang Ma², Lei Xu³, Ling Xing², Bin Wu² - Show less +1 more•Institutions (3)

China Academy of Engineering Physics¹, Southwest University of Science and Technology², Tsinghua University³

01 Mar 2018-Multimedia Tools and Applications

TL;DR: Experimental results show that the proposed hashing optimizations can find optimal solutions with limited steps, and the hashing method is superior to other state-of-the-art methods in terms of authentication and robustness.

...read moreread less

Abstract: Robust image hashing is a promising technique to represent image’s perceptual content. However, when it comes to image authentication, tradeoff between robustness and discrimination is a non-negligible issue. The allowed content preserving operations and sensitive malicious manipulations on images are quite subjective to human’s perception. So it needs tactics to design good hashing methods. In this paper we incorporate the novel concept of core alignment into hashing, where the proposed core alignment improves the performances of balance. First, we formulize the hashing as a supervised minimal optimization problem based on Locality Sensitive Hashing, in which p-stable distribution is exploited to maintain high dimensional locality features. Then we solve this problem by two sub-optimization problems, i.e., searching for optimal shift and searching for optimal quantization intervals. By using particle swarm optimization and simulated annealing programming approaches we develop two stochastic solutions to those two problems, respectively. Experimental results show that our proposed hashing optimizations can find optimal solutions with limited steps, and the hashing method is superior to other state-of-the-art methods in terms of authentication and robustness.

...read moreread less

4 citations

Journal Article•10.1109/TNNLS.2016.2615085•

In Defense of Locality-Sensitive Hashing

[...]

Kun Ding¹, Chunlei Huo¹, Bin Fan¹, Shiming Xiang¹, Chunhong Pan¹ - Show less +1 more•Institutions (1)

Chinese Academy of Sciences¹

01 Jan 2018-IEEE Transactions on Neural Networks

TL;DR: This paper developed the locality-sensitive two-step hashing (LS-TSH) that generates the binary codes through LSH rather than any complex optimization technique, and could obtain comparable retrieval accuracy with state of the arts with two to three orders of magnitudes faster training speed.

...read moreread less

Abstract: Hashing-based semantic similarity search is becoming increasingly important for building large-scale content-based retrieval system. The state-of-the-art supervised hashing techniques use flexible two-step strategy to learn hash functions. The first step learns binary codes for training data by solving binary optimization problems with millions of variables, thus usually requiring intensive computations. Despite simplicity and efficiency, locality-sensitive hashing (LSH) has never been recognized as a good way to generate such codes due to its poor performance in traditional approximate neighbor search. We claim in this paper that the true merit of LSH lies in transforming the semantic labels to obtain the binary codes, resulting in an effective and efficient two-step hashing framework. Specifically, we developed the locality-sensitive two-step hashing (LS-TSH) that generates the binary codes through LSH rather than any complex optimization technique. Theoretically, with proper assumption, LS-TSH is actually a useful LSH scheme, so that it preserves the label-based semantic similarity and possesses sublinear query complexity for hash lookup. Experimentally, LS-TSH could obtain comparable retrieval accuracy with state of the arts with two to three orders of magnitudes faster training speed.

...read moreread less

Journal Article•10.1016/J.NEUCOM.2017.10.061•

Hashing in the zero shot framework with domain adaptation

[...]

Shubham Pachori¹, Ameya Deshpande¹, Shanmuganathan Raman¹•Institutions (1)

Indian Institute of Technology Gandhinagar¹

31 Jan 2018-Neurocomputing

TL;DR: In this paper, an unsupervised domain adaptation model is proposed to learn hash codes from training images belonging to seen classes, which can efficiently encode images of unseen classes to binary codes.

...read moreread less

Proceedings Article•10.1145/3230718.3232629•

Cuckoo++ hash tables: high-performance hash tables for networking applications

[...]

Nicolas Le Scouarnec

23 Jul 2018

TL;DR: In this article, the authors proposed an algorithm to improve the performance of cuckoo hash tables without altering the properties of the original Cuckoo Hashing Table, and they also presented an implementation tailored to run efficiently on Intel Xeon processors to support NFV and softwarization trends.

...read moreread less

Abstract: Hash tables are essential data-structures for networking applications (e.g., connection tracking, firewalls, network address translators). Among these, cuckoo hash tables provide excellent performance by processing lookups with very few memory accesses (2 to 3 per lookup). Yet, they remain memory bound and each memory access impacts performance. In this paper, we propose algorithmic improvements to cuckoo hash tables to eliminate unnecessary memory accesses, without altering the properties of the original cuckoo hash table so that all existing theoretical analysis remain applicable. We also present an implementation tailored to run efficiently on Intel Xeon processors, thus supporting NFV and softwarization trends and compare it to the optimized implementation of DPDK. On a single core, our implementation achieves 37M positive lookups per second (i.e., when the key looked up is present in the table), and 60M negative lookups per second, a 45% to 70% improvement over DPDK.

...read moreread less

Showing papers on "Open addressing published in 2018"

Quantization-based hashing

Bagging–boosting-based semi-supervised multi-hashing with query-adaptive re-ranking

Binary Multidimensional Scaling for Hashing

Exclusive grouped spatial hashing

SIMD Vectorized Hashing for Grouped Aggregation

Matrix Hashing with Two Level of Collision Resolution

Robust image authentication via locality sensitive hashing with core alignment

In Defense of Locality-Sensitive Hashing

Hashing in the zero shot framework with domain adaptation

Cuckoo++ hash tables: high-performance hash tables for networking applications