Journal Article, DOI: 10.14778/2850583.2850588
Processing and optimizing main memory spatial-keyword queries
Taesung Lee, Jin-Woo Park, Sanghoon Lee, Seung-won Hwang, Sameh Elnikety, Yuxiong He
- 01 Nov 2015
- Vol. 9, Iss: 3, pp 132-143
TL;DR: This work employs a cost-based optimizer to process spatial-keyword queries using a spatial index and a keyword index, and introduces five optimization techniques that efficiently reduce the search space and produce a query plan with low cost.
Abstract: Important cloud services rely on spatial-keyword queries, containing a spatial predicate and arbitrary boolean keyword queries. In particular, we study the processing of such queries in main memory to support short response times. In contrast, current state-of-the-art spatial-keyword indexes and relational engines are designed for different assumptions. Rather than building a new spatial-keyword index, we employ a cost-based optimizer to process these queries using a spatial index and a keyword index. We address several technical challenges to achieve this goal. We introduce three operators as the building blocks to construct plans for main memory query processing. We then develop a cost model for the operators and query plans. We introduce five optimization techniques that efficiently reduce the search space and produce a query plan with low cost. The optimization techniques are computationally efficient, and they identify a query plan with a formal approximation guarantee under the common independence assumption. Furthermore, we extend the framework to exploit interesting orders. We implement the query optimizer to empirically validate our proposed approach using real-life datasets. The evaluation shows that the optimizations provide significant reduction in the average and tail latency of query processing: 7- to 11-fold reduction over using a single index in terms of 99th percentile response time. In addition, this approach outperforms existing spatial-keyword indexes, and DBMS query optimizers for both average and high-percentile response times.
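The idea of choosing between a spatial index and a keyword index by estimated cost can be illustrated with a small sketch. This is not the paper's optimizer, operators, or cost model: the grid index, the two plan names, the "candidates examined" cost, and every function name below are assumptions made for illustration only. The result-size estimate multiplies per-predicate selectivities, which is what the independence assumption amounts to.

```python
from dataclasses import dataclass

CELL = 1.0  # hypothetical grid cell width for the toy spatial index


@dataclass(frozen=True)
class Obj:
    oid: int
    x: float
    y: float
    keywords: frozenset


def cell_of(x, y):
    return (int(x // CELL), int(y // CELL))


def build_indexes(objs):
    """Build a keyword inverted index and a uniform-grid spatial index."""
    kw_index, grid = {}, {}
    for o in objs:
        grid.setdefault(cell_of(o.x, o.y), set()).add(o.oid)
        for kw in o.keywords:
            kw_index.setdefault(kw, set()).add(o.oid)
    return kw_index, grid


def cells_overlapping(rect):
    x0, y0, x1, y1 = rect
    (cx0, cy0), (cx1, cy1) = cell_of(x0, y0), cell_of(x1, y1)
    return [(cx, cy) for cx in range(cx0, cx1 + 1) for cy in range(cy0, cy1 + 1)]


def plan_costs(kw_index, grid, rect, keywords):
    """Assumed cost: number of candidate objects each plan must examine."""
    spatial = sum(len(grid.get(c, ())) for c in cells_overlapping(rect))
    keyword = min(len(kw_index.get(k, ())) for k in keywords)  # rarest keyword first
    return {"spatial_first": spatial, "keyword_first": keyword}


def estimated_result_size(n, kw_index, grid, rect, keywords):
    """Independence assumption: multiply the per-predicate selectivities."""
    sel = plan_costs(kw_index, grid, rect, keywords)["spatial_first"] / n
    for k in keywords:
        sel *= len(kw_index.get(k, ())) / n
    return n * sel


def in_rect(o, rect):
    x0, y0, x1, y1 = rect
    return x0 <= o.x <= x1 and y0 <= o.y <= y1


def run_spatial_first(by_id, grid, rect, keywords):
    cands = set().union(*(grid.get(c, set()) for c in cells_overlapping(rect)))
    return {i for i in cands if in_rect(by_id[i], rect) and keywords <= by_id[i].keywords}


def run_keyword_first(by_id, kw_index, rect, keywords):
    postings = sorted((kw_index.get(k, set()) for k in keywords), key=len)
    cands = set(postings[0]).intersection(*postings[1:])
    return {i for i in cands if in_rect(by_id[i], rect)}


def execute(objs, rect, keywords):
    """Pick the cheaper plan for a rectangle plus an AND of keywords, then run it."""
    by_id = {o.oid: o for o in objs}
    kw_index, grid = build_indexes(objs)
    costs = plan_costs(kw_index, grid, rect, keywords)
    plan = min(costs, key=costs.get)
    if plan == "spatial_first":
        return plan, run_spatial_first(by_id, grid, rect, keywords)
    return plan, run_keyword_first(by_id, kw_index, rect, keywords)


# Toy data: four objects in one dense cell, one far away.
OBJS = [
    Obj(0, 0.5, 0.5, frozenset({"cafe", "wifi"})),
    Obj(1, 0.6, 0.4, frozenset({"cafe"})),
    Obj(2, 0.2, 0.8, frozenset({"bar"})),
    Obj(3, 5.5, 5.5, frozenset({"cafe", "wifi"})),
    Obj(4, 0.9, 0.1, frozenset({"museum"})),
]
```

In this toy data, a query over the dense cell with a rare keyword favors the keyword index, while a query over a sparse region with a common keyword favors the spatial index. Both plans return the same answer; only the number of candidates examined differs.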