Efficient Algorithms and Cost Models for Reverse Spatial-Keyword k-Nearest Neighbor Search
TL;DR: This article introduces the Reverse Spatial-Keyword k-Nearest Neighbor (RSKkNN) query, which finds those objects that have the query as one of their k-nearest spatial-textual objects and proposes a hybrid index tree, called IUR-tree (Intersection-Union R-tree), that effectively combines location proximity with textual similarity.
read more
Abstract: Geographic objects associated with descriptive texts are becoming prevalent, justifying the need for spatial-keyword queries that consider both locations and textual descriptions of the objects. Specifically, the relevance of an object to a query is measured by spatial-textual similarity that is based on both spatial proximity and textual similarity. In this article, we introduce the Reverse Spatial-Keyword k-Nearest Neighbor (RSKkNN) query, which finds those objects that have the query as one of their k-nearest spatial-textual objects. The RSKkNN queries have numerous applications in online maps and GIS decision support systems. To answer RSKkNN queries efficiently, we propose a hybrid index tree, called IUR-tree (Intersection-Union R-tree) that effectively combines location proximity with textual similarity. Subsequently, we design a branch-and-bound search algorithm based on the IUR-tree. To accelerate the query processing, we improve IUR-tree by leveraging the distribution of textual description, leading to some variants of the IUR-tree called Clustered IUR-tree (CIUR-tree) and combined clustered IUR-tree (C2IUR-tree), for each of which we develop optimized algorithms. We also provide a theoretical cost model to analyze the efficiency of our algorithms. Our empirical studies show that the proposed algorithms are efficient and scalable.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Figures

Fig. 15. Experimental results on the GN dataset 
Fig. 8. Illustration to RSKkNN algorithm 
Fig. 5. Illustration of spatial approximation 
Fig. 2. Example for illustrating the relationship of RSkNN, RKkNN and RSKkNN 
Table I. Summary of the notations used 
Fig. 10. Illustration for the maximal spatial distances between entries and minimal spatial distances between query object q and entries at level l.
Citations
•Journal Article
ACM Transactions on Database Systems
Dan Suciu,Gerhard Weikum +1 more
TL;DR: BLOCKIN BLOCKINÒ BLOCKin× ½¸ÔÔº ¾ßß¿º ¿ ¾ ¾ à ¼ à à 0
425
Spatial keyword search: a survey
TL;DR: This survey summarizes the findings of existing spatial keyword search studies, thus uncovering new insights that may guide software engineers as well as further research.
91
Best Keyword Cover Search
TL;DR: A generic version of Closest Keywords search called Best Keyword Cover which considers inter-objects distance as well as the keyword rating of objects is investigated to investigate the increasing availability and importance of keyword rating in object evaluation for the better decision making.
68
Parallel Semantic Trajectory Similarity Join
Lisi Chen,Shuo Shang,Christian S. Jensen,Bin Yao,Panos Kalnis +4 more
- 20 Apr 2020
TL;DR: An efficient divide-and-conquer algorithm is proposed to derive bounds of spatial similarity and textual similarity between two semantic trajectories, which enable us prune dissimilar trajectory pairs without the need of computing the exact value of spatio-textual similarity.
Location- and keyword-based querying of geo-textual data: a survey
Zhida Chen,Lisi Chen,Gao Cong,Christian S. Jensen +3 more
- 30 Mar 2021
TL;DR: A survey of both the research problems studied and the solutions proposed in these two settings is offered, which aims to offer the reader a first understanding of key concepts and techniques underlying proposed solutions to the querying of geo-textual data.
44
References
•Proceedings Article
A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise
Martin Ester,Hans-Peter Kriegel,Jörg Sander,Xiaowei Xu +3 more
- 02 Aug 1996
TL;DR: In this paper, a density-based notion of clusters is proposed to discover clusters of arbitrary shape, which can be used for class identification in large spatial databases and is shown to be more efficient than the well-known algorithm CLAR-ANS.
20.3K
•Proceedings Article
A density-based algorithm for discovering clusters in large spatial Databases with Noise
Martin Ester,Hans-Peter Kriegel,Jörg Sander,Xiaowei Xu +3 more
- 01 Jan 1996
TL;DR: DBSCAN, a new clustering algorithm relying on a density-based notion of clusters which is designed to discover clusters of arbitrary shape, is presented which requires only one input parameter and supports the user in determining an appropriate value for it.
Term Weighting Approaches in Automatic Text Retrieval
Gerard Salton,Chris Buckley +1 more
TL;DR: This paper summarizes the insights gained in automatic term weighting, and provides baseline single term indexing models with which other more elaborate content analysis procedures can be compared.
R-trees: a dynamic index structure for spatial searching
Antonin Guttman
- 01 Jun 1984
TL;DR: A dynamic index structure called an R-tree is described which meets this need, and algorithms for searching and updating it are given and it is concluded that it is useful for current database systems in spatial applications.
8K