Efficient Algorithms and Cost Models for Reverse Spatial-Keyword k-Nearest Neighbor Search

doi:10.1145/2576232

Open Access • Journal Article • 10.1145/2576232

Ying Lu, +4 more

- 26 May 2014

- ACM Transactions on Database Systems

- Vol. 39, Iss: 2, pp 13

47

TL;DR: This article introduces the Reverse Spatial-Keyword k-Nearest Neighbor (RSKkNN) query, which finds those objects that have the query as one of their k-nearest spatial-textual objects. It also proposes a hybrid index tree, called the IUR-tree (Intersection-Union R-tree), that combines location proximity with textual similarity.

Abstract: Geographic objects associated with descriptive texts are becoming prevalent, justifying the need for spatial-keyword queries that consider both locations and textual descriptions of the objects. Specifically, the relevance of an object to a query is measured by spatial-textual similarity that is based on both spatial proximity and textual similarity. In this article, we introduce the Reverse Spatial-Keyword k-Nearest Neighbor (RSKkNN) query, which finds those objects that have the query as one of their k-nearest spatial-textual objects. The RSKkNN queries have numerous applications in online maps and GIS decision support systems. To answer RSKkNN queries efficiently, we propose a hybrid index tree, called IUR-tree (Intersection-Union R-tree) that effectively combines location proximity with textual similarity. Subsequently, we design a branch-and-bound search algorithm based on the IUR-tree. To accelerate the query processing, we improve IUR-tree by leveraging the distribution of textual description, leading to some variants of the IUR-tree called Clustered IUR-tree (CIUR-tree) and combined clustered IUR-tree (C2IUR-tree), for each of which we develop optimized algorithms. We also provide a theoretical cost model to analyze the efficiency of our algorithms. Our empirical studies show that the proposed algorithms are efficient and scalable.
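To make the query definition in the abstract concrete, the following is a minimal brute-force sketch, not the IUR-tree algorithm the article proposes. It assumes a similarity that linearly combines normalized spatial proximity with Jaccard similarity over keyword sets; the weight `alpha`, the normalization by `max_dist`, and the names `Obj`, `sim_st`, and `rsk_knn` are illustrative assumptions and may differ from the article's exact definitions.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Obj:
    """A spatial-textual object: a 2D location plus a set of keywords."""
    name: str
    x: float
    y: float
    words: frozenset


def jaccard(a, b):
    """Jaccard similarity of two keyword sets (assumed textual measure)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def sim_st(a, b, alpha, max_dist):
    """Assumed spatial-textual similarity: weighted sum of proximity and text."""
    spatial = 1.0 - hypot(a.x - b.x, a.y - b.y) / max_dist
    return alpha * spatial + (1.0 - alpha) * jaccard(a.words, b.words)


def rsk_knn(query, objects, k, alpha=0.5):
    """Return names of objects that have `query` among their k most similar objects.

    Brute force, quadratic in the number of objects: for each object o, count
    how many other objects are strictly more similar to o than the query is.
    """
    pts = list(objects) + [query]
    # Largest pairwise distance, used to scale proximity into [0, 1].
    max_dist = max(hypot(a.x - b.x, a.y - b.y) for a in pts for b in pts) or 1.0
    result = []
    for o in objects:
        s_q = sim_st(o, query, alpha, max_dist)
        closer = sum(
            1 for p in objects
            if p is not o and sim_st(o, p, alpha, max_dist) > s_q
        )
        if closer < k:
            result.append(o.name)
    return result


if __name__ == "__main__":
    shops = [
        Obj("a", 0.0, 0.0, frozenset({"cafe"})),
        Obj("b", 1.0, 0.0, frozenset({"cafe"})),
        Obj("c", 10.0, 0.0, frozenset({"bar"})),
    ]
    q = Obj("q", 0.5, 0.0, frozenset({"cafe"}))
    print(rsk_knn(q, shops, k=1))  # ['a', 'b']
```

In this toy data, object `c` is excluded for k = 1 because `b` is more similar to it than the query is. An index such as the IUR-tree aims to avoid this exhaustive comparison by pruning groups of objects with similarity bounds.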

Figures

Fig. 15. Experimental results on the GN dataset

Fig. 8. Illustration to RSKkNN algorithm

Fig. 5. Illustration of spatial approximation

Fig. 2. Example for illustrating the relationship of RSkNN, RKkNN and RSKkNN

Fig. 10. Illustration for the maximal spatial distances between entries and minimal spatial distances between query object q and entries at level l.

Citations

•Journal Article

ACM Transactions on Database Systems

Dan Suciu, +1 more

- 01 Jan 2005

- ACM Transactions on Database Systems

425

Journal Article•10.1007/S10707-019-00373-Y

Spatial keyword search: a survey

Lisi Chen, +3 more

- 01 Jan 2020

- Geoinformatica

TL;DR: This survey summarizes the findings of existing spatial keyword search studies, thus uncovering new insights that may guide software engineers as well as further research.

91

Journal Article•10.1109/TKDE.2014.2324897

Best Keyword Cover Search

Ke Deng, +3 more

- 01 Jan 2015

- IEEE Transactions on Knowledge and Data ...

TL;DR: A generic version of Closest Keywords search, called Best Keyword Cover, is investigated. It considers inter-object distance as well as the keyword ratings of objects, motivated by the increasing availability and importance of keyword ratings in object evaluation for better decision making.

68

•Proceedings Article•10.1109/ICDE48307.2020.00091

Parallel Semantic Trajectory Similarity Join

Lisi Chen, +4 more

- 20 Apr 2020

TL;DR: An efficient divide-and-conquer algorithm is proposed to derive bounds on the spatial similarity and textual similarity between two semantic trajectories, which enable us to prune dissimilar trajectory pairs without computing the exact value of spatio-textual similarity.

59

Journal Article•10.1007/S00778-021-00661-W

Location- and keyword-based querying of geo-textual data: a survey

Zhida Chen, +3 more

- 30 Mar 2021

TL;DR: A survey of both the research problems studied and the solutions proposed in these two settings is offered, which aims to offer the reader a first understanding of key concepts and techniques underlying proposed solutions to the querying of geo-textual data.

44

References

•Proceedings Article

A density-based algorithm for discovering clusters in large spatial databases with noise

Martin Ester, +3 more

- 02 Aug 1996

TL;DR: In this paper, a density-based notion of clusters is proposed to discover clusters of arbitrary shape. It can be used for class identification in large spatial databases and is shown to be more efficient than the well-known algorithm CLARANS.

20.3K

•Journal Article•10.1214/AOMS/1177729694

On Information and Sufficiency

Solomon Kullback, +1 more

- 01 Mar 1951

- Annals of Mathematical Statistics

19.8K

•Proceedings Article

A density-based algorithm for discovering clusters in large spatial Databases with Noise

Martin Ester, +3 more

- 01 Jan 1996

TL;DR: DBSCAN, a new clustering algorithm relying on a density-based notion of clusters which is designed to discover clusters of arbitrary shape, is presented which requires only one input parameter and supports the user in determining an appropriate value for it.

17.8K

•Journal Article•10.1016/0306-4573(88)90021-0

Term Weighting Approaches in Automatic Text Retrieval

Gerard Salton, +1 more

- 01 Aug 1988

- Information Processing and Management

TL;DR: This paper summarizes the insights gained in automatic term weighting, and provides baseline single term indexing models with which other more elaborate content analysis procedures can be compared.

10.5K

Proceedings Article•10.1145/602259.602266

R-trees: a dynamic index structure for spatial searching

Antonin Guttman

- 01 Jun 1984

TL;DR: A dynamic index structure called an R-tree is described, algorithms for searching and updating it are given, and it is concluded that the structure is useful for current database systems in spatial applications.

8K

Related Papers (5)

Efficient processing of top-k spatial keyword queries

Keyword Search on Spatial Databases

Keyword Search in Spatial Databases: Towards Searching by Document

R-trees: a dynamic index structure for spatial searching

Efficient and scalable method for processing top-k spatial Boolean queries