iDistance

Topic Tools

Papers published on a yearly basis

Papers

Proceedings Article•10.1145/602259.602266•

R-trees: a dynamic index structure for spatial searching

[...]

Antonin Guttman¹•Institutions (1)

University of California, Berkeley¹

1 Jun 1984

TL;DR: A dynamic index structure called an R-tree is described which meets this need, and algorithms for searching and updating it are given and it is concluded that it is useful for current database systems in spatial applications.

...read moreread less

Abstract: In order to handle spatial data efficiently, as required in computer aided design and geo-data applications, a database system needs an index mechanism that will help it retrieve data items quickly according to their spatial locations However, traditional indexing methods are not well suited to data objects of non-zero size located m multi-dimensional spaces In this paper we describe a dynamic index structure called an R-tree which meets this need, and give algorithms for searching and updating it. We present the results of a series of tests which indicate that the structure performs well, and conclude that it is useful for current database systems in spatial applications

...read moreread less

8,065 citations

Proceedings Article•10.1145/93597.98741•

The R*-tree: an efficient and robust access method for points and rectangles

[...]

Norbert Beckmann¹, Hans-Peter Kriegel¹, Ralf Schneider¹, Bernhard Seeger¹•Institutions (1)

University of Bremen¹

1 May 1990

TL;DR: The R*-tree is designed which incorporates a combined optimization of area, margin and overlap of each enclosing rectangle in the directory which clearly outperforms the existing R-tree variants.

...read moreread less

Abstract: The R-tree, one of the most popular access methods for rectangles, is based on the heuristic optimization of the area of the enclosing rectangle in each inner node. By running numerous experiments in a standardized testbed under highly varying data, queries and operations, we were able to design the R*-tree which incorporates a combined optimization of area, margin and overlap of each enclosing rectangle in the directory. Using our standardized testbed in an exhaustive performance comparison, it turned out that the R*-tree clearly outperforms the existing R-tree variants. Guttman's linear and quadratic R-tree and Greene's variant of the R-tree. This superiority of the R*-tree holds for different types of queries and operations, such as map overlay, for both rectangles and multidimensional points in all experiments. From a practical point of view the R*-tree is very attractive because of the following two reasons 1 it efficiently supports point and spatial data at the same time and 2 its implementation cost is only slightly higher than that of other R-trees.

...read moreread less

4,923 citations

Proceedings Article•

M-tree: An Efficient Access Method for Similarity Search in Metric Spaces

[...]

Paolo Ciaccia¹, Marco Patella, Pavel Zezula•Institutions (1)

University of Bologna¹

25 Aug 1997

TL;DR: The results demonstrate that the Mtree indeed extends the domain of applicability beyond the traditional vector spaces, performs reasonably well in high-dimensional data spaces, and scales well in case of growing files.

...read moreread less

Abstract: A new access method, called M-tree, is proposed to organize and search large data sets from a generic “metric space”, i.e. where object proximity is only defined by a distance function satisfying the positivity, symmetry, and triangle inequality postulates. We detail algorithms for insertion of objects and split management, which keep the M-tree always balanced - several heuristic split alternatives are considered and experimentally evaluated. Algorithms for similarity (range and k-nearest neighbors) queries are also described. Results from extensive experimentation with a prototype system are reported, considering as the performance criteria the number of page I/O’s and the number of distance computations. The results demonstrate that the Mtree indeed extends the domain of applicability beyond the traditional vector spaces, performs reasonably well in high-dimensional data spaces, and scales well in case of growing files.

...read moreread less

1,936 citations

Proceedings Article•10.1109/ICDE.1996.492202•

Similarity indexing with the SS-tree

[...]

David A. White¹, Ramesh Jain¹•Institutions (1)

University of California, San Diego¹

26 Feb 1996

TL;DR: This work describes the fundamental types of "similarity queries" that should be supported and proposes a new dynamic structure for similarity indexing called the similarity search tree or SS-tree, which performs better than the R*-tree in nearly every test.

...read moreread less

Abstract: Efficient indexing of high dimensional feature vectors is important to allow visual information systems and a number other applications to scale up to large databases. We define this problem as "similarity indexing" and describe the fundamental types of "similarity queries" that we believe should be supported. We also propose a new dynamic structure for similarity indexing called the similarity search tree or SS-tree. In nearly every test we performed on high dimensional data, we found that this structure performed better than the R*-tree. Our tests also show that the SS-tree is much better suited for approximate queries than the R*-tree.

...read moreread less

736 citations

Journal Article•10.1145/1071610.1071612•

iDistance: An adaptive B+-tree based indexing method for nearest neighbor search

[...]

H. V. Jagadish¹, Beng Chin Ooi², Kian-Lee Tan², Cui Yu³, Rui Zhang² - Show less +1 more•Institutions (3)

University of Michigan¹, National University of Singapore², Monmouth University³

01 Jun 2005-ACM Transactions on Database Systems

TL;DR: An efficient B+-tree based indexing method for K-nearest neighbor (KNN) search in a high-dimensional metric space, called iDistance, which partitions the data based on a space- or data-partitioning strategy, and selects a reference point for each partition.

...read moreread less

Abstract: In this article, we present an efficient Bp-tree based indexing method, called iDistance, for K-nearest neighbor (KNN) search in a high-dimensional metric space. iDistance partitions the data based on a space- or data-partitioning strategy, and selects a reference point for each partition. The data points in each partition are transformed into a single dimensional value based on their similarity with respect to the reference point. This allows the points to be indexed using a Bp-tree structure and KNN search to be performed using one-dimensional range search. The choice of partition and reference points adapts the index structure to the data distribution.We conducted extensive experiments to evaluate the iDistance technique, and report results demonstrating its effectiveness. We also present a cost model for iDistance KNN search, which can be exploited in query optimization.

...read moreread less

670 citations

...

Expand

Year	Papers
2021	3
2017	1
2016	6
2015	1
2014	4
2013	7

Topic Tools

Papers published on a yearly basis

Papers

R-trees: a dynamic index structure for spatial searching

The R*-tree: an efficient and robust access method for points and rectangles

M-tree: An Efficient Access Method for Similarity Search in Metric Spaces

Similarity indexing with the SS-tree

iDistance: An adaptive B+-tree based indexing method for nearest neighbor search

Related Topics (5)

Performance Metrics