Journal Article10.1007/S007780050016
Join algorithm costs revisited
Evan P. Harris,Kotagiri Ramamohanarao +1 more
- 01 Jan 1996
- Vol. 5, Iss: 1, pp 064-084
TL;DR: Analysis of expected and experimental results of various join algorithms show that a combination of the optimal nested block and optimal GRACE hash join algorithms usually provide the greatest cost benefit, unless the relation size is a small multiple of the memory size.
read more
Abstract: A method of analysing join algorithms based upon the time required to access, transfer and perform the relevant CPU-based operations on a disk page is proposed. The costs of variations of several of the standard join algorithms, including nested block, sort-merge, GRACE hash and hybrid hash, are presented. For a given total buffer size, the cost of these join algorithms depends on the parts of the buffer allocated for each purpose. For example, when joining two relations using the nested block join algorithm, the amount of buffer space allocated for the outer and inner relations can significantly affect the cost of the join. Analysis of expected and experimental results of various join algorithms show that a combination of the optimal nested block and optimal GRACE hash join algorithms usually provide the greatest cost benefit, unless the relation size is a small multiple of the memory size. Algorithms to quickly determine a buffer allocation producing the minimal cost for each of these algorithms are presented. When the relation size is a small multiple of the amount of main memory available (typically up to three to six times), the hybrid hash join algorithm is preferable.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Improving OLAP performance by multidimensional hierarchical clustering
Volker Markl,Frank Ramsak,Rudolf Bayer +2 more
- 02 Aug 1999
TL;DR: Multidimensional hierarchical clustering (MHC) of OLAP data is introduced as a way to speed up aggregation queries without additional storage cost for materialization and performance measurements on real world data for a typical star schema are presented.
•Proceedings Article
Generalised Hash Teams for Join and Group-by
Alfons Kemper,Donald Kossmann,Christian Wiesner +2 more
- 07 Sep 1999
TL;DR: A new class of algorithms is proposed that can be used to speed up the execution of multi-way join queries or of queries that involve one or more joins and a group-by, by indirectly partitioning the input data.
Evaluating skylines in the presence of equijoins
Wen Jin,Michael Morse,Jignesh M. Patel,Martin Ester,Zengjian Hu +4 more
- 01 Mar 2010
TL;DR: A framework for evaluating skylines in the presence of equijoins is described, including the development of algorithms to answer queries over large input tables in a non-blocking, pipeline fashion, which significantly speeds up the entire query evaluation time.
Processing operations with restrictions in RDBMS without external sorting: the Tetris algorithm
Volker Markl,M. Zirkel,Rudolf Bayer +2 more
- 23 Mar 1999
TL;DR: The Tetris algorithm is presented, which utilizes restrictions to process a table in sort order of any attribute without the need for external sorting and results are produced earlier than with traditional sorting techniques allowing better response times for interactive applications and pipelined processing of the result set.
29
•Proceedings Article
Evaluating Functional Joins Along Nested Reference Sets in Object-Relational and Object-Oriented Databases
Reinhard Braumandl,Jens Claußen,Alfons Kemper +2 more
- 24 Aug 1998
TL;DR: This work develops a new functional join algorithm that can be used for any realization form for OIDs (physical or logical) and is particularly geared towards supporting functional joins along nested reference sets.
References
•Book
Sorting and Searching
Donald E. Knuth
- 01 Jan 1973
TL;DR: The first revision of this third volume is a survey of classical computer techniques for sorting and searching that extends the treatment of data structures to consider both large and small databases and internal and external memories.
1.7K
Query evaluation techniques for large databases
TL;DR: This survey describes a wide array of practical query evaluation techniques for both relational and postrelational database systems, including iterative execution of complex query evaluation plans, the duality of sort- and hash-based set-matching algorithms, types of parallel query execution and their implementation, and special operators for emerging database application domains.
•Book
Simulated Annealing and Boltzmann Machines: A Stochastic Approach to Combinatorial Optimization and Neural Computing
Emile H. L. Aarts,Jan Korst +1 more
- 01 Jan 1989
TL;DR: Combinatorial Optimization and Boltzmann Machines, Parallel Simulated Annealing Algorithms, and Neural Computing.
1.4K
Simulated Annealing and Boltzmann Machines: A Stochastic Approach to Combinatorial Optimization and Neural Computing.
TL;DR: Simulated Annealing in practice as discussed by the authors : Simulated annealing is an algorithm for parallel simulated Annealing algorithms with Boltzmann machines, which can be found in the literature.
1.3K
Implementation techniques for main memory database systems
David J. DeWitt,Randy H. Katz,Frank Olken,Leonard D. Shapiro,Michael Stonebraker,Darien Wood +5 more
- 01 Jun 1984
TL;DR: This paper considers the changes necessary to permit a relational database system to take advantage of large amounts of main memory, and evaluates AVL vs B+-tree access methods, hash-based query processing strategies vs sort-merge, and study recovery issues when most or all of the database fits in main memory.
Related Papers (5)
Laura M. Haas,Michael J. Carey,Miron Livny,Amit Shukla +3 more
- 08 Aug 1997
Priti Mishra,Margaret H. Eich +1 more
Jignesh M. Patel,Michael J. Carey,Mary K. Vernon +2 more
- 01 May 1994