Patent
Efficient multidimensional data aggregation operator implementation
Jim Gray,Donald C. Reichart +1 more
- 16 Dec 1996
182
TL;DR: An efficient implementation of a multidimensional data aggregation operator that generates all aggregates and super-aggregates for all available values in a results set. It first generates a minimal number of aggregates at the lowest possible system level, then categorizes the aggregate function being applied and applies it with the fewest possible function calls.
Abstract: An efficient implementation of a multidimensional data aggregation operator that generates all aggregates and super-aggregates for all available values in a results set by first generating a minimal number of aggregates at the lowest possible system level using a minimal number of function calls, and second categorizing the aggregate function being applied and applying the aggregate function with the fewest possible function calls. The aggregates are generated from a union of roll-ups of the n attributes to the GROUP BY clause of the SELECT statement. The number of roll-ups is minimized by including a barrel shift of the attributes being rolled up. A scoreboard array of 2^n bits is updated during the roll-up and barrel shifting process to keep track of which roll-ups are complete and which are not yet complete. Generating super-aggregates is further optimized by identifying the type of aggregate function being applied and facilitating the most efficient application of the aggregate function. An Iter_super() function is implemented to facilitate the most efficient application of algebraic aggregate functions that require access to intermediate aggregate data that heretofore was not available to any algebraic aggregation operator.
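The roll-up, barrel-shift, and scoreboard idea in the abstract can be illustrated with a small sketch. This is not the patented implementation: the function names (`plan_rollups`, `rollup_prefixes`, `barrel_shift`) and the bitmask encoding of groupings are assumptions made for illustration, and the abstract does not say how groupings left uncovered by the rotations are handled, so the sketch only reports them.

```python
from typing import List, Sequence, Tuple


def rollup_prefixes(order: Sequence[str]) -> List[Tuple[str, ...]]:
    """Groupings produced by one roll-up over `order`: every prefix, longest first."""
    return [tuple(order[:k]) for k in range(len(order), -1, -1)]


def barrel_shift(order: List[str]) -> List[str]:
    """Rotate the attribute list left by one position."""
    return order[1:] + order[:1]


def plan_rollups(attrs: Sequence[str]):
    """Plan roll-ups over barrel-shifted attribute orders (illustrative only).

    Returns (plan, missing): the attribute orders worth rolling up, and the
    groupings that no rotation covered according to the scoreboard.
    """
    n = len(attrs)
    index = {a: i for i, a in enumerate(attrs)}
    # Scoreboard: one entry per subset of attributes, 2^n in total.
    # Assumption: bit i of the subset's mask stands for attrs[i].
    scoreboard = [False] * (1 << n)
    plan = []
    order = list(attrs)
    for _ in range(n):
        covered_something_new = False
        for group in rollup_prefixes(order):
            mask = 0
            for a in group:
                mask |= 1 << index[a]
            if not scoreboard[mask]:
                scoreboard[mask] = True
                covered_something_new = True
        if covered_something_new:
            plan.append(tuple(order))
        order = barrel_shift(order)
    # Decode the masks that are still unmarked back into attribute tuples.
    missing = [
        tuple(a for i, a in enumerate(attrs) if (mask >> i) & 1)
        for mask in range(1 << n)
        if not scoreboard[mask]
    ]
    return plan, missing


if __name__ == "__main__":
    print(plan_rollups(["a", "b", "c"]))
    print(plan_rollups(["a", "b", "c", "d"]))
```

In this sketch, three attributes need three rotated roll-ups to mark all eight scoreboard entries. With four attributes the four rotations leave two groupings unmarked, (a, c) and (b, d), which shows why a scoreboard is useful for tracking what is still incomplete.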
Citations
Discovery-Driven Exploration of OLAP Data Cubes
Sunita Sarawagi,Rakesh Agrawal,Nimrod Megiddo +2 more
- 23 Mar 1998
TL;DR: A method for locating data anomalies in a k dimensional data cube that includes the steps of associating a surprise value with each cell of a data cube, and indicating a data anomaly when the surprise value associated with a cell exceeds a predetermined exception threshold.
521
Patent
Relational database management system having integrated non-relational multi-dimensional data store of aggregated data elements
Reuven Bakalash,Guy Shaked,Joseph Caspi +2 more
- 31 Mar 2009
TL;DR: In this paper, an improved method of and apparatus for joining and aggregating data elements integrated within a relational database management system (RDBMS) using a non-relational multi-dimensional data structure (MDD) is presented.
265
Patent
Data aggregation server for managing a multi-dimensional database and database management system having data aggregation server integrated therein
Reuven Bakalash,Guy Shaked,Joseph Caspi +2 more
- 28 Feb 2001
TL;DR: In this article, a stand-alone aggregation server for multidimensional databases (MDDBs) is presented, which can uniformly distribute data elements among a plurality of processors for balanced loading and processing, and therefore is highly scalable.
191
Patent
Enterprise-wide data-warehouse with integrated data aggregation engine
Reuven Bakalash,Guy Shaked,Joseph Caspi +2 more
- 09 Dec 2002
TL;DR: In this article, an enterprise-wide data-warehouse comprising a database management system (DBMS) including a relational datastore storing data in tables is described, and a query processing mechanism processes query statements, wherein, upon identifying that a given query statement is on the second reference, the query processing mechanism communicates with the aggregation module to retrieve portions of aggregated data identified by the reference that are relevant to the given query statement.
173
Patent
Accessing multi-dimensional data by mapping dense data blocks to rows in a relational database
William Earl Malloy,Gary Robinson,Craig Reginald Tomlyn +2 more
- 30 Jun 1997
TL;DR: In this paper, a method, apparatus, and article of manufacture for using a relational database management system to support on-line analytical processing (OLAP) systems is presented, where a database has a fact table and at least one related dimension table, and a dimension identifier is received that identifies a data block in a multi-dimensional database.
160
References
Data cube: a relational aggregation operator generalizing GROUP-BY, CROSS-TAB, and SUB-TOTALS
Jim Gray,A. Bosworth,A. Layman,Hamid Pirahesh +3 more
- 26 Feb 1996
TL;DR: The data cube operator generalizes the histogram, cross-tabulation, roll-up, drill-down, and sub-total constructs found in most report writers.
Implementing data cubes efficiently
Venky Harinarayan,Anand Rajaraman,Jeffrey D. Ullman +2 more
- 01 Jun 1996
TL;DR: In this article, a lattice framework is used to express dependencies among views, and greedy algorithms are presented to determine a good set of views to materialize, within a small constant factor of optimal.
Patent
Extending the semantics of the outer join operator for un-nesting queries to a data base
Murali M. Krishna
- 30 Apr 1992
TL;DR: In this paper, the semantics of the outer join operator are extended to permit the application of different predicates to the join tuples and the anti-join tuples, such that the predicate of anti-joining tuples is evaluated assuming a count value of zero.
315
Patent
Method and apparatus for storing and retrieving multi-dimensional data in computer memory.
Robert J. Earle
- 30 Mar 1993
TL;DR: In this article, a multi-level data structure is defined wherein one level contains those dimensions chosen by the user to result in dense data and the other level contains the remaining sparse data combinations.
211
Patent
Hash-based database grouping system and method
Anoop Sharma,Hansjorg Zeller +1 more
- 20 Jan 1995
TL;DR: In this paper, a structured query language (SQL) grouping and aggregation system and method that incorporates hash-based techniques, several overflow handling strategies and statistics-based process-selection criteria is presented.
155