Open AccessBook
Query optimization by semantic reasoning
Jonathan Jay King
- 01 Jan 1981
TL;DR: The thesis formally defines transformations that preserve semantic equivalence for queries in the relational calculus and identifies several classes of cost-reducing query transformations for relational database queries, and provides quantitative estimates of the improvements they can produce, based upon widely accepted models of query processing.
read more
Abstract: The problem of database query optimization is to select an efficient way to process a query expressed in logical terms from among the alternative ways it can be carried out in the physical database. This thesis presents a new approach to this problem, called semantic query optimization. The goal of semantic query optimization is to produce a semantically equivalent query that is less expensive to process than the original query.
Semantic query optimization actually transforms the original query into a new one by means of a process of inference. The transformations are limited to those that yield a semantically equivalent query, one that is guaranteed to produce the same answer as the original query in any permitted state of the database. This guarantee is achieved because the knowledge used to transform a query is the same knowledge used to insure the semantic integrity of the data stored in the database. Thus, semantic query optimization brings together the apparently separate research areas of query processing the database integrity.
The thesis also addresses an important issue in current automatic planning research: production not just of a correct solution but of a "good" one, by means of an efficient problem solver. Semantic query optimization advances the notion of a problem reformulation step for problem-solving programs. In this step, equivalent statements of the original problem are sought, one of which may have a better solution than the original problem. This method avoids explicit and possibly costly analysis of efficiency factors during planning itself.
Semantic query optimization can also be viewed as one aspect of intelligent database mediation. It applies knowledge of a problem domain and of the capabilities and limitations of the database to pose the most effective and easily processed queries to solve a user's problem.
The thesis formally defines transformations that preserve semantic equivalence for queries in the relational calculus. In addition, it identifies several classes of cost-reducing query transformations for relational database queries, and provides quantitative estimates of the improvements they can produce, based upon widely accepted models of query processing.
The thesis also discusses the design and implementation of a system that carries out semantic query optimization for an important class of relational database queries. The system is called QUIST, standing for QUery Improvement through Semantic Transformation.
The QUIST system has analyzed a range of queries for which different transformations apply. For these queries, QUIST obtains substantial reductions in the cost of processing at a negligible cost for the analysis itself.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
A predicate-based caching scheme for client-server database architectures
Arthur M. Keller,Julie Basu +1 more
- 01 Jan 1996
TL;DR: Lower query-response times, reduced message traffic, higher server throughput, and better scalability are some of the expected benefits of this approach over commonly used relational server-side and object ID-based or page-based client-side caching.
•Book
Query processing in the SIMS information mediator
Yigal Arens,Chun-Nan Hsu,Craig A. Knoblock +2 more
- 01 Oct 1997
TL;DR: A flexible and efficient information mediator that takes a domain-level query and dynamically selects the appropriate information sources based on their content and availability, generates a query access plan that specifies the operations and their order for processing the data, and then performs semantic query optimization to minimize the overall execution time is described.
•Book
Optimizing datalog programs
Yehoshua Sagiv
- 01 Aug 1988
TL;DR: In this paper, the equivalence problem for Datalog programs is shown to be decidable and an algorithm is given for minimizing a DATALOG program under uniform equivalence.
209
A predicate-based caching scheme for client-server database architectures
Arthur M. Keller,Julie Basu +1 more
- 28 Sep 1994
TL;DR: This work proposes a new client-side data caching scheme for relational databases with a central server and multiple clients, and examines various performance and optimization issues involved in addressing the questions of cache currency and completeness using predicate descriptions.
Optimizing datalog programs
Yehoshua Sagiv
- 01 Jun 1987
TL;DR: In this article, the equivalence problem for Datalog programs is shown to be decidable and an algorithm is given for minimizing a DATALOG program under uniform equivalence.
165
References
The entity-relationship model: toward a unified view of data
Peter P. Chen
- 22 Sep 1975
TL;DR: A data model, called the entity-relationship model, which incorporates the semantic information in the real world is proposed, and a special diagramatic technique is introduced for exhibiting entities and relationships.
3.7K
Ubiquitous B-Tree
TL;DR: The major variations of the B-tree are discussed, especially the B+-tree, contrasting the merits and costs of each implementation and illustrating a general purpose access method that uses a B- tree.
2.1K
The design and implementation of INGRES
TL;DR: The currently operational (March 1976) version of the INGRES database management system is described in this article, which gives a relational view of data, supports two high level nonprocedural data sublanguages, and runs as a collection of user processes on top of the UNIX operating system for Digital Equipment Corporation PDP 11/40, 11/45, and 11/70 computers.
Relational completeness of data base sublanguages
E. F. Codd
- 01 Jan 2000
TL;DR: This paper attempts to provide a theoretical basis which may be used to determine how complete a selection capability is provided in a proposed data sublanguage independently of any host language in which the sublanguage may be embedded.
860
Prolegomena to a theory of mechanized formal reasoning
TL;DR: This is an informal description of my ideas about using formal logic as a tool for reasoning systems using computers, illustrated by the features of FOL.
440
Related Papers (5)
Michael Hammer,Stanley B. Zdonik +1 more
- 01 Oct 1980
Matthias Jarke,Jürgen Koch +1 more
Jeffrey D. Ullman
- 01 Jan 1979
Sreekumar T. Shenoy,Z. Meral Ozsoyoglu +1 more
- 01 Dec 1987