Query optimization by semantic reasoning

Open AccessBook

Query optimization by semantic reasoning

- 01 Jan 1981

125

TL;DR: The thesis formally defines transformations that preserve semantic equivalence for queries in the relational calculus and identifies several classes of cost-reducing query transformations for relational database queries, and provides quantitative estimates of the improvements they can produce, based upon widely accepted models of query processing.

Abstract: The problem of database query optimization is to select an efficient way to process a query expressed in logical terms from among the alternative ways it can be carried out in the physical database. This thesis presents a new approach to this problem, called semantic query optimization. The goal of semantic query optimization is to produce a semantically equivalent query that is less expensive to process than the original query. Semantic query optimization actually transforms the original query into a new one by means of a process of inference. The transformations are limited to those that yield a semantically equivalent query, one that is guaranteed to produce the same answer as the original query in any permitted state of the database. This guarantee is achieved because the knowledge used to transform a query is the same knowledge used to insure the semantic integrity of the data stored in the database. Thus, semantic query optimization brings together the apparently separate research areas of query processing the database integrity. The thesis also addresses an important issue in current automatic planning research: production not just of a correct solution but of a "good" one, by means of an efficient problem solver. Semantic query optimization advances the notion of a problem reformulation step for problem-solving programs. In this step, equivalent statements of the original problem are sought, one of which may have a better solution than the original problem. This method avoids explicit and possibly costly analysis of efficiency factors during planning itself. Semantic query optimization can also be viewed as one aspect of intelligent database mediation. It applies knowledge of a problem domain and of the capabilities and limitations of the database to pose the most effective and easily processed queries to solve a user's problem. The thesis formally defines transformations that preserve semantic equivalence for queries in the relational calculus. In addition, it identifies several classes of cost-reducing query transformations for relational database queries, and provides quantitative estimates of the improvements they can produce, based upon widely accepted models of query processing. The thesis also discusses the design and implementation of a system that carries out semantic query optimization for an important class of relational database queries. The system is called QUIST, standing for QUery Improvement through Semantic Transformation. The QUIST system has analyzed a range of queries for which different transformations apply. For these queries, QUIST obtains substantial reductions in the cost of processing at a negligible cost for the analysis itself.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.1007/S007780050014

A predicate-based caching scheme for client-server database architectures

Arthur M. Keller, +1 more

- 01 Jan 1996

TL;DR: Lower query-response times, reduced message traffic, higher server throughput, and better scalability are some of the expected benefits of this approach over commonly used relational server-side and object ID-based or page-based client-side caching.

...read moreread less

296

•Book

Query processing in the SIMS information mediator

Yigal Arens, +2 more

- 01 Oct 1997

TL;DR: A flexible and efficient information mediator that takes a domain-level query and dynamically selects the appropriate information sources based on their content and availability, generates a query access plan that specifies the operations and their order for processing the data, and then performs semantic query optimization to minimize the overall execution time is described.

...read moreread less

248

•Book

Optimizing datalog programs

Yehoshua Sagiv

- 01 Aug 1988

TL;DR: In this paper, the equivalence problem for Datalog programs is shown to be decidable and an algorithm is given for minimizing a DATALOG program under uniform equivalence.

...read moreread less

209

•Proceedings Article•10.1109/PDIS.1994.331711

A predicate-based caching scheme for client-server database architectures

Arthur M. Keller, +1 more

- 28 Sep 1994

TL;DR: This work proposes a new client-side data caching scheme for relational databases with a central server and multiple clients, and examines various performance and optimization issues involved in addressing the questions of cache currency and completeness using predicate descriptions.

...read moreread less

193

Proceedings Article•10.1145/28659.28696

Optimizing datalog programs

Yehoshua Sagiv

- 01 Jun 1987

TL;DR: In this article, the equivalence problem for Datalog programs is shown to be decidable and an algorithm is given for minimizing a DATALOG program under uniform equivalence.

...read moreread less

165

...

Expand

References

Proceedings Article•10.1145/1282480.1282492

The entity-relationship model: toward a unified view of data

Peter P. Chen

- 22 Sep 1975

TL;DR: A data model, called the entity-relationship model, which incorporates the semantic information in the real world is proposed, and a special diagramatic technique is introduced for exhibiting entities and relationships.

...read moreread less

3.7K

Journal Article•10.1145/356770.356776

Ubiquitous B-Tree

Douglas Comer

- 01 Jun 1979

- ACM Computing Surveys

TL;DR: The major variations of the B-tree are discussed, especially the B+-tree, contrasting the merits and costs of each implementation and illustrating a general purpose access method that uses a B- tree.

...read moreread less

2.1K

•Journal Article•10.1145/320473.320476

The design and implementation of INGRES

Michael Stonebraker, +3 more

- 01 Sep 1976

- ACM Transactions on Database Systems

TL;DR: The currently operational (March 1976) version of the INGRES database management system is described in this article, which gives a relational view of data, supports two high level nonprocedural data sublanguages, and runs as a collection of user processes on top of the UNIX operating system for Digital Equipment Corporation PDP 11/40, 11/45, and 11/70 computers.

...read moreread less

957

Relational completeness of data base sublanguages

E. F. Codd

- 01 Jan 2000

TL;DR: This paper attempts to provide a theoretical basis which may be used to determine how complete a selection capability is provided in a proposed data sublanguage independently of any host language in which the sublanguage may be embedded.

...read moreread less

860

Journal Article•10.1016/0004-3702(80)90015-6

Prolegomena to a theory of mechanized formal reasoning

Richard W. Weyhrauch

- 01 Apr 1980

- Artificial Intelligence

TL;DR: This is an informal description of my ideas about using formal logic as a tool for reasoning systems using computers, illustrated by the features of FOL.

...read moreread less

440