Database patchwork on the Internet

doi:10.1145/304181.304573

Journal Article10.1145/304181.304573

Database patchwork on the Internet

Reinhard Braumandl, +2 more

- 01 Jun 1999

- Vol. 28, Iss: 2, pp 550-552

7

TL;DR: A distributed database system to be used in the Internet capable of dynamically loading (external) query operators, written in Java and supplied by any function provider, and executing these query operators in concert with pre-defined and other external operators in order to evaluate a query.

Abstract: Naturally, data processing requires three kinds of resources: the data itself,the functionality (i.e. database operations) andthe machines on which to run the operations.Because of the Internet we believe that in the long run there will be alternative providers for all of these three resources for any given application. Data providers will bring more and more data and more and more different kinds of data to the net. Likewise, function providers will develop new methods to process and work with the data; e.g., function providers might develop new algorithms to compress data or to produce thumbnails out of large images and try to sell these on the Internet. It is also conceivable, that some people allow other people to use spare cycles of their idle machines in the Internet (as in the Condor system of the University of Wisconsin) or that some companies (cycle providers) even specialize on selling computing time to businesses that occasionally need to carry out very complex operations for which regular hardware is not sufficient.At the University of Passau, we are currently developing a distributed database system to be used in the Internet. The goal is to ultimately have a system which is able to run on any machine, manage any kind of data, import any kind of data from other systems and import any kind of database operations. The system is entirely written in Java. One of the most important features of the system is that it is capable of dynamically loading (external) query operators, written in Java and supplied by any function provider, and executing these query operators in concert with pre-defined and other external operators in order to evaluate a query. Compared to object-relational database systems, which allow to integrate external data and functionality by the means of extensions (datablades, extenders or cartridges) or heterogeneous database systems such as Garlic [MS97] or Tsimmis [GMPQ+97], our approach makes it possible to place external query operators anywhere in a query evaluation plan as opposed to restricting the placement of external operations to the “access level” of plans. It would, for example, be possible to make our system execute a completely new relational join method, if somebody finds a new join method which is worth-while implementing. Because our system is written in Java, it is highly portable and could be used by data, function and cycle providers with almost no effort. Furthermore, our query engine is, of course, completely distributed providing all the required infrastructure for server-server communication, name services, etc.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1145/371578.371598

The state of the art in distributed query processing

Donald Kossmann

- 01 Dec 2000

- ACM Computing Surveys

TL;DR: The paper presents the “textbook” architecture for distributed query processing and a series of techniques that are particularly useful for distributed database systems, and discusses different kinds of distributed systems such as client-server, middleware (multitier), and heterogeneous database systems and shows how query processing works in these systems.

...read moreread less

1K

Proceedings Article•10.1109/ICDE.2001.914872

Integrating semi-join-reducers into state-of-the-art query processors

Konrad Stocker, +3 more

- 02 Apr 2001

TL;DR: This paper shows that semi-join reducers can indeed be beneficial in modern client-server or middleware systems - either to reduce communication costs or to better exploit all the resources of a system.

...read moreread less

72

Quality of service and optimization in data integration systems

Reinhard Braumandl

- 01 Jan 2003

TL;DR: In this paper, the authors present techniques for the construction of a global data integrations system that allows declarative queries in order to express user-specific information needs, and show how service composition, extensibility and quality of service can be supported in an open system of providers for data, functionality for query processing operations and computing power.

...read moreread less

7

Book Chapter•10.1007/978-3-642-56687-5_10

Verteilte Metadatenverwaltung für die Anfragebearbeitung auf Internet-Datenquellen

Markus Keidl, +3 more

- 07 Mar 2001

TL;DR: Wir zeigen anhand des offenen, verteilten Anfragebearbeitungssystems ObjectGlobe, wie die MDV das Auffinden wichtiger Ressourcen fur die Anfrageschubergreifender Datenverarbeitung auf Internet-Datenquellen ermoglicht ist.

...read moreread less

6

•Journal Article

ObjectGlobe: Integrating Data Processing Services on the Internet.

Reinhard Braumandl, +5 more

- 01 Jan 1999

- Datenbank Rundbrief

TL;DR: Because of the Internet, in the long run there will be alternative providers for all of these three resources for any given application: data providers, function providers and commercial cycle providers.

...read moreread less

References

•Journal Article•10.1145/152610.152611

Query evaluation techniques for large databases

Goetz Graefe

- 01 Jun 1993

- ACM Computing Surveys

TL;DR: This survey describes a wide array of practical query evaluation techniques for both relational and postrelational database systems, including iterative execution of complex query evaluation plans, the duality of sort- and hash-based set-matching algorithms, types of parallel query execution and their implementation, and special operators for emerging database application domains.

...read moreread less

1.5K

•Proceedings Article

Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources

Mary Roth, +1 more

- 25 Aug 1997

TL;DR: The architecture for wrappers, key components of Garlic that encapsulate data sources and mediate between them and the middleware are described, which shows that Garlic wrappers can be written quickly and that the architecture is flexible enough to accommodate data sources with a variety of data models and a broad range of traditional and non-traditional query processing capabilities.

...read moreread less

469

Journal Article•10.1023/A:1008683107812

The TSIMMIS Approach to Mediation: Data Models and Languages

Hector Garcia-Molina, +7 more

- 31 Mar 1997

TL;DR: TSIMMIS—The Stanford-IBM Manager of Multiple Information sources offers a datamodel and a common query language that are designed to support the combining of information from many different sources.

...read moreread less

•Proceedings Article

Optimizing Queries Across Diverse Data Sources

Laura M. Haas, +3 more

- 25 Aug 1997

TL;DR: This work presents the design of a query optimizer for Garlic, a middleware system designed to integrate data from a broad range of data sources with very different query capabilities, and describes the design and implementation.

...read moreread less

Proceedings Article•10.1145/93597.98720

Encapsulation of parallelism in the Volcano query processing system

Goetz Graefe

- 01 May 1990

TL;DR: The reasons for not choosing the bracket model, the novel operator model, and details of Volcano's exchange operator that parallelizes all other operators are described, which makes implementation of parallel database algorithms significantly easier and more robust.

...read moreread less