Journal Article10.1145/304181.304573
Database patchwork on the Internet
Reinhard Braumandl,Alfons Kemper,Donald Kossmann +2 more
- 01 Jun 1999
- Vol. 28, Iss: 2, pp 550-552
TL;DR: A distributed database system to be used in the Internet capable of dynamically loading (external) query operators, written in Java and supplied by any function provider, and executing these query operators in concert with pre-defined and other external operators in order to evaluate a query.
read more
Abstract: Naturally, data processing requires three kinds of resources: the data itself,the functionality (i.e. database operations) andthe machines on which to run the operations.Because of the Internet we believe that in the long run there will be alternative providers for all of these three resources for any given application. Data providers will bring more and more data and more and more different kinds of data to the net. Likewise, function providers will develop new methods to process and work with the data; e.g., function providers might develop new algorithms to compress data or to produce thumbnails out of large images and try to sell these on the Internet. It is also conceivable, that some people allow other people to use spare cycles of their idle machines in the Internet (as in the Condor system of the University of Wisconsin) or that some companies (cycle providers) even specialize on selling computing time to businesses that occasionally need to carry out very complex operations for which regular hardware is not sufficient.At the University of Passau, we are currently developing a distributed database system to be used in the Internet. The goal is to ultimately have a system which is able to run on any machine, manage any kind of data, import any kind of data from other systems and import any kind of database operations. The system is entirely written in Java. One of the most important features of the system is that it is capable of dynamically loading (external) query operators, written in Java and supplied by any function provider, and executing these query operators in concert with pre-defined and other external operators in order to evaluate a query. Compared to object-relational database systems, which allow to integrate external data and functionality by the means of extensions (datablades, extenders or cartridges) or heterogeneous database systems such as Garlic [MS97] or Tsimmis [GMPQ+97], our approach makes it possible to place external query operators anywhere in a query evaluation plan as opposed to restricting the placement of external operations to the “access level” of plans. It would, for example, be possible to make our system execute a completely new relational join method, if somebody finds a new join method which is worth-while implementing. Because our system is written in Java, it is highly portable and could be used by data, function and cycle providers with almost no effort. Furthermore, our query engine is, of course, completely distributed providing all the required infrastructure for server-server communication, name services, etc.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
The state of the art in distributed query processing
TL;DR: The paper presents the “textbook” architecture for distributed query processing and a series of techniques that are particularly useful for distributed database systems, and discusses different kinds of distributed systems such as client-server, middleware (multitier), and heterogeneous database systems and shows how query processing works in these systems.
Integrating semi-join-reducers into state-of-the-art query processors
Konrad Stocker,Donald Kossmann,R. Braumandi,Alfons Kemper +3 more
- 02 Apr 2001
TL;DR: This paper shows that semi-join reducers can indeed be beneficial in modern client-server or middleware systems - either to reduce communication costs or to better exploit all the resources of a system.
72
Quality of service and optimization in data integration systems
Reinhard Braumandl
- 01 Jan 2003
TL;DR: In this paper, the authors present techniques for the construction of a global data integrations system that allows declarative queries in order to express user-specific information needs, and show how service composition, extensibility and quality of service can be supported in an open system of providers for data, functionality for query processing operations and computing power.
Verteilte Metadatenverwaltung für die Anfragebearbeitung auf Internet-Datenquellen
Markus Keidl,Alexander Kreutz,Alfons Kemper,Donald Kossmann +3 more
- 07 Mar 2001
TL;DR: Wir zeigen anhand des offenen, verteilten Anfragebearbeitungssystems ObjectGlobe, wie die MDV das Auffinden wichtiger Ressourcen fur die Anfrageschubergreifender Datenverarbeitung auf Internet-Datenquellen ermoglicht ist.
6
•Journal Article
ObjectGlobe: Integrating Data Processing Services on the Internet.
Reinhard Braumandl,Alfons Kemper,Donald Kossmann,Alexander Kreutz,Stefan Pröls,Konrad Stocker +5 more
TL;DR: Because of the Internet, in the long run there will be alternative providers for all of these three resources for any given application: data providers, function providers and commercial cycle providers.
References
Query evaluation techniques for large databases
TL;DR: This survey describes a wide array of practical query evaluation techniques for both relational and postrelational database systems, including iterative execution of complex query evaluation plans, the duality of sort- and hash-based set-matching algorithms, types of parallel query execution and their implementation, and special operators for emerging database application domains.
•Proceedings Article
Don't Scrap It, Wrap It! A Wrapper Architecture for Legacy Data Sources
Mary Roth,Peter Schwarz +1 more
- 25 Aug 1997
TL;DR: The architecture for wrappers, key components of Garlic that encapsulate data sources and mediate between them and the middleware are described, which shows that Garlic wrappers can be written quickly and that the architecture is flexible enough to accommodate data sources with a variety of data models and a broad range of traditional and non-traditional query processing capabilities.
The TSIMMIS Approach to Mediation: Data Models and Languages
Hector Garcia-Molina,Yannis Papakonstantinou,Dallan Quass,Anand Rajaraman,Yehoshua Sagiv,Jeffrey D. Ullman,Vasilis Vassalos,Jennifer Widom +7 more
- 31 Mar 1997
TL;DR: TSIMMIS—The Stanford-IBM Manager of Multiple Information sources offers a datamodel and a common query language that are designed to support the combining of information from many different sources.
•Proceedings Article
Optimizing Queries Across Diverse Data Sources
Laura M. Haas,Donald Kossmann,Edward L. Wimmers,Jun Yang +3 more
- 25 Aug 1997
TL;DR: This work presents the design of a query optimizer for Garlic, a middleware system designed to integrate data from a broad range of data sources with very different query capabilities, and describes the design and implementation.
Encapsulation of parallelism in the Volcano query processing system
Goetz Graefe
- 01 May 1990
TL;DR: The reasons for not choosing the bracket model, the novel operator model, and details of Volcano's exchange operator that parallelizes all other operators are described, which makes implementation of parallel database algorithms significantly easier and more robust.