Open Access
Monitoring Stream Properties for Continuous Query Processing
Utkarsh Srivastava, Shivnath Babu, Jennifer Widom +2 more
- 01 Jan 2003
6
TL;DR: Describes a general-purpose Data Stream Management System for processing continuous queries over multiple continuous data streams, in which the query optimizer creates an initial query plan and allocates initial resources, such as memory for join or aggregation synopses and for input and inter-operator queues.
Abstract: We are developing a general-purpose Data Stream Management System for processing continuous queries over multiple continuous data streams. When a new continuous query is registered, our query optimizer creates an initial query plan (possibly merged with existing plans for previously registered queries), and allocates initial resources, such as memory for join or aggregation synopses and for input and inter-operator queues. Dynamically, plans may be altered, resources may be reallocated, and scheduling decisions are made, all based on stream data and arrival characteristics, query plan execution behavior, and resource utilization.
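The idea of reallocating resources based on observed arrival characteristics can be illustrated with a minimal sketch. This is not the system described in the paper: the class `ArrivalMonitor`, the function `reallocate_queue_memory`, the stream names `S1` and `S2`, and the proportional-split policy are all hypothetical, chosen only to show one way monitored arrival counts could drive a memory decision.

```python
from collections import deque


class ArrivalMonitor:
    """Hypothetical monitor: tracks tuple arrivals for one stream
    over a sliding time window of fixed length."""

    def __init__(self, window_seconds):
        self.window_seconds = window_seconds
        self.arrivals = deque()  # timestamps of tuples still in the window

    def record(self, timestamp):
        """Register one tuple arrival (timestamps must be non-decreasing)."""
        self.arrivals.append(timestamp)
        self._expire(timestamp)

    def _expire(self, now):
        # Drop timestamps at or before the window's left edge.
        cutoff = now - self.window_seconds
        while self.arrivals and self.arrivals[0] <= cutoff:
            self.arrivals.popleft()

    def count(self, now):
        """Number of tuples that arrived within the window ending at `now`."""
        self._expire(now)
        return len(self.arrivals)

    def rate(self, now):
        """Average arrival rate (tuples per second) over the window."""
        return self.count(now) / self.window_seconds


def reallocate_queue_memory(counts, total_bytes):
    """Assumed policy: split queue memory in proportion to recent arrivals.
    Falls back to an even split when no tuples were observed."""
    total = sum(counts.values())
    if total == 0:
        return {name: total_bytes // len(counts) for name in counts}
    return {name: total_bytes * c // total for name, c in counts.items()}


monitors = {"S1": ArrivalMonitor(10.0), "S2": ArrivalMonitor(10.0)}
for t in range(10):        # S1: one tuple per second
    monitors["S1"].record(float(t))
for t in range(0, 10, 5):  # S2: one tuple every five seconds
    monitors["S2"].record(float(t))

counts = {name: m.count(now=9.0) for name, m in monitors.items()}
allocation = reallocate_queue_memory(counts, total_bytes=12000)
print(allocation)
```

A real system would also weigh synopsis memory, plan execution behavior, and utilization, as the abstract notes. The proportional split here is only the simplest policy that reacts to a monitored property.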
Citations
•Journal Article
Extension of CQL over Dynamic Databases
TL;DR: A short description of CQL, a characterization of update problems, and possible suggestions for the semantic extension of CQL are offered.
54
Punctuated data streams
Peter A. Tucker, David Maier +1 more
- 01 Jan 2005
TL;DR: It is shown that a query benefits from an input punctuation scheme (in terms of being able to produce a given output scheme), if each set in the groupings induced by the operators of the query is covered by a finite number of punctuations in the scheme—a kind of compactness.
23
•Dissertation
Efficient Algorithms for Mining Data Streams
Arnold P. Boedihardjo, Chang-Tien Lu, Weiguo Fan, Yao Liang, Naren Ramakrishnan +4 more
- 10 Aug 2010
TL;DR: In this work, kernel density estimators (KDEs) are developed that satisfy the stringent computational stipulations of data streams, model unknown and dynamic distributions, and enhance the estimation quality of complex structures.
6
Parallel processing of continuous data streams
A. Buza
- 12 Dec 2005
TL;DR: Data stream processing is described as a processing program that continuously reads the data stream records and, after some delay, continuously produces the answer; the process produces a correct answer without loss of data.
1
Design and Implementation of a Data Stream Load Shedding Model
Dan Wang, Maozeng Li +1 more
- 18 Nov 2008
TL;DR: The experimental results show that the presented load shedding model effectively avoids low processing efficiency when the system is overloaded, keeps the arriving data in step with the system's processing capability, and thereby balances system stability and correctness.
References
•Book
A Probabilistic Theory of Pattern Recognition
Luc Devroye, László Györfi, Gábor Lugosi +2 more
- 01 Jan 1996
TL;DR: The Bayes Error and Vapnik-Chervonenkis theory are applied as guide for empirical classifier selection on the basis of explicit specification and explicit enforcement of the maximum likelihood principle.
Eddies: continuously adaptive query processing
Ron Avnur, Joseph M. Hellerstein +1 more
- 16 May 2000
TL;DR: This paper introduces a query processing mechanism called an eddy, which continuously reorders operators in a query plan as it runs, and describes the moments of symmetry during which pipelined joins can be easily reordered, and the synchronization barriers that require inputs from different sources to be coordinated.
Maintaining Stream Statistics over Sliding Windows
TL;DR: The problem of maintaining aggregates and statistics over data streams, with respect to the last $N$ data elements seen so far, is considered, and it is shown that, using $O(\frac{1}{\epsilon} \log^2 N)$ bits of memory, the number of 1's can be estimated to within a factor of $1 + \epsilon$.
991
Maximizing the output rate of multi-way join queries over streaming information sources
Stratis D. Viglas, Jeffrey F. Naughton, Josef Burger +2 more
- 09 Sep 2003
TL;DR: The results show that in many instances the MJoin produces outputs sooner than any tree of binary operators, which suggests that supporting multiway joins in a single, symmetric, streaming operator may be a useful addition to systems that support queries over input streams from remote sites.
•Proceedings Article
Querying and Mining Data Streams: You Only Get One Look.
Minos Garofalakis, Johannes Gehrke +1 more
- 01 Jan 2002
TL;DR: In this article, the authors propose a query processing algorithm for continuous data streams, which can summarize the data stream(s) involved in a concise, but reasonably accurate, synopsis that can be stored in the allotted (small) amount of memory and can be used to provide approximate answers to user queries along with some reasonable guarantees on the quality of the approximation.
286