Open AccessProceedings Article
Inter-Operator Feedback in Data Stream Management Systems via Punctuation
Rafael J Fernandez-Moctezuma,Kristin Tufte,Jin Li +2 more
- 14 Sep 2009
TL;DR: In this paper, the authors present a comprehensive framework designed to support prioritization, avoidance of unnecessary work, and on-demand result production over distributed, unreliable, bursty, disordered data sources, typical of many streams.
read more
Abstract: high-speed data streams may overwhelm the capabilities of stream processing systems; techniques such as data prioritization, avoidance of unnecessary processing and on- demand result production may be necessary to reduce processing requirements. However, the dynamic nature of data streams, in terms of both rate and content, makes the application of such techniques challenging. Such techniques have been addressed in the context of static and centralized query optimization; however, they have not been fully addressed for data-stream management systems. In this work, we present a comprehensive framework designed to support prioritization, avoidance of unnecessary work, and on-demand result production over distributed, unreliable, bursty, disordered data sources, typical of many streams. We propose a form of inter-operator feedback, which flows against the stream direction, to communicate the information needed to enable execution of these techniques. This feedback leverages punctuations to describe the subsets of interest. We identify potential sources of feedback information, characterize new types of punctuation to support feedback, and describe the roles of producers, exploiters, and relayers of feedback that query operators may implement. We also present initial experimental observations using the NiagaraST data-stream system.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Book
Synthesis Lectures on Data Management
M. Tamer Özsu
- 01 Jan 2010
TL;DR: This lecture gives an overview of recent research in stream processing, ranging from answering simple queries on high-speed streams to loading real-time data feeds into a streaming warehouse for off-line analysis.
Patent
Complex event processor for historic/live/replayed data
Laurent Bussard,Ivo Jose Garcia Dos Santos,Olivier Nano,Tihomir Tarnavski,Jonathan Goldstein,Badrish Chandramouli,Lev Novik +6 more
- 02 Apr 2015
TL;DR: In this article, a complex event processor is described which has a communications interface configured to retrieve event data by pulling it from one or more sources and to receive at least one live event stream pushed to the interface.
44
•Dissertation
Système de gestion de flux pour l'Internet des objets intelligents
Benjamin Billet
- 19 Mar 2015
TL;DR: In this paper, a prototype of Dioptase is presented, and the authors evaluate the performance of the prototype and propose a solution for the realisation of a systeme distribue de gestion de flux de donnees for l'Internet des objects.
Towards execution guarantees for stream queries
Rafael J Fernandez-Moctezuma,David Maier,Kristin Tufte +2 more
- 19 Apr 2010
TL;DR: This work looks at deriving execution guarantees with respect to result production and state management for complete queries over punctuated streams, and introduces a framework, punctuation contracts, for analyzing data processing and punctuation propagation from input to output on individual operators.
4
Physically Independent Stream Merging
Badrish Chandramouli,David Maier,Jonathan Goldstein +2 more
- 01 Apr 2012
TL;DR: This paper introduces a new stream operator called Logical Merge (LMerge) that takes multiple logically consistent streams as input and outputs a single stream that is compatible with all of them, and shows that LMerge and its extensions can provide performance benefits in several real-world applications.
References
•Proceedings Article
The Design of the Borealis Stream Processing Engine
Daniel J. Abadi,Yanif Ahmad,Magdalena Balazinska,Mitch Cherniack,Jeong-Hyon Hwang,Wolfgang Lindner,Anurag S. Maskey,Alexander Rasin,Esther Ryvkina,Nesime Tatbul,Ying Xing,Stan Zdonik +11 more
- 01 Jan 2005
TL;DR: This paper outlines the basic design and functionality of Borealis, and presents a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.
•Proceedings Article
TelegraphCQ: Continuous Dataflow Processing for an Uncertain World.
Sirish Chandrasekaran,Owen Cooper,Amol Deshpande,Michael J. Franklin,Joseph M. Hellerstein,Wei Hong,Sailesh Krishnamurthy,Samuel Madden,Vijayshankar Raman,Frederick Reiss,Mehul A. Shah +10 more
- 01 Jan 2003
TL;DR: The next generation Telegraph system, called TelegraphCQ, is focused on meeting the challenges that arise in handling large streams of continuous queries over high-volume, highly-variable data streams and leverages the PostgreSQL open source code base.
1.2K
Eddies: continuously adaptive query processing
Ron Avnur,Joseph M. Hellerstein +1 more
- 16 May 2000
TL;DR: This paper introduces a query processing mechanism called an eddy, which continuously reorders operators in a query plan as it runs, and describes the moments of symmetry during which pipelined joins can be easily reordered, and the synchronization barriers that require inputs from different sources to be coordinated.
Out-of-order processing: a new architecture for high-performance stream systems
Jin Li,Kristin Tufte,Vladislav Shkapenyuk,Vassilis Papadimos,Theodore Johnson,David Maier +5 more
- 01 Aug 2008
TL;DR: This work introduces a new architecture for stream systems, out-of-order processing (OOP), that avoids ordering constraints and shows that the OOP approach can significantly outperform IOP in a number of aspects, including memory, throughput and latency.
A Security Punctuation Framework for Enforcing Access Control on Streaming Data
Rimma V. Nehme,Elke A. Rundensteiner,Elisa Bertino +2 more
- 07 Apr 2008
TL;DR: A novel ";stream-centric"; approach, where security restrictions are not persistently stored on the DSMS server, but rather streamed together with the data, and the access control policies are expressed via security constraints and are embedded into data streams.
84
Related Papers (5)
Peter Pietzuch,Marco Fiscato,QH Vu +2 more
- 01 Aug 2009
Utkarsh Srivastava,Jennifer Widom +1 more
- 14 Jun 2004
Ying Liu,Beth Plale +1 more
- 01 Jan 2006
Edward Y. Chang,Ankur Jain +1 more
- 01 Jan 2006
Jennifer Widom,Shivnath Babu +1 more
- 01 Jan 2005