Revision Processing in a Stream Processing Engine: A High-Level Design
Esther Ryvkina,Anurag S. Maskey,Mitch Cherniack,Stanley B. Zdonik +3 more
- 03 Apr 2006
- pp 141-141
TL;DR: This work states that any stream processing engine should process revision inputs by generating revision outputs that correct previous query results, and knows of no stream processing system that presently has this capability.
read more
Abstract: Data stream processing systems have become ubiquitous in academic [1, 2, 5, 6] and commercial [11] sectors, with application areas that include financial services, network traffic analysis, battlefield monitoring and traffic control [3]. The append-only model of streams implies that input data is immutable and therefore always correct. But in practice, streaming data sources often contend with noise (e.g., embedded sensors) or data entry errors (e.g., financial data feeds) resulting in erroneous inputs and therefore, erroneous query results. Many data stream sources (e.g., commercial ticker feeds) issue "revision tuples" (revisions) that amend previously issued tuples (e.g. erroneous share prices). Ideally, any stream processing engine should process revision inputs by generating revision outputs that correct previous query results. We know of no stream processing system that presently has this capability.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
C-store: a column-oriented DBMS
Michael Stonebraker,Daniel J. Abadi,Adam Batkin,Xuedong Chen,Mitch Cherniack,Miguel Ferreira,Edmond Lau,Amerson Lin,Samuel Madden,Elizabeth O'Neil,Patrick O'Neil,Alexander Rasin,Nga Tran,Stan Zdonik +13 more
- 01 Dec 2018
TL;DR: Preliminary performance data on a subset of TPC-H is presented and it is shown that the system the team is building, C-Store, is substantially faster than popular commercial products.
The end of an architectural era: it's time for a complete rewrite
Michael Stonebraker,Samuel Madden,Daniel J. Abadi,Stavros Harizopoulos,Nabil Hachem,Pat Helland +5 more
- 01 Dec 2018
TL;DR: The current RDBMS code lines, while attempting to be a "one size fits all" solution, in fact, excel at nothing and should be retired in favor of a collection of "from scratch" specialized engines.
Supporting views in data stream management systems
TL;DR: The sliding window approach is generalized by introducing the synchronization principle that empowers SyncSQL with a formal mechanism to express queries with arbitrary refresh conditions and the article introduces the Nile-SyncSQL prototype to support SyncSQL queries.
153
Sequence Pattern Query Processing over Out-of-Order Event Streams
Mo Liu,Ming Li,Denis Golovnya,Elke A. Rundensteiner,Kajal T. Claypool +4 more
- 29 Mar 2009
TL;DR: This work proposes two alternate solutions: aggressive and conservative strategies respectively to process sequence pattern queries on out-of-order event streams and proposes a partial order guarantee (POG) model under which such correctness can be guaranteed.
High-performance dynamic pattern matching over disordered streams
Badrish Chandramouli,Jonathan Goldstein,David Maier +2 more
- 01 Sep 2010
TL;DR: This work proposes clean order-agnostic pattern-detection semantics for AFAs, with new algorithms that allow a very efficient implementation, while retaining significant expressiveness and supporting native handling of out-of-order input, stream revisions, dynamic patterns, and several optimizations.
References
•Proceedings Article
The Design of the Borealis Stream Processing Engine
Daniel J. Abadi,Yanif Ahmad,Magdalena Balazinska,Mitch Cherniack,Jeong-Hyon Hwang,Wolfgang Lindner,Anurag S. Maskey,Alexander Rasin,Esther Ryvkina,Nesime Tatbul,Ying Xing,Stan Zdonik +11 more
- 01 Jan 2005
TL;DR: This paper outlines the basic design and functionality of Borealis, and presents a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.
Monitoring streams: a new class of data management applications
Don Carney,Uǧur Çetintemel,Mitch Cherniack,Christian Convey,Sangdon Lee,Greg Seidman,Michael Stonebraker,Nesime Tatbul,Stan Zdonik +8 more
- 20 Aug 2002
TL;DR: This paper presents Aurora, a new DBMS that is currently under construction at Brandeis University, Brown University, and M.I.T. and describes the basic system architecture, a stream-oriented set of operators, optimization tactics, and support for real-time operation.
TelegraphCQ: continuous dataflow processing
Sirish Chandrasekaran,Owen Cooper,Amol Deshpande,Michael J. Franklin,Joseph M. Hellerstein,Wei Hong,Sailesh Krishnamurthy,Samuel Madden,Fred Reiss,Mehul A. Shah +9 more
- 09 Jun 2003
TL;DR: The current version of TelegraphCQ is shown, which is implemented by leveraging the code base of the open source PostgreSQL database system, which found that a significant portion of the PostgreSQL code was easily reusable.
882
•Proceedings Article
STREAM: The Stanford Stream Data Manager.
Arvind Arasu,Brian Babcock,Shivnath Babu,Mayur Datar,Keith Ito,Itaru Nishizawa,Justin Rosenstein,Jennifer Widom +7 more
- 01 Jan 2003
608