Data stream management system

Topic Tools

Papers published on a yearly basis

Papers

Proceedings Article•10.1145/543613.543615•

Models and issues in data stream systems

[...]

Brian Babcock¹, Shivnath Babu¹, Mayur Datar¹, Rajeev Motwani¹, Jennifer Widom¹ - Show less +1 more•Institutions (1)

Stanford University¹

3 Jun 2002

TL;DR: The need for and research issues arising from a new model of data processing, where data does not take the form of persistent relations, but rather arrives in multiple, continuous, rapid, time-varying data streams are motivated.

...read moreread less

Abstract: In this overview paper we motivate the need for and research issues arising from a new model of data processing. In this model, data does not take the form of persistent relations, but rather arrives in multiple, continuous, rapid, time-varying data streams. In addition to reviewing past work relevant to data stream systems and current projects in the area, the paper explores topics in stream query languages, new requirements and challenges in query processing, and algorithmic issues.

...read moreread less

3,094 citations

Journal Article•10.1007/S00778-003-0095-Z•

Aurora: a new model and architecture for data stream management

[...]

Daniel J. Abadi¹, Don Carney², Uğur Çetintemel², Mitch Cherniack¹, Christian Convey², Sangdon Lee², Michael Stonebraker³, Nesime Tatbul², Stan Zdonik² - Show less +5 more•Institutions (3)

Brandeis University¹, Brown University², Massachusetts Institute of Technology³

1 Aug 2003

TL;DR: The basic processing model and architecture of Aurora, a new system to manage data streams for monitoring applications, are described and a stream-oriented set of operators are described.

...read moreread less

Abstract: .This paper describes the basic processing model and architecture of Aurora, a new system to manage data streams for monitoring applications. Monitoring applications differ substantially from conventional business data processing. The fact that a software system must process and react to continual inputs from many sources (e.g., sensors) rather than from human operators requires one to rethink the fundamental architecture of a DBMS for this application area. In this paper, we present Aurora, a new DBMS currently under construction at Brandeis University, Brown University, and M.I.T. We first provide an overview of the basic Aurora model and architecture and then describe in detail a stream-oriented set of operators.

...read moreread less

1,652 citations

Proceedings Article•

The Design of the Borealis Stream Processing Engine

[...]

Daniel J. Abadi¹, Yanif Ahmad², Magdalena Balazinska¹, Mitch Cherniack³, Jeong-Hyon Hwang², Wolfgang Lindner¹, Anurag S. Maskey³, Alexander Rasin², Esther Ryvkina³, Nesime Tatbul², Ying Xing², Stan Zdonik² - Show less +8 more•Institutions (3)

Massachusetts Institute of Technology¹, Brown University², Brandeis University³

1 Jan 2005

TL;DR: This paper outlines the basic design and functionality of Borealis, and presents a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.

...read moreread less

Abstract: Borealis is a second-generation distributed stream processing engine that is being developed at Brandeis University, Brown University, and MIT. Borealis inherits core stream processing functionality from Aurora [14] and distribution functionality from Medusa [51]. Borealis modifies and extends both systems in non-trivial and critical ways to provide advanced capabilities that are commonly required by newly-emerging stream processing applications. In this paper, we outline the basic design and functionality of Borealis. Through sample real-world applications, we motivate the need for dynamically revising query results and modifying query specifications. We then describe how Borealis addresses these challenges through an innovative set of features, including revision records, time travel, and control lines. Finally, we present a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.

...read moreread less

1,612 citations

Journal Article•10.1007/S00778-004-0147-Z•

The CQL continuous query language: semantic foundations and query execution

[...]

Arvind Arasu¹, Shivnath Babu¹, Jennifer Widom¹•Institutions (1)

Stanford University¹

1 Jun 2006

TL;DR: This paper presents the structure of CQL's query execution plans as well as details of the most important components: operators, interoperator queues, synopses, and sharing of components among multiple operators and queries.

...read moreread less

Abstract: CQL, a continuous query language, is supported by the STREAM prototype data stream management system (DSMS) at Stanford. CQL is an expressive SQL-based declarative language for registering continuous queries against streams and stored relations. We begin by presenting an abstract semantics that relies only on “black-box” mappings among streams and relations. From these mappings we define a precise and general interpretation for continuous queries. CQL is an instantiation of our abstract semantics using SQL to map from relations to relations, window specifications derived from SQL-99 to map from streams to relations, and three new operators to map from relations to streams. Most of the CQL language is operational in the STREAM system. We present the structure of CQL's query execution plans as well as details of the most important components: operators, interoperator queues, synopses, and sharing of components among multiple operators and queries. Examples throughout the paper are drawn from the Linear Road benchmark recently proposed for DSMSs. We also curate a public repository of data stream applications that includes a wide variety of queries expressed in CQL. The relative ease of capturing these applications in CQL is one indicator that the language contains an appropriate set of constructs for data stream processing.

...read moreread less

1,414 citations

Proceedings Article•

TelegraphCQ: Continuous Dataflow Processing for an Uncertain World.

[...]

Sirish Chandrasekaran, Owen Cooper, Amol Deshpande, Michael J. Franklin, Joseph M. Hellerstein, Wei Hong, Sailesh Krishnamurthy, Samuel Madden, Vijayshankar Raman, Frederick Reiss, Mehul A. Shah - Show less +7 more

1 Jan 2003

TL;DR: The next generation Telegraph system, called TelegraphCQ, is focused on meeting the challenges that arise in handling large streams of continuous queries over high-volume, highly-variable data streams and leverages the PostgreSQL open source code base.

...read moreread less

Abstract: Increasingly pervasive networks are leading towards a world where data is constantly in motion. In such a world, conventional techniques for query processing, which were developed under the assumption of a far more static and predictable computational environment, will not be sufficient. Instead, query processors based on adaptive dataflow will be necessary. The Telegraph project has developed a suite of novel technologies for continuously adaptive query processing. The next generation Telegraph system, called TelegraphCQ, is focused on meeting the challenges that arise in handling large streams of continuous queries over high-volume, highly-variable data streams. In this paper, we describe the system architecture and its underlying technology, and report on our ongoing implementation effort, which leverages the PostgreSQL open source code base. We also discuss open issues and our research agenda.

...read moreread less

1,290 citations

...

Expand

Year	Papers
2021	6
2020	2
2019	1
2018	3
2017	8
2016	13

Topic Tools

Papers published on a yearly basis

Papers

Models and issues in data stream systems

Aurora: a new model and architecture for data stream management

The Design of the Borealis Stream Processing Engine

The CQL continuous query language: semantic foundations and query execution

TelegraphCQ: Continuous Dataflow Processing for an Uncertain World.

Related Topics (5)

Performance Metrics