Efficient query processing on distributed stream processing engine

doi:10.1145/3022227.3022255

Proceedings Article10.1145/3022227.3022255

Efficient query processing on distributed stream processing engine

Manhui Han, +2 more

- 05 Jan 2017

- pp 29

5

TL;DR: This paper proposes a methodology to transform queries executable in the engine and optimization technique for query processing and results show that the methodology is efficient on processing queries for data streams.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1016/J.INS.2020.09.037

An automatic clustering technique for query plan recommendation

Elham Azhir, +6 more

- 04 Feb 2021

- Information Sciences

TL;DR: A multi-objective automatic query plan recommendation method, a combination of incremental DBSCAN and NSGA-II, which outperforms the other well-known approaches for query processing and improves the accuracy of clustering.

...read moreread less

21

•Journal Article•10.3390/math10193517

Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark

Elham Azhir, +3 more

- 17 Sep 2022

- Mathematics

TL;DR: The results of the experiments demonstrated the effectiveness of parallel query clustering in achieving high scalability, and Apache Spark achieved better performance than Apache Hadoop, reaching an average speedup of 2x.

...read moreread less

5

•Posted Content•10.31219/osf.io/mgpr7

Performance Evaluation of Query Plan Recommendation with Apache Hadoop and Apache Spark

17 Sep 2022

TL;DR: In this paper , a MapReduce-based access plan recommendation method is proposed to cluster different sizes of query datasets in the query space based on the query execution plans (QEPs) and the performance evaluation is performed based on execution time.

...read moreread less

5

•Journal Article•10.7717/PEERJ-CS.580

A technique for parallel query optimization using MapReduce framework and a semantic-based clustering method.

Elham Azhir, +4 more

- 01 Jan 2021

- PeerJ

TL;DR: In this article, the authors have applied and tested a model for clustering variant sizes of large query datasets parallelly using MapReduce and showed the effectiveness of the parallel implementation of query workloads clustering to achieve good scalability.

...read moreread less

4

Proceedings Article•10.1145/3400903.3400932

Shared Execution Techniques for Business Data Analytics over Big Data Streams

Serkan Uzunbaz, +1 more

- 07 Jul 2020

TL;DR: A global query execution plan to simultaneously support multiple queries, and minimize the number of input scans, operators, and tuples flowing between the operators is presented.

...read moreread less

2

References

•Journal Article•10.1007/S00778-003-0095-Z

Aurora: a new model and architecture for data stream management

Daniel J. Abadi, +8 more

- 01 Aug 2003

TL;DR: The basic processing model and architecture of Aurora, a new system to manage data streams for monitoring applications, are described and a stream-oriented set of operators are described.

...read moreread less

1.6K

•Proceedings Article

The Design of the Borealis Stream Processing Engine

Daniel J. Abadi, +11 more

- 01 Jan 2005

TL;DR: This paper outlines the basic design and functionality of Borealis, and presents a highly flexible and scalable QoS-based optimization model that operates across server and sensor networks and a new fault-tolerance model with flexible consistency-availability trade-offs.

...read moreread less

1.6K

Journal Article•10.1007/S00778-004-0147-Z

The CQL continuous query language: semantic foundations and query execution

Arvind Arasu, +2 more

- 01 Jun 2006

TL;DR: This paper presents the structure of CQL's query execution plans as well as details of the most important components: operators, interoperator queues, synopses, and sharing of components among multiple operators and queries.

...read moreread less

1.4K

•Proceedings Article

TelegraphCQ: Continuous Dataflow Processing for an Uncertain World.

Sirish Chandrasekaran, +10 more

- 01 Jan 2003

TL;DR: The next generation Telegraph system, called TelegraphCQ, is focused on meeting the challenges that arise in handling large streams of continuous queries over high-volume, highly-variable data streams and leverages the PostgreSQL open source code base.

...read moreread less

1.2K

•Journal Article