Proceedings Article10.1109/IPDPS.2007.370281
Optimizing Multiple Distributed Stream Queries Using Hierarchical Network Partitions
Sangeetha Seshadri,Vineet Kumar,Brian F. Cooper,Ling Liu +3 more
- 26 Mar 2007
- pp 1-10
TL;DR: This paper proposes two algorithms - top-down and bottom-up which utilize hierarchical network partitions to provide scalable query optimization and establishes the bounds on the search-space and shows the sub-optimality of these algorithms.
read more
Abstract: We consider the problem of query optimization in distributed data stream systems where multiple continuous queries may be executing simultaneously. In order to achieve the best performance, query planning (such as join ordering) must be considered in conjunction with deployment planning (e.g., assigning operators to physical nodes with optimal ordering). However, such a combination involves not only a large number of network nodes but also many query operators, resulting in an extremely large search space for optimal solutions. Our paper aims at addressing this problem by utilizing hierarchical network partitions. We propose two algorithms - top-down and bottom-up which utilize hierarchical network partitions to provide scalable query optimization. Formal analysis is presented to establish the bounds on the search-space and to show the sub-optimality of our algorithms. Through simulations and experiments using a prototype deployed on Emulab we demonstrate the effectiveness of our algorithms.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Patent
Optimal Operator Placement for Distributed Query Processing
Ki Hong Kim,Sangyong Hwang,Sung Heun Wi,Jane Jung Lee,Joo Young Yoon,Sang Kyun Cha +5 more
- 01 Aug 2014
TL;DR: In this paper, total global minimum costs can be determined for multiple sub-plans for completing a multi-operation database process to be performed in a distributed database management system that includes a plurality of nodes.
9
VStore: efficiently storing virtualized state across mobile devices
Balasubramanian Seshasayee,Nitya Narasimhan,Ashish Bijlani,Ankur Pai,Karsten Schwan +4 more
- 17 Jun 2008
TL;DR: VStore is presented, a flexible mechanism for storage management and content protection that exploits virtualization to modularize data access and sharing mechanisms into containers separate from those containing guest operating systems and applications.
7
Managing opportunistic and dedicated resources in a bi-modal service deployment architecture
Shah Asaduzzaman
- 01 Jan 2007
TL;DR: A new bi-modal architecture for a geographically distributed and cost-effective service hosting platform that utilizes a combination of statically provisioned dedicated resource pools and widely available opportunistic public resources to provide quality assured services is introduced.
5
Dynamic Block Sizing for Data Stream Processing Systems
Robert Birke,Evangelia Kalyvianaki,Walter Binder,Martin L. Schmatz,Lydia Y. Chen +4 more
- 01 Apr 2016
TL;DR: A controller, iBLOC, is developed that adjusts the block sizes of streaming jobs on the fly and the parallelism level of jobs, according to the input data rates and the query priorities, to achieve significant CPU-core savings from the primary query type such that multiple queries can run together without impairing their latency constraints, in comparison to a static block-sizing scheme.
4
Grouping Distributed Stream Query Services by Operator Similarity and Network Locality
Sangeetha Seshadri,Bhuvan Bamba,Brian F. Cooper,Vineet Kumar,Ling Liu,Karsten Schwan,G. Zhang +6 more
- 06 Jul 2008
TL;DR: Evaluation of the design and evaluation of a distributed stream query service that achieves massive scalability by taking advantage of the opportunity to reuse the same distributed operator for multiple and different concurrent queries shows that stream queries relaxed and grouped using the approach operate efficiently without a priori knowledge of workload, and offer an order of magnitude improvement over existing approaches.
4
References
TAG: a Tiny AGgregation service for Ad-Hoc sensor networks
Samuel Madden,Michael J. Franklin,Joseph M. Hellerstein,Wei Hong +3 more
- 09 Dec 2002
TL;DR: This work presents the Tiny AGgregation (TAG) service for aggregation in low-power, distributed, wireless environments, and discusses a variety of optimizations for improving the performance and fault tolerance of the basic solution.
Models and issues in data stream systems
Brian Babcock,Shivnath Babu,Mayur Datar,Rajeev Motwani,Jennifer Widom +4 more
- 03 Jun 2002
TL;DR: The need for and research issues arising from a new model of data processing, where data does not take the form of persistent relations, but rather arrives in multiple, continuous, rapid, time-varying data streams are motivated.