An Efficient Execution Model for Reactive Stream Programs
Vu Thien Nga Nguyen
- 25 Aug 2015
TL;DR: This thesis exploits characteristics of RSPs to derive effective scheduling strategies on uniform shared-memory platforms to optimise both throughput and latency, implemented in two heuristic-based schedulers.
read more
Abstract: Stream programming is a paradigm where a program is structured by a set of computational nodes connected by streams Focusing on data moving between computational nodes via streams, this programming model fits well for applications that process long sequences of data We call such applications reactive stream programs (RSPs) to distinguish them from stream programs with rather small and finite input data In stream programming, concurrency is expressed implicitly via communication streams This helps to reduce the complexity of parallel programming For this reason, stream programming has gained popularity as a programming model for parallel platforms However, it is also challenging to analyse and improve the performance without an understanding of the program’s internal behaviour This thesis targets an efficient execution model for deploying RSPs on parallel platforms This execution model includes a monitoring framework to understand the internal behaviour of RSPs, scheduling strategies for RSPs on uniform shared-memory platforms; and mapping techniques for deploying RSPs on heterogeneous distributed platforms The foundation of the execution model is based on a study of the performance of RSPs in terms of throughput and latency This study includes quantitative formulae for throughput and latency; and the identification of factors that influence these performance metrics Based on the study of RSP performance, this thesis exploits characteristics of RSPs to derive effective scheduling strategies on uniform shared-memory platforms Aiming to optimise both throughput and latency, these scheduling strategies are implemented in two heuristic-based schedulers Both of them are designed to be centralised to provide load balancing for RSPs with dynamic behaviour as well as dynamic structures The first one uses the notion of positive and negative data demands on each stream to determine the scheduling priorities This scheduler is independent from the runtime system The second one requires the runtime system to provide the position information for each computational node in the RSP; and uses that to decide the scheduling priorities Our experiments show that both schedulers provides similar performance while being significantly better than a reference implementation without dynamic load balancing Also based on the study of RSP performance, we present in this thesis two new heuristic partitioning algorithms which are used to map RSPs onto heterogeneous distributed platforms These are Kernighan-Lin Adaptation (KLA) and Congestion Avoidance (CA), where the main objective is to optimise the throughput This is a multi-parameter optimisation problem where existing graph partitioning algorithms are not applicable Compared to the generic meta-heuristic Simulated Annealing algorithm, both proposed algorithms achieve equally good or better results KLA is faster for small benchmarks while slower for large ones In contrast, CA is always orders of magnitudes faster even for very large benchmarks
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Scheduling And Load Balancing In Parallel And Distributed Systems
Monika Richter
- 01 Jan 2016
TL;DR: Thank you for reading scheduling and load balancing in parallel and distributed systems, which helps people to enjoy a good book with a cup of tea in the afternoon instead of juggling with some harmful bugs inside their computer.
22
Dynamic Scheduling Strategies for an Adaptive, Asynchronous Parallel Global Optimization Algorithm ; CU-CS-625-92
Sharon L Smith,Robert B. Schnabel +1 more
- 01 Jan 1992
TL;DR: This paper explores the use of dynamic scheduling strategies for irregular parallel algorithms in distributed memory computational environments, and considers a new hybrid approach that uses aspects of both centralized and distributed scheduling.
3
RA-LPEL : a Resource-Aware Light-weight Parallel Execution Layer for reactive stream processing networks on the SCC many-core tiled architecture
Nilesh Karavadara
- 26 Jul 2016
TL;DR: The research contribution focuses on the design of the execution layer to run RSPs on tiled many-core architectures, using the Intel’s Single-chip Cloud Computer (SCC) processor as a concrete experimentation platform and a Dynamic Voltage and Frequency Scaling (DVFS) technique for RSP deployed on many- core architectures.
2
References
Optimization by Simulated Annealing
TL;DR: There is a deep and useful connection between statistical mechanics and multivariate or combinatorial optimization (finding the minimum of a given function depending on many parameters), and a detailed analogy with annealing in solids provides a framework for optimization of very large and complex systems.
46.9K
The Pricing of Options and Corporate Liabilities
Fischer Black,Myron S. Scholes +1 more
TL;DR: In this paper, a theoretical valuation formula for options is derived, based on the assumption that options are correctly priced in the market and it should not be possible to make sure profits by creating portfolios of long and short positions in options and their underlying stocks.
31.9K
Normalized cuts and image segmentation
Jianbo Shi,Jitendra Malik +1 more
TL;DR: This work treats image segmentation as a graph partitioning problem and proposes a novel global criterion, the normalized cut, for segmenting the graph, which measures both the total dissimilarity between the different groups as well as the total similarity within the groups.
Normalized cuts and image segmentation
Jianbo Shi,Jitendra Malik +1 more
- 17 Jun 1997
TL;DR: This work treats image segmentation as a graph partitioning problem and proposes a novel global criterion, the normalized cut, for segmenting the graph, which measures both the total dissimilarity between the different groups as well as the total similarity within the groups.
•Book
Scheduling: Theory, Algorithms, and Systems
Michael Pinedo
- 15 Jul 1994
TL;DR: Scheduling will serve as an essential reference for professionals working on scheduling problems in manufacturing and computing environments and Graduate students in operations management, operations research, industrial engineering and computer science will find the book to be an accessible and invaluable resource.
7.2K