A GPU-Based Enhanced Genetic Algorithm for Power-Aware Task Scheduling Problem in HPC Cloud
Nguyen Quang-Hung,Le Thanh Tan,Chiem Thach Phat,Nam Thoai +3 more
- 14 Apr 2014
- pp 159-169
TL;DR: The proposed genetic algorithm (GA) can handle large-scale task scheduling problems with scalable speedup under limitations of GPU device (e.g. GPU’s device memory, number of GPU cores, etc.).
read more
Abstract: In this paper, we consider power-aware task scheduling (PATS) in HPC clouds. Users request virtual machines (VMs) to execute their tasks. Each task is executed on one single VM, and requires a fixed number of cores (i.e., processors), computing power (million instructions per second - MIPS) of each core, a fixed start time and non-preemption in a duration. Each physical machine has maximum capacity resources on processors (cores); each core has limited computing power. The energy consumption of each placement is measured for cost calculating purposes. The power consumption of a physical machine is in a linear relationship with its CPU utilization. We want to minimize the total energy consumption of the placements of tasks. We propose here a genetic algorithm (GA) to solve the PATS problem. The GA is developed with two versions: (1) BKGPUGA, which is an adaptively implemented using NVIDIA’s Compute Unified Device Architecture (CUDA) framework; and (2) SGA, which is a serial GA version on CPU. The experimental results show the BKGPUGA program that executed on a single NVIDIA® TESLATM M2090 GPU (512 cores) card obtains significant speedups in comparing to the SGA program executing on Intel® XeonTM E5-2630 (2.3 GHz) on same input problem size. Both versions share the same GA’s parameters (e.g. number of generations, crossover and mutation probability, etc.) and a relative small (10-11) on difference of two finesses between BKGPUGA and SGA. Moreover, the proposed BKGPUGA program can handle large-scale task scheduling problems with scalable speedup under limitations of GPU device (e.g. GPU’s device memory, number of GPU cores, etc.).
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Adaptive resource management using many-core processing for fault tolerance based on cyber–physical cloud systems
TL;DR: adaptive resource management (ARM) is proposed to 1) minimize information loss due to the irregular lifespan of resources, such as sensors and actuators; and 2) quickly respond to any problems.
11
Advances in computation and intelligence : Second International Symposium, ISICA 2007, Wuhan, China, September 21-23, 2007 : proceedings
Lishan Kang,Yong Liu,Sanyou Zeng +2 more
- 01 Jan 2007
TL;DR: In this article, a multi-objective evolutionary algorithm for solving optimization problems with high-dimensional objective space is presented. But the authors focus on the problem of finding the optimal solution to the optimization problem.
2
Implementing Genetic Algorithm Accelerated By Intel Xeon Phi
Nguyen Quang-Hung,Anh-Tu Ngoc Tran,Nam Thoai +2 more
- 07 Dec 2017
TL;DR: The GAPhi framework solves the power-aware task scheduling (PATS) problems in shorter execution time than sequential genetic algorithm and achieves significant speedup in comparison to SGA program executed on CPU Intel® Xeon and GAGPU program execution on NVIDIA Tesla with same input problem size.
2
Distributed Genetic Algorithm on Cluster of Intel Xeon Phi Co-processors
Nguyen Quang-Hung,Anh-Tu Ngoc Tran,Nam Thoai +2 more
- 28 Nov 2018
TL;DR: This study investigates using hybrid MPI and OpenMP to parallelize genetic algorithm (GA) onto high performance computing (HPC) servers included Intel Xeon Phi co-processors.
Efficient auto-scaling scheme for rapid storage service using many-core of desktop storage virtualization based on IoT
Hyun-Woo Kim,Young-Sik Jeong +1 more
TL;DR: An efficient auto-scaling scheme (EAS) is proposed that minimizes deferred processing time in Internet of Things (IoT) environments by using many-cores of the GPU for clustering and auto- scaling in DSV environments.
References
Power provisioning for a warehouse-sized computer
Xiaobo Fan,Wolf-Dietrich Weber,Luiz Andre Barroso +2 more
- 09 Jun 2007
TL;DR: This paper presents the aggregate power usage characteristics of large collections of servers for different classes of applications over a period of approximately six months, and uses the modelling framework to estimate the potential of power management schemes to reduce peak power and energy usage.
A Comparison of Eleven Static Heuristics for Mapping a Class of Independent Tasks onto Heterogeneous Distributed Computing Systems
Tracy D. Braun,Howard Jay Siegel,N.B. Beck,Ladislau Bölöni,Muthucumaru Maheswaran,Albert Reuther,James Patrick Robertson,Mitchell D. Theys,Bin Yao,Debra Hensgen,Richard F. Freund +10 more
TL;DR: It is shown that for the cases studied here, the relatively simple Min?min heuristic performs well in comparison to the other techniques, and one even basis for comparison and insights into circumstances where one technique will out-perform another.
1.9K
Advances in Computational Intelligence
Jing Liu,Cesare Alippi,Bernadette Bouchon-Meunier,Garrison W. Greenwood,Hussein A. Abbass +4 more
- 01 Jan 2012
TL;DR: This work proposes customized model ensembles on demand, inspired by Lazy Learning, which finds the most relevant models from a DB of models, using their meta-information, and creates an ensemble, which produces an output that is a weighted interpolation or extrapolation of the outputs of the models ensemble.
176
•Book
Advances in Computation and Intelligence
Lishan Kang,Zhihua Cai,Xuesong Yan,Yong Liu +3 more
- 01 Jan 2009
TL;DR: A new Evolutionary Decision Theory for Many-Objective Optimization problems and an Effective Dynamical Multi-objective Evolutionary Algorithm for Solving Optimization Problems with High Dimensional Objective Space are presented.
96
GPU-based island model for evolutionary algorithms
Nouredine Melab,El-Ghazali Talbi +1 more
- 07 Jul 2010
TL;DR: This paper focuses on the parallel island model on GPU and addresses its re-design, implementation, and associated issues related to the GPU execution context.