Parallel Data Processing in the Cloud using Nephele

doi:10.5120/12060-7527

Open AccessJournal Article10.5120/12060-7527

Parallel Data Processing in the Cloud using Nephele

Mayura D. Tapkire, +2 more

- 31 May 2013

- International Journal of Computer Applic...

- Vol. 69, Iss: 17, pp 1-8

2

TL;DR: Nephele is the first data processing framework to explicitly exploit the dynamic resource allocation offered by today’s IaaS clouds for both, task scheduling and execution.

Abstract: In recent years, Infrastructure-as-a-Service (IaaS) clouds have become increasingly popular as a flexible and inexpensive platform for ad-hoc parallel data processing. Major players in cloud computing have started to integrate frameworks for parallel data processing in their product portfolio, making it easy for customers to access these services and to deploy their programs. However, currently used processing frameworks have been designed for static, homogeneous cluster systems and do not support the new features which distinguish the cloud platform. In this paper discussion is being done on the research project Nephele. Nephele is the first data processing framework to explicitly exploit the dynamic resource allocation offered by today‟s IaaS clouds for both, task scheduling and execution. First performance results of Nephele are presented and its efficiency is compared with one of the well-known software, MapReduce. MapReduce is chosen for comparison since it is open source software and currently enjoys high popularity in the data processing community.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

Journal Article•10.1002/SEC.1224

Intrusion detection techniques for mobile cloud computing in heterogeneous 5G

Keke Gai, +3 more

- 10 Nov 2016

- Security and Communication Networks

TL;DR: It is concluded that the implementation of mobile cloud computing can be secured by the proposed framework because it will provide well-protected Web services and adaptable IDSs in the complicated heterogeneous 5G environment.

...read moreread less

182

Journal Article•10.1016/J.JPDC.2017.08.010

Approaches for optimizing virtual machine placement and migration in cloud environments: A survey

Manoel Campos da Silva Filho, +3 more

- 01 Jan 2018

- Journal of Parallel and Distributed Comp...

TL;DR: This work presents a cloud computing background, a review of several proposals, a discussion of problem formulations, advantages and shortcomings of reviewed works, and provides several open issues, showing the relevancy of the topic in an increasing and demanding market.

...read moreread less

123

References

Journal Article•10.1145/1327452.1327492

MapReduce: simplified data processing on large clusters

Jeffrey Dean, +1 more

- 01 Jan 2008

- Communications of The ACM

TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.

...read moreread less

18.6K

Proceedings Article•10.1145/1272996.1273005

Dryad: distributed data-parallel programs from sequential building blocks

Michael Isard, +4 more

- 21 Mar 2007

TL;DR: The Dryad execution engine handles all the difficult problems of creating a large distributed, concurrent application: scheduling the use of computers and their CPUs, recovering from communication or computer failures, and transporting data between vertices.

...read moreread less

3K

Journal Article•10.1023/A:1015617019423

Condor-G: A Computation Management Agent for Multi-Institutional Grids

James W. Frey, +4 more

- 01 Jul 2002

- Cluster Computing

TL;DR: Condor-G as discussed by the authors leverages software from Globus and Condor to enable users to harness multi-domain resources as if they all belong to one personal domain, and it handles job management, resource selection, security, and fault tolerance.

...read moreread less

848

•Journal Article•10.3844/JCSSP.2012.780.788

A Dynamic Resource Allocation Method for Parallel DataProcessing in Cloud Computing

V. Venkatesa Kumar, +1 more

- 29 Feb 2012

- Journal of Computer Science

TL;DR: A novel Turnaround time utility scheduling approach which focuses on both high priority and the low priority tasks that arrives for scheduling is proposed.

...read moreread less

32