Many-task computing

Topic Tools

Papers published on a yearly basis

Papers

Journal Article•10.1145/1327452.1327492•

MapReduce: simplified data processing on large clusters

[...]

Jeffrey Dean¹, Sanjay Ghemawat¹•Institutions (1)

Google¹

01 Jan 2008-Communications of The ACM

TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.

...read moreread less

Abstract: MapReduce is a programming model and an associated implementation for processing and generating large datasets that is amenable to a broad variety of real-world tasks. Users specify the computation in terms of a map and a reduce function, and the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks. Programmers find the system easy to use: more than ten thousand distinct MapReduce programs have been implemented internally at Google over the past four years, and an average of one hundred thousand MapReduce jobs are executed on Google's clusters every day, processing a total of more than twenty petabytes of data per day.

...read moreread less

18,641 citations

Proceedings Article•10.1109/GRID.2004.14•

BOINC: A System for Public-Resource Computing and Storage

[...]

Dustin Anderson¹•Institutions (1)

University of California, Berkeley¹

8 Nov 2004

TL;DR: The goals of BOINC are described, the design issues that were confronted, and the solutions to these problems are described.

...read moreread less

Abstract: BOINC (Berkeley Open Infrastructure for Network Computing) is a software system that makes it easy for scientists to create and operate public-resource computing projects. It supports diverse applications, including those with large storage or communication requirements. PC owners can participate in multiple BOINC projects, and can specify how their resources are allocated among these projects. We describe the goals of BOINC, the design issues that we confronted, and our solutions to these problems.

...read moreread less

2,221 citations

Journal Article•10.1002/CPE.938•

Distributed computing in practice: the Condor experience

[...]

Douglas Thain¹, Todd Tannenbaum¹, Miron Livny¹•Institutions (1)

University of Wisconsin-Madison¹

01 Feb 2005-Concurrency and Computation: Practice and Experience

TL;DR: The history and philosophy of the Condor project is provided and how it has interacted with other projects and evolved along with the field of distributed computing is described.

...read moreread less

Abstract: SUMMARY Since 1984, the Condor project has enabled ordinary users to do extraordinary computing. Today, the project continues to explore the social and technical problems of cooperative computing on scales ranging from the desktop to the world-wide computational Grid. In this paper, we provide the history and philosophy of the Condor project and describe how it has interacted with other projects and evolved along with the field of distributed computing. We outline the core components of the Condor system and describe how the technology of computing must correspond to social structures. Throughout, we reflect on the lessons of experience and chart the course travelled by research ideas as they grow into production systems. Copyright c � 2005 John Wiley & Sons, Ltd.

...read moreread less

2,170 citations

Journal Article•10.1023/A:1015617019423•

Condor-G: A Computation Management Agent for Multi-Institutional Grids

[...]

James W. Frey¹, Todd Tannenbaum¹, Miron Livny¹, Ian Foster², Steven Tuecke² - Show less +1 more•Institutions (2)

University of Wisconsin-Madison¹, Argonne National Laboratory²

01 Jul 2002-Cluster Computing

TL;DR: Condor-G as discussed by the authors leverages software from Globus and Condor to enable users to harness multi-domain resources as if they all belong to one personal domain, and it handles job management, resource selection, security, and fault tolerance.

...read moreread less

Abstract: In recent years, there has been a dramatic increase in the number of available computing and storage resources. Yet few tools exist that allow these resources to be exploited effectively in an aggregated form. We present the Condor-G system, which leverages software from Globus and Condor to enable users to harness multi-domain resources as if they all belong to one personal domain. We describe the structure of Condor-G and how it handles job management, resource selection, security, and fault tolerance. We also present results from application experiments with the Condor-G system. We assert that Condor-G can serve as a general-purpose interface to Grid resources, for use by both end users and higher-level program development tools.

...read moreread less

848 citations

Proceedings Article•10.1109/MTAGS.2008.4777912•

Many-task computing for grids and supercomputers

[...]

Ioan Raicu¹, Ian Foster¹, Yong Zhao²•Institutions (2)

University of Chicago¹, Microsoft²

1 Nov 2008

TL;DR: Many-task computing aims to bridge the gap between two computing paradigms, high throughput computing and high performance computing, drawing attention to the many computations that are heterogeneous but not ldquohappilyrdquo parallel.

...read moreread less

Abstract: Many-task computing aims to bridge the gap between two computing paradigms, high throughput computing and high performance computing. Many task computing differs from high throughput computing in the emphasis of using large number of computing resources over short periods of time to accomplish many computational tasks (i.e. including both dependent and independent tasks), where primary metrics are measured in seconds (e.g. FLOPS, tasks/sec, MB/s I/O rates), as opposed to operations (e.g. jobs) per month. Many task computing denotes high-performance computations comprising multiple distinct activities, coupled via file system operations. Tasks may be small or large, uniprocessor or multiprocessor, compute-intensive or data-intensive. The set of tasks may be static or dynamic, homogeneous or heterogeneous, loosely coupled or tightly coupled. The aggregate number of tasks, quantity of computing, and volumes of data may be extremely large. Many task computing includes loosely coupled applications that are generally communication-intensive but not naturally expressed using standard message passing interface commonly found in high performance computing, drawing attention to the many computations that are heterogeneous but not ldquohappilyrdquo parallel.

...read moreread less

320 citations

...

Expand

Year	Papers
2020	2
2019	6
2018	1
2017	4
2016	10
2015	7

Topic Tools

Papers published on a yearly basis

Papers

MapReduce: simplified data processing on large clusters

BOINC: A System for Public-Resource Computing and Storage

Distributed computing in practice: the Condor experience

Condor-G: A Computation Management Agent for Multi-Institutional Grids

Many-task computing for grids and supercomputers

Related Topics (5)

Performance Metrics