Energy-efficient Data-intensive Computing with a Fast Array of Wimpy Nodes

Open Access

Energy-efficient Data-intensive Computing with a Fast Array of Wimpy Nodes

- 01 Oct 2011

9

TL;DR: Using FAWN as an early example, it is demonstrated that pervasive use of vector interfaces throughout distributed storage systems can improve throughput by an order of magnitude and eliminate the redundant work found in many data-intensive workloads.

Abstract: : Large-scale data-intensive computing systems have become a critical foundation for Internet-scale services. Their widespread growth during the past decade has raised datacenter energy demand and created an increasingly large financial burden and scaling challenge: Peak energy requirements today are a significant cost of provisioning and operating datacenters. In this thesis we propose to reduce the peak energy consumption of datacenters by using a FAWN: A Fast Array of Wimpy Nodes. FAWN is an approach to building datacenter server clusters using low-cost, low-power servers that are individually optimized for energy efficiency rather than raw performance alone. FAWN systems, however, have a different set of resource constraints than traditional systems that can prevent existing software from reaping the improved energy efficiency benefits FAWN systems can provide. This dissertation describes the principles behind FAWN and the software techniques necessary to unlock its energy efficiency potential. First, we present a deep study into building FAWN-KV, a distributed, log-structured key-value storage system designed for an early FAWN prototype. Second, we present a broader classification and workload analysis showing when FAWN can be more energy-efficient and under what workload conditions a FAWN cluster would perform poorly in comparison to a smaller number of high-speed systems. Last, we describe modern trends that portend a narrowing gap between CPU and I/O capability and highlight the challenges endemic to all future balanced systems. Using FAWN as an early example, we demonstrate that pervasive use of vector interfaces throughout distributed storage systems can improve throughput by an order of magnitude and eliminate the redundant work found in many data-intensive workloads.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1145/2254756.2254766

Workload analysis of a large-scale key-value store

Berk Atikoglu, +4 more

- 11 Jun 2012

TL;DR: This paper collects detailed traces from Facebook's Memcached deployment, arguably the world's largest, and analyzes the workloads from multiple angles, including: request composition, size, and rate; cache efficacy; temporal patterns; and application use cases.

...read moreread less

1K

•10.1184/R1/6468164.V1

A Case for Efficient Hardware/Software Cooperative Management of Storage and Memory

Justin Meza, +5 more

- 24 Jun 2013

TL;DR: The goal of this work is to explore the design of a Persistent Memory Manager that coordinates the management of memory and storage under a single hardware unit in a single address space and shows that such a system with a persistent memory can improve energy efficiency and performance.

...read moreread less

113

Journal Article•10.1109/MIC.2013.80

Characterizing Facebook's Memcached Workload

Yuehai Xu, +3 more

- 01 Mar 2014

- IEEE Internet Computing

TL;DR: This article analyzes the Memcached workload at Facebook, looking at server-side performance, request composition, caching efficacy, and key locality, to lead to several design insights and new research directions for key-value caches.

...read moreread less

74

•Proceedings Article

{MAPX}: Controlled Data Migration in the Expansion of Decentralized Object-Based Storage Systems

Li Wang, +3 more

- 01 Jan 2020

TL;DR: MAPX is a novel extension to CRUSH that uses an extra time-dimension mapping (from object creation times to cluster expansion times) for controlled data migration in cluster expansions and outperforms the CRUSH-based system by up to 4.25× in the tail latency.

...read moreread less

25

Patent

Solid-state drive management and control

Sriram Sankar, +1 more

- 16 Sep 2014

TL;DR: In this paper, a management engine for controlling a solid-state drive (SSD) includes an input interface configured to receive a target operation profile from an input source, and a process component configured to retrieve an operating policy from a database, and determine operating parameters for the SSD based on retrieved operating policy.

...read moreread less

10

References

Journal Article•10.21276/IJRE.2018.5.5.4

MapReduce: simplified data processing on large clusters

Jeffrey Dean, +1 more

- 06 Dec 2004

TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.

...read moreread less

22.7K

Journal Article•10.1145/1327452.1327492

MapReduce: simplified data processing on large clusters

Jeffrey Dean, +1 more

- 01 Jan 2008

- Communications of The ACM

TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.

...read moreread less

18.6K

Proceedings Article•10.1145/383059.383071

Chord: A scalable peer-to-peer lookup service for internet applications

Ion Stoica, +4 more

- 27 Aug 2001

TL;DR: Results from theoretical analysis, simulations, and experiments show that Chord is scalable, with communication cost and the state maintained by each node scaling logarithmically with the number of Chord nodes.

...read moreread less

11.2K

Proceedings Article•10.1145/1294261.1294281

Dynamo: amazon's highly available key-value store

Giuseppe deCandia, +8 more

- 14 Oct 2007

TL;DR: D Dynamo is presented, a highly available key-value storage system that some of Amazon's core services use to provide an "always-on" experience and makes extensive use of object versioning and application-assisted conflict resolution in a manner that provides a novel interface for developers to use.

...read moreread less

4.5K

Proceedings Article•10.1145/1272996.1273005

Dryad: distributed data-parallel programs from sequential building blocks

Michael Isard, +4 more

- 21 Mar 2007

TL;DR: The Dryad execution engine handles all the difficult problems of creating a large distributed, concurrent application: scheduling the use of computers and their CPUs, recovering from communication or computer failures, and transporting data between vertices.

...read moreread less

3K

...

Expand

Energy-efficient Data-intensive Computing with a Fast Array of Wimpy Nodes

Chat with Paper

AI Agents for this Paper

Citations

Workload analysis of a large-scale key-value store

A Case for Efficient Hardware/Software Cooperative Management of Storage and Memory

Characterizing Facebook's Memcached Workload

{MAPX}: Controlled Data Migration in the Expansion of Decentralized Object-Based Storage Systems

Solid-state drive management and control

References

MapReduce: simplified data processing on large clusters

MapReduce: simplified data processing on large clusters

Chord: A scalable peer-to-peer lookup service for internet applications

Dynamo: amazon's highly available key-value store

Dryad: distributed data-parallel programs from sequential building blocks

Related Papers (5)

Resource efficient computing for warehouse-scale datacenters

Analysis of power consumption in heterogeneous virtual machine environments

Energy, performance and cost efficient datacenters: A survey

A Study of Efficient Energy Management Techniques for Cloud Computing Environment

The Future of Cloud Computing: Opportunities, Challenges and Research Trends