LADS: optimizing data transfers using layout-aware data scheduling

doi:10.5555/2750482.2750488

Open AccessProceedings Article10.5555/2750482.2750488

LADS: optimizing data transfers using layout-aware data scheduling

Youngjae Kim, +3 more

- 16 Feb 2015

- pp 67-80

44

TL;DR: This paper identifies the issues that lead to congestion on the path of an end-to-end data transfer in the terabit network environment, and presents a new bulk data movement framework called LADS for terabit networks.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Proceedings Article•10.1145/3078597.3078614

Predicting Output Performance of a Petascale Supercomputer

Bing Xie, +6 more

- 26 Jun 2017

TL;DR: A predictive model useful for output performance prediction of supercomputer file systems under production load of Titan and its Lustre-based multi-stage write path is developed, using feature transformations to capture non-linear relationships.

...read moreread less

65

•Proceedings Article•10.1109/CLUSTER.2015.38

TRIO: Burst Buffer Based I/O Orchestration

Teng Wang, +4 more

- 08 Sep 2015

TL;DR: This paper proposes a burst buffer based I/O orchestration framework, named TRIO, to intercept and reshape the bursty writes for better sequential write traffic to storage servers, and demonstrates that TRIO could efficiently utilize storage bandwidth and reduce the average job I-O time by 37% on average for data-intensive applications in typical checkpointing scenarios.

...read moreread less

49

Server-Side Log Data Analytics for I/O Workload Characterization and Coordination on Large Shared Storage Systems

Liu Yang, +3 more

- 01 Jan 2016

35

Journal Article•10.1016/J.FUTURE.2019.06.006

SciSpace: A scientific collaboration workspace for geo-distributed HPC data centers

Awais Khan, +3 more

- 01 Dec 2019

- Future Generation Computer Systems

TL;DR: SciSpace provides a global view of information shared from multiple geo-distributed HPC data centers under a single workspace that supports native data-access to gain high-performance when data read or write is required in native data center namespace and is evaluated using real scientific datasets and applications.

...read moreread less

21

•Journal Article•10.1109/TPDS.2016.2550439

Optimizing End-to-End Big Data Transfers over Terabits Network Infrastructure

Youngjae Kim, +4 more

- 01 Jan 2017

- IEEE Transactions on Parallel and Distri...

TL;DR: This paper identifies the issues that lead to congestion on the path of an end-to-end data transfer in the terabit network environment, and presents a new bulk data movement framework for terabit networks, called LADS, which can avoid congested storage elements within the shared storage resource, improving input/output bandwidth, and data transfer rates across the high speed networks.

...read moreread less

17

...

Expand

References

•Proceedings Article•10.5555/1298455.1298485

Ceph: a scalable, high-performance distributed file system

Sage A. Weil, +4 more

- 06 Nov 2006

TL;DR: Performance measurements under a variety of workloads show that Ceph has excellent I/O performance and scalable metadata management, supporting more than 250,000 metadata operations per second.

...read moreread less

1.8K

•Proceedings Article

GPFS: A Shared-Disk File System for Large Computing Clusters

Frank B. Schmuck, +1 more

- 28 Jan 2002

TL;DR: GPFS is IBM's parallel, shared-disk file system for cluster computers, available on the RS/6000 SP parallel supercomputer and on Linux clusters, and discusses how distributed locking and recovery techniques were extended to scale to large clusters.

...read moreread less

1.4K

•Proceedings Article•10.1109/SC.2005.72

The Globus Striped GridFTP Framework and Server

William Allcock, +6 more

- 12 Nov 2005

TL;DR: It is argued that this combination of performance and modular structure make the Globus GridFTP framework both a good foundation on which to build tools and applications, and a unique testbed for the study of innovative data management techniques and network protocols.

...read moreread less

707

•Proceedings Article

Scalable performance of the Panasas parallel file system

Brent B. Welch, +7 more

- 26 Feb 2008

TL;DR: Performance measures of I/O, metadata, and recovery operations for storage clusters that range in size from 10 to 120 storage nodes, 1 to 12 metadata nodes, and with file system client counts ranging from 1 to 100 compute nodes are presented.

...read moreread less

392

Proceedings Article•10.1109/SC.2010.32

Managing Variability in the IO Performance of Petascale Storage Systems

Jay Lofstead, +7 more

- 13 Nov 2010

TL;DR: These measurements motivate developing a 'managed' IO approach using adaptive algorithms varying the IO system workload based on current levels and use areas, which achieves higher overall performance and less variability in both a typical usage environment and with artificially introduced levels of 'noise'.

...read moreread less

193