Journal Article10.1109/TPDS.2018.2794989
GrapH: Traffic-Aware Graph Processing
29
TL;DR: GrapH is developed, the first graph processing system using vertex-cut graph partitioning that considers both, diverse vertex traffic and heterogeneous network costs, and the main idea is to avoid frequent communication over expensive network links using an adaptive edge migration strategy.
read more
Abstract: Distributed graph processing systems such as Pregel, PowerGraph, or GraphX gained popularity due to their superior performance of data analytics on graph-structured data. These systems employ partitioning algorithms to parallelize graph analytics while minimizing inter-partition communication. Recent partitioning algorithms, however, unrealistically assume a uniform and constant amount of data exchanged between graph vertices (i.e., uniform vertex traffic ) and homogeneous network costs between workers hosting the graph partitions. This leads to suboptimal partitioning decisions and inefficient graph processing. To this end, we developed GrapH, the first graph processing system using vertex-cut graph partitioning that considers both, diverse vertex traffic and heterogeneous network costs. The main idea is to avoid frequent communication over expensive network links using an adaptive edge migration strategy. Our evaluations show an improvement of 10 percent in graph processing latency and 60 percent in communication costs compared to state-of-the-art partitioning approaches.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
PaGraph: Scaling GNN training on large graphs via computation-aware caching
Zhiqi Lin,Cheng Li,Youshan Miao,Yunxin Liu,Yinlong Xu +4 more
- 12 Oct 2020
TL;DR: PaGraph is proposed, a system that supports general and efficient sampling-based GNN training on single-server with multi-GPU with good cache efficiency and develops a fast GNN-computation-aware partition algorithm to avoid cross-partition access during data parallel training and achieves better cache efficiency.
165
A Comprehensive Survey on Parallelization and Elasticity in Stream Processing
Henriette Röger,Ruben Mayer +1 more
TL;DR: In this article, a survey of the state of the art in stream processing parallelization and elasticity is presented, which is necessary to consolidate the state-of-the-art and to plan future research directions on this basis.
ADWISE: Adaptive Window-Based Streaming Edge Partitioning for High-Speed Graph Processing
Christian Mayer,Ruben Mayer,Muhammad Adnan Tariq,Heiko Geppert,Larissa Laich,Lukas Rieger,Kurt Rothermel +6 more
- 02 Jul 2018
TL;DR: ADWISE is a novel window-based streaming partitioning algorithm that increases the partitioning quality by always choosing the best edge from a set of edges for assignment to a partition, and reduces the total latency of partitioning plus processing by up to 23-47 percent compared to the state-of-the-art.
59
HYPE: Massive Hypergraph Partitioning with Neighborhood Expansion
Christian Mayer,Ruben Mayer,Sukanya Bhowmik,Lukas Epple,Kurt Rothermel +4 more
- 26 Oct 2018
TL;DR: HyPE as discussed by the authors exploits the neighborhood relations between vertices in the hypergraph using an efficient implementation of neighborhood expansion, which can improve partitioning quality by up to 95% and reduce runtime by 39% compared to streaming partitioning.
Graph Computing Systems and Partitioning Techniques: A Survey
01 Jan 2022
TL;DR: In this paper , the authors present an overview, classification, and investigation of the most popular graph partitioning and computing systems and discuss future challenges and research directions in graph partitions and computing.
References
Pregel: a system for large-scale graph processing
Grzegorz Malewicz,Matthew H. Austern,Aart J. C. Bik,James C. Dehnert,Ilan Horn,Naty Leiser,Grzegorz Czajkowski +6 more
- 06 Jun 2010
TL;DR: A model for processing large graphs that has been designed for efficient, scalable and fault-tolerant implementation on clusters of thousands of commodity computers, and its implied synchronicity makes reasoning about programs easier.
An Algorithm for Subgraph Isomorphism
TL;DR: A new algorithm is introduced that attains efficiency by inferentially eliminating successor nodes in the tree search by means of a brute-force tree-search enumeration procedure and a parallel asynchronous logic-in-memory implementation of a vital part of the algorithm is described.
VL2: a scalable and flexible data center network
Albert Greenberg,James R. Hamilton,Navendu Jain,Srikanth Kandula,Changhoon Kim,Parantap Lahiri,David A. Maltz,Parveen Patel,Sudipta Sengupta +8 more
- 16 Aug 2009
TL;DR: VL2 is a practical network architecture that scales to support huge data centers with uniform high capacity between servers, performance isolation between services, and Ethernet layer-2 semantics, and is built on a working prototype.
VL2: a scalable and flexible data center network
Albert Greenberg,James R. Hamilton,Navendu Jain,Srikanth Kandula,Changhoon Kim,Parantap Lahiri,David A. Maltz,Parveen Patel,Sudipta Sengupta +8 more
TL;DR: VL2 is a practical network architecture that scales to support huge data centers with uniform high capacity between servers, performance isolation between services, and Ethernet layer-2 semantics and can be deployed today, and a working prototype is built.
1.9K
PowerGraph: distributed graph-parallel computation on natural graphs
Joseph E. Gonzalez,Yucheng Low,Haijie Gu,Danny Bickson,Carlos Guestrin +4 more
- 08 Oct 2012
TL;DR: This paper describes the challenges of computation on natural graphs in the context of existing graph-parallel abstractions and introduces the PowerGraph abstraction which exploits the internal structure of graph programs to address these challenges.