LLAMA: Efficient graph analytics using Large Multiversioned Arrays
Peter Macko,Virendra J. Marathe,Daniel Margo,Margo Seltzer +3 more
- 13 Apr 2015
- pp 363-374
TL;DR: The evaluation shows that LLAMA's mutability introduces modest overheads of 3–18% relative to immutable CSR for in-memory execution and that it outperforms state- of-the-art out-of-memory systems in most cases, with a best case improvement of 5x on breadth-first-search.
read more
Abstract: We present LLAMA, a graph storage and analysis system that supports mutability and out-of-memory execution. LLAMA performs comparably to immutable main-memory analysis systems for graphs that fit in memory and significantly outperforms existing out-of-memory analysis systems for graphs that exceed main memory. LLAMA bases its implementation on the compressed sparse row (CSR) representation, which is a read-only representation commonly used for graph analytics. We augment this representation to support mutability and persistence using a novel implementation of multi-versioned array snapshots, making it ideal for applications that receive a steady stream of new data, but need to perform whole-graph analysis on consistent views of the data. We compare LLAMA to state-of-the-art systems on representative graph analysis workloads, showing that LLAMA scales well both out-of-memory and across parallel cores. Our evaluation shows that LLAMA's mutability introduces modest overheads of 3–18% relative to immutable CSR for in-memory execution and that it outperforms state-of-the-art out-of-memory systems in most cases, with a best case improvement of 5x on breadth-first-search.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Figures

Fig. 8: R-MAT, varying the average vertex degree on the Commodity platform using. Only the graph with 20 edges per vertex fits in memory; the sizes of the remaining graphs range from 100% to 200% of memory. 
Fig. 7: R-MAT, varying the number of vertices on the Commodity platform using. Only the graph with 225 vertices fits in memory; the sizes of the remaining graphs range from 100% to 900% of memory. 
TABLE II: BigMem Performance for LiveJournal and Twitter datasets, elapsed times shown in seconds. 
TABLE I: Commodity Machine Performance for LiveJournal and Twitter datasets, elapsed times shown in seconds. Note that neither GreenMarl nor GraphLab can run when the graph exceeds main memory. X-Stream does not implement an exact triangle counting algorithm, only an approximation, so we do not include its results. 
Fig. 5: BFS and PageRank Scaling as a Function of Core Count on the BigMem platform for the Twitter graph. 
Fig. 6: Triangle Counting as a Function of Core Count on the BigMem platform for the Twitter graph.
Citations
Gemini: a computation-centric distributed graph processing system
Xiaowei Zhu,Wenguang Chen,Weimin Zheng,Xiaosong Ma +3 more
- 02 Nov 2016
TL;DR: Gemini is presented, a distributed graph processing system that applies multiple optimizations targeting computation performance to build scalability on top of efficiency and significantly outperforms all well-known existing distributed graphprocessing systems.
Low-latency graph streaming using compressed purely-functional trees
Laxman Dhulipala,Guy E. Blelloch,Julian Shun +2 more
- 08 Jun 2019
TL;DR: Aspen as mentioned in this paper is a graph-streaming framework that extends the interface of Ligra with operations for updating graphs, which significantly improves on the space usage and locality of purely-functional trees.
117
GraFboost: using accelerated flash storage for external graph analytics
Sang-Woo Jun,Andrew Wright,Sizhuo Zhang,Shuotao Xu,Arvind +4 more
- 02 Jun 2018
TL;DR: It is demonstrated that despite the relatively small amount of DRAM, GraFBoost achieves high performance with very large graphs no other system can handle, and rivals the performance of the fastest software platforms on sizes of graphs that existing platforms can handle.
101
GraphOne: A Data Store for Real-time Analytics on Evolving Graphs
Pradeep Kumar,H. Howie Huang +1 more
TL;DR: GraphOne is designed and developed, a graph data store that abstracts thegraph data store away from the specialized systems to solve the fundamental research problems associated with the data store design and presents a new data abstraction, GraphView, to enable data access at two different granularities of data ingestions.
91
A scalable distributed graph partitioner
Daniel Margo,Margo Seltzer +1 more
- 01 Aug 2015
TL;DR: Sheep is a distributed graph partitioning algorithm capable of handling graphs that far exceed main memory and produces high quality edge partitions an order of magnitude faster than both state of the art offline partitioners and streaming partitioners.
References
•Proceedings Article
The PageRank Citation Ranking : Bringing Order to the Web
Lawrence Page,Sergey Brin,Rajeev Motwani,Terry Winograd +3 more
- 11 Nov 1999
TL;DR: This paper describes PageRank, a mathod for rating Web pages objectively and mechanically, effectively measuring the human interest and attention devoted to them, and shows how to efficiently compute PageRank for large numbers of pages.
16.4K
What is Twitter, a social network or a news media?
Haewoon Kwak,Changhyun Lee,Hosung Park,Sue Moon +3 more
- 26 Apr 2010
TL;DR: In this paper, the authors have crawled the entire Twittersphere and found a non-power-law follower distribution, a short effective diameter, and low reciprocity, which all mark a deviation from known characteristics of human social networks.
7.5K
Pregel: a system for large-scale graph processing
Grzegorz Malewicz,Matthew H. Austern,Aart J. C. Bik,James C. Dehnert,Ilan Horn,Naty Leiser,Grzegorz Czajkowski +6 more
- 06 Jun 2010
TL;DR: A model for processing large graphs that has been designed for efficient, scalable and fault-tolerant implementation on clusters of thousands of commodity computers, and its implied synchronicity makes reasoning about programs easier.
Graphs over time: densification laws, shrinking diameters and possible explanations
Jure Leskovec,Jon Kleinberg,Christos Faloutsos +2 more
- 21 Aug 2005
TL;DR: A new graph generator is provided, based on a "forest fire" spreading process, that has a simple, intuitive justification, requires very few parameters (like the "flammability" of nodes), and produces graphs exhibiting the full range of properties observed both in prior work and in the present study.
Group formation in large social networks: membership, growth, and evolution
Lars Backstrom,Daniel P. Huttenlocher,Jon Kleinberg,Xiangyang Lan +3 more
- 20 Aug 2006
TL;DR: It is found that the propensity of individuals to join communities, and of communities to grow rapidly, depends in subtle ways on the underlying network structure, and decision-tree techniques are used to identify the most significant structural determinants of these properties.