Incremental graph pattern matching
Wenfei Fan,Jianzhong Li,Jizhou Luo,Zijing Tan,Xin Wang,Yinghui Wu +5 more
- 12 Jun 2011
- pp 925-936
TL;DR: Investigating incremental algorithms for graph pattern matching defined in terms of graph simulation, bounded simulation and subgraph isomorphism shows that the incremental matching problem is unbounded, i.e., its cost is not determined by the size of the changes alone, and demonstrates that the problem is intractable and unbounded for unit updates and path patterns.
read more
Abstract: Graph pattern matching has become a routine process in emerging applications such as social networks. In practice a data graph is typically large, and is frequently updated with small changes. It is often prohibitively expensive to recompute matches from scratch via batch algorithms when the graph is updated. With this comes the need for incremental algorithms that compute changes to the matches in response to updates, to minimize unnecessary recomputation. This paper investigates incremental algorithms for graph pattern matching defined in terms of graph simulation, bounded simulation and subgraph isomorphism. (1) For simulation, we provide incremental algorithms for unit updates and certain graph patterns. These algorithms are optimal: in linear time in the size of the changes in the input and output, which characterizes the cost that is inherent to the problem itself. For general patterns we show that the incremental matching problem is unbounded, i.e., its cost is not determined by the size of the changes alone. (2) For bounded simulation, we show that the problem is unbounded even for unit updates and path patterns. (3) For subgraph isomorphism, we show that the problem is intractable and unbounded for unit updates and path patterns. (4) For multiple updates, we develop an incremental algorithm for each of simulation, bounded simulation and subgraph isomorphism. We experimentally verify that these incremental algorithms significantly outperform their batch counterparts in response to small changes, using real-life data and synthetic data.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
•Journal Article
ACM Transactions on Database Systems
Dan Suciu,Gerhard Weikum +1 more
TL;DR: BLOCKIN BLOCKINÒ BLOCKin× ½¸ÔÔº ¾ßß¿º ¿ ¾ ¾ à ¼ à à 0
425
Hybrid SDN Networks: A Survey of Existing Approaches
TL;DR: A comprehensive up-to-date survey of the research and development in the field of hybrid SDN networks is presented and guidelines for future research on hybridSDN networks are derived.
340
Effective community search for large attributed graphs
Yixiang Fang,Reynold Cheng,Siqiang Luo,Jiafeng Hu +3 more
- 01 Aug 2016
TL;DR: The results show that ACs are more effective and efficient than existing community retrieval approaches, and contains more precise and personalized information than that of existing community search and detection methods.
Graph Pattern Matching.
Yinghui Wu,Arijit Khan +1 more
- 01 Jan 2019
TL;DR: In this article, a class of graph patterns, called dag patterns, is proposed, in which an edge denotes the connectivity in a data graph within a predefined number of hops.
241
A subgraph isomorphism algorithm and its application to biochemical data
TL;DR: This work proposes a new subgraph isomorphism algorithm which applies a search strategy to significantly reduce the search space without using any complex pruning rules or domain reduction procedures.
References
•Book
Computers and Intractability: A Guide to the Theory of NP-Completeness
Michael Randolph Garey,David S. Johnson +1 more
- 01 Jan 1979
TL;DR: The second edition of a quarterly column as discussed by the authors provides a continuing update to the list of problems (NP-complete and harder) presented by M. R. Garey and myself in our book "Computers and Intractability: A Guide to the Theory of NP-Completeness,” W. H. Freeman & Co., San Francisco, 1979.
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
- 06 Dec 2004
TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.
•Book
Communication and Concurrency
Robin Milner
- 01 Jan 1989
TL;DR: This chapter discusses Bisimulation and Observation Equivalence as a Modelling Communication, a Programming Language, and its application to Equational laws.
9K
Related Papers (5)
Monika Henzinger,Thomas A. Henzinger,Peter W. Kopke +2 more
- 23 Oct 1995
Xifeng Yan,Philip S. Yu,Jiawei Han +2 more
- 13 Jun 2004