Proceedings Article10.1145/3485447.3512165
Resource-Efficient Training for Large Graph Convolutional Networks with Label-Centric Cumulative Sampling
Ming-Shr Matt Lin,Wenzhong Li,Ding Li,Yizhou Chen,Sanglu Lu +4 more
- 25 Apr 2022
6
TL;DR: It is argued that a GCN can be trained with a sampled subgraph to produce approximate node representations, which inspires a novel perspective to accelerate GCN training via network sampling and a label-centric cumulative sampling (LCS) framework is proposed for training GCNs for large graphs.
read more
Abstract: Graph Convolutional Networks (GCNs) are popular for learning representation of graph data and have a wide range of applications in social networks, recommendation systems, etc. However, training GCN models for large networks is resource intensive and time consuming, which hinders them from real deployment. The existing GCN training methods intended to optimize the sampling of mini-batches for stochastic gradient descent to accelerate training process, which did not reduce the problem size and had limited reduction in computation complexity. In this paper, we argue that a GCN can be trained with a sampled subgraph to produce approximate node representations, which inspires us a novel perspective to accelerate GCN training via network sampling. To this end, we propose a label-centric cumulative sampling (LCS) framework for training GCNs for large graphs. The proposed method constructs a subgraph cumulatively based on probabilistic sampling, and trains the GCN model iteratively to generate approximate node representations. The optimality of LCS is theoretically guaranteed to minimize the bias during node aggregation procedure in GCN training. Extensive experiments based on four real-world network datasets show that the LCS framework accelerates the training for the state-of-the-art GCN models up to 17x without causing noteworthy model accuracy drop.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
GraphGPT: Graph Instruction Tuning for Large Language Models
Jiabin Tang,Yuhao Yang,Wei Wei,Lei Shi,Lixin Su,Suqi Cheng,Dawei Yin,Chao Huang +7 more
- 10 Jul 2024
27
Coupled Attention Networks for Multivariate Time Series Anomaly Detection
TL;DR: Wang et al. as discussed by the authors proposed a coupled attention-based neural network framework (CAN) for anomaly detection in multivariate time series data featuring dynamic variable relationships, which combines adaptive graph learning methods with graph attention to generate a global-local graph that can represent both global correlations and dynamic local correlations among sensors.
Large Language Models for Graphs: Progresses and Directions
Chao Huang,Xubin Ren,Jiabin Tang,Dawei Yin,Nitesh V. Chawla +4 more
- 13 May 2024
TL;DR: Large language models for graphs: Recent progress and future directions. LLMs enable high-quality graph representation learning by addressing challenges such as noise and sparse labels, and feature construction variations.
2
K-neighboring on Multi-weighted Graphs for Passenger Count Prediction on Railway Networks
Hangli Ge,Takashi Michikata,Noboru Koshizuka +2 more
GraphGPT: Graph Instruction Tuning for Large Language Models
Jiabin Tang,Yuhao Yang,Wei Wei,Lei Shi,Lixin Su,Su-hua Cheng,Dawei Yin,Chao Huang +7 more
TL;DR: The GraphGPT framework is presented, which combines a text-graph grounding component to establish a connection between textual information and graph structures, and a dual-stage instruction tuning paradigm, accompanied by a lightweight graph-text alignment projector, that explores self-supervised graph structural signals and task-specific graph instructions.
References
•Journal Article
Visualizing Data using t-SNE
TL;DR: A new technique called t-SNE that visualizes high-dimensional data by giving each datapoint a location in a two or three-dimensional map, a variation of Stochastic Neighbor Embedding that is much easier to optimize, and produces significantly better visualizations by reducing the tendency to crowd points together in the center of the map.
•Posted Content
Semi-Supervised Classification with Graph Convolutional Networks
Thomas Kipf,Max Welling +1 more
TL;DR: A scalable approach for semi-supervised learning on graph-structured data that is based on an efficient variant of convolutional neural networks which operate directly on graphs which outperforms related methods by a significant margin.
22.7K
•Proceedings Article
The PageRank Citation Ranking : Bringing Order to the Web
Lawrence Page,Sergey Brin,Rajeev Motwani,Terry Winograd +3 more
- 11 Nov 1999
TL;DR: This paper describes PageRank, a mathod for rating Web pages objectively and mechanically, effectively measuring the human interest and attention devoted to them, and shows how to efficiently compute PageRank for large numbers of pages.
16.4K
•Posted Content
Inductive Representation Learning on Large Graphs
TL;DR: GraphSAGE is presented, a general, inductive framework that leverages node feature information (e.g., text attributes) to efficiently generate node embeddings for previously unseen data and outperforms strong baselines on three inductive node-classification benchmarks.
11.9K
•Proceedings Article
Proceedings of the 28th International Conference on Machine Learning, ICML 2011
Darío García-García,Ulrike von Luxburg,Raul Santos-Rodriguez +2 more
- 07 Oct 2011
4.9K