Proceedings Article10.1109/ICDCS.2019.00128
A Latent Hawkes Process Model for Event Clustering and Temporal Dynamics Learning with Applications in GitHub
Shengzhong Liu,Shuochao Yao,Dongxin Liu,Huajie Shao,Yiran Zhao,Xinzhe Fu,Tarek Abdelzaher +6 more
- 07 Jul 2019
- pp 1275-1285
6
TL;DR: A user community based generative model, called latent Hawkes process (LHP), taking into account both-side information to illustrate the generation of such inter-dependent event streams on GitHub repositories, is developed, and the effectiveness of LHP is validated in extracting user community structures and learning their correlated temporal dynamics.
read more
Abstract: Large volumes of event data are becoming increasingly available on online social networks. These events are usually causally dependent to each other, reflecting the interactions and collaborations among different parties. Learning and interpreting the temporal patterns and dynamics within these event streams plays an important role in many practical applications, such as trend prediction and anomaly detection. Since causal dependencies can be reflected in both event time (i.e., when) and event content (i.e., who and what), we thus develop a user community based generative model, called latent Hawkes process (LHP), taking into account both-side information to illustrate the generation of such inter-dependent event streams on GitHub repositories, where each attribute is assumed to be generated by interplays between correlated latent communities. Through learning of our model, two functionalities are fulfilled concurrently: event clustering (i.e., community discovery) and temporal dependency learning among these clusters (i.e., dependency profiling). To do so, we design an EM-based framework integrating sequential Monte Carlo sampling to estimate model parameters in an end-to-end manner. Through experiments on practical GitHub event data, we validate the effectiveness of LHP in extracting user community structures and learning their correlated temporal dynamics. Such knowledge further enables us to gain new insights into the development status of software, such as the project persistence and anomaly detection.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
DyDiff-VAE: A Dynamic Variational Framework for Information Diffusion Prediction
Ruijie Wang,Zijie Huang,Shengzhong Liu,Huajie Shao,Dongxin Liu,Jinyang Li,Tianshi Wang,Dachun Sun,Shuochao Yao,Tarek Abdelzaher +9 more
- 11 Jul 2021
TL;DR: Zhang et al. as mentioned in this paper proposed a novel diffusion model, DyDiff-VAE, to estimate the propagation likelihood for other potential users and predict the corresponding user rankings, given the initial content and a sequence of forwarding users.
NesTPP: Modeling Thread Dynamics in Online Discussion Forums
Chen Ling,Guangmo Tong,Mozi Chen +2 more
- 13 Jul 2020
TL;DR: The proposed model views the entire event space as a nested structure composed of main thread streams and their linked reply streams, and it explicitly models the correlations between these two types of streams through their intensity functions.
5
•Posted Content
NesTPP: Modeling Thread Dynamics in Online Discussion Forums
Chen Ling,Guangmo Tong,Mozi Chen +2 more
TL;DR: In this paper, a novel temporal point process model is proposed to characterize information cascades in online discussion forums, which views the entire event space as a nested structure composed of main thread streams and their linked reply streams, and explicitly models correlations between these two types of streams through their intensity functions.
•Posted Content
SocialGrid: A TCN-enhanced Method for Online Discussion Forecasting.
TL;DR: This work proposed a novel yet simple framework, called SocialGrid, for modeling events in online discussing forms, that transforms the entire event space into a grid representation by grouping successive evens in one time interval of a particular length.
•Posted Content
DyDiff-VAE: A Dynamic Variational Framework for Information Diffusion Prediction
Ruijie Wang,Zijie Huang,Shengzhong Liu,Huajie Shao,Dongxin Liu,Jinyang Li,Tianshi Wang,Dachun Sun,Shuochao Yao,Tarek Abdelzaher +9 more
TL;DR: Zhang et al. as mentioned in this paper proposed a novel diffusion model, DyDiff-VAE, to estimate the propagation likelihood for other potential users and predict the corresponding user rankings, given the initial content and a sequence of forwarding users.
References
Modeling Interdependent and Periodic Real-World Action Sequences
TL;DR: In this paper, the authors develop a statistical model for time-varying, interdependent, and periodic action sequences, which is based on personalized, multivariate temporal point processes.
Modeling Interdependent and Periodic Real-World Action Sequences
Takeshi Kurashima,Tim Althoff,Jure Leskovec +2 more
- 23 Apr 2018
TL;DR: TIPAS improves predictions of actions, and their timing, over existing methods across multiple datasets by up to 156%, and up to 37%, respectively, and demonstrates that explicit modeling of dependencies and periodicities in real-world behavior enables successful predictions of future actions.
40
•Posted Content
Modeling the Dynamics of Online Learning Activity
TL;DR: A novel modeling framework for clustering continuous-time grouped streaming data, the hierarchical Dirichlet Hawkes process (HDHP), is introduced, which allows us to automatically uncover a wide variety of learning patterns from detailed traces of learning activity.
16
A Constrained Maximum Likelihood Estimator for Unguided Social Sensing
Huajie Shao,Shuochao Yao,Yiran Zhao,Chao Zhang,Jinda Han,Lance M. Kaplan,Lu Su,Tarek Abdelzaher +7 more
- 16 Apr 2018
TL;DR: This paper develops a constrained expectation maximization algorithm (CEM) that improves the accuracy of truth estimation in unguided social sensing applications by forcing the algorithm to converge to a solution in which the constraint is satisfied, hence significantly improving the quality of fact-finding results.
14
Predicting Learner Interactions in Social Learning Networks
Tsung-Yen Yang,Christopher G. Brinton,Carlee Joe-Wong +2 more
- 08 Oct 2018
TL;DR: A time-series prediction methodology that uses a recurrent neural network architecture to pass network state between time periods and finds that several input features have opposite directions of correlation between link formation and post quality, suggesting that response time and quality are two competing objectives to be accounted for in SLN link recommendation systems.
14