Journal Article10.1038/s41467-024-45598-0
Sequential stacking link prediction algorithms for temporal networks
Xie He,Amirhossein Ghasemian,Eun Lee,Aaron Clauset,Peter J. Mucha +4 more
5
TL;DR: This sequential stacking link prediction method uses 41 static network features that avoid detailed feature engineering choices and is capable of learning a highly accurate predictive distribution of future connections from historical data.
read more
Abstract: Link prediction algorithms are indispensable tools in many scientific applications by speeding up network data collection and imputing missing connections. However, in many systems, links change over time and it remains unclear how to optimally exploit such temporal information for link predictions in such networks. Here, we show that many temporal topological features, in addition to having high computational cost, are less accurate in temporal link prediction than sequentially stacked static network features. This sequential stacking link prediction method uses 41 static network features that avoid detailed feature engineering choices and is capable of learning a highly accurate predictive distribution of future connections from historical data. We demonstrate that this algorithm works well for both partially observed and completely unobserved target layers, and on two temporal stochastic block models achieves near-oracle-level performance when combined with other single predictor methods as an ensemble learning method. Finally, we empirically illustrate that stacking multiple predictive methods together further improves performance on 19 real-world temporal networks from different domains.
read more
Chat with Paper
AI Agents for this Paper
Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps
Citations
Link prediction accuracy on real-world networks under non-uniform missing-edge patterns
Xie He,Amir Ghasemian,Eun Lee,Alice C. Schwarze,Aaron Clauset,Peter J. Mucha +5 more
TL;DR: This study investigates the impact of non-uniform missing-edge patterns on link prediction accuracy in 250 real-world networks from 6 domains, comparing 9 algorithms across 20 missing-edge patterns to guide future researchers in selecting suitable link prediction methods.
1
Cooperation dynamics in multilayer networks under recommendation and vigilance mechanisms
Shike Yang,Xincheng Hu,Haobin Shi,Qiang Chen,Fei Ma +4 more
A new algorithm for identifying influential nodes in multiplex networks
Ru Zheng,Yaoqi GUO,Yamir Moreno +2 more
Link Prediction Accuracy on Real-World Networks Under Non-Uniform Missing Edge Patterns
Xie He,Amirhossein Ghasemian,Eun Lee,Alice Schwarze,Aaron Clauset,Peter J. Mucha +5 more
TL;DR: This study illustrates that different prediction algorithms exhibit significant differences in accuracy contingent upon both the dataset domain and the nature of the missingness pattern, and provides guidance for selecting appropriate prediction algorithms when encountering diverse patterns of missing data across various domains.
Evaluating the Performance of a Stacking-Based Ensemble Model for Daily Temperature Prediction
Qiwei Xu,An-Qi Guo,Wangzhi Yu,Chenfei He +3 more
TL;DR: This study develops a stacking-based ensemble model for daily temperature prediction in Algiers, outperforming traditional methods and base models, and demonstrates the potential of stacking-based ensemble learning in accurately predicting daily temperatures.
References
Random Forests
Leo Breiman
- 01 Oct 2001
TL;DR: Internal estimates monitor error, strength, and correlation and these are used to show the response to increasing the number of features used in the forest, and are also applicable to regression.
The meaning and use of the area under a receiver operating characteristic (ROC) curve.
TL;DR: A representation and interpretation of the area under a receiver operating characteristic (ROC) curve obtained by the "rating" method, or by mathematical predictions based on patient characteristics, is presented and it is shown that in such a setting the area represents the probability that a randomly chosen diseased subject is (correctly) rated or ranked with greater suspicion than a random chosen non-diseased subject.
21.8K
An introduction to ROC analysis
TL;DR: The purpose of this article is to serve as an introduction to ROC graphs and as a guide for using them in research.
21.3K
Ror2 signaling regulates Golgi structure and transport through IFT20 for tumor invasiveness
Michiru Nishita,Seung-Yeol Park,Tadashi Nishio,Koki Kamizaki,ZhiChao Wang,Kota Tamada,Toru Takumi,Ryuju Hashimoto,Hiroki Otani,Gregory J. Pazour,Victor W. Hsu,Yasuhiro Minami +11 more
TL;DR: It is found that intraflagellar transport 20 mediates the ability of Ror2 signaling to induce the invasiveness of tumors that lack primary cilia, and IFT20 regulates the nucleation of Golgi-derived microtubules by affecting the GM130-AKAP450 complex.
No free lunch theorems for optimization
TL;DR: A framework is developed to explore the connection between effective optimization algorithms and the problems they are solving and a number of "no free lunch" (NFL) theorems are presented which establish that for any algorithm, any elevated performance over one class of problems is offset by performance over another class.