A framework for evaluating epidemic forecasts
Farzaneh Sadat Tabataba,Prithwish Chakraborty,Naren Ramakrishnan,Srinivasan Venkatramanan,Jiangzhuo Chen,Bryan Lewis,Madhav V. Marathe +9 more
TL;DR: This paper presents an evaluation framework which allows for combining different features, error measures, and ranking schema to evaluate forecasts, and demonstrates the utility of the framework by evaluating six forecasting methods for predicting influenza in the United States.
Abstract: Over the past few decades, numerous forecasting methods have been proposed in the field of epidemic forecasting. Such methods can be classified into different categories such as deterministic vs. probabilistic, comparative methods vs. generative methods, and so on. In some of the more popular comparative methods, researchers compare observed epidemiological data from the early stages of an outbreak with the output of proposed models to forecast the future trend and prevalence of the pandemic. A significant problem in this area is the lack of standard well-defined evaluation measures to select the best algorithm among different ones, as well as for selecting the best possible configuration for a particular algorithm. In this paper we present an evaluation framework which allows for combining different features, error measures, and ranking schema to evaluate forecasts. We describe the various epidemic features (Epi-features) included to characterize the output of forecasting methods and provide suitable error measures that could be used to evaluate the accuracy of the methods with respect to these Epi-features. We focus on long-term predictions rather than short-term forecasting and demonstrate the utility of the framework by evaluating six forecasting methods for predicting influenza in the United States. Our results demonstrate that different error measures lead to different rankings even for a single Epi-feature. Further, our experimental analyses show that no single method dominates the rest in predicting all Epi-features when evaluated across error measures. As an alternative, we provide various Consensus Ranking schema that summarize individual rankings, thus accounting for different error measures. Since each Epi-feature presents a different aspect of the epidemic, multiple methods need to be combined to provide a comprehensive forecast. 
Thus we call for a more nuanced approach while evaluating epidemic forecasts and we believe that a comprehensive evaluation framework, as presented in this paper, will add value to the computational epidemiology community.
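The interplay between error measures, per-measure rankings, and a consensus ranking described in the abstract can be illustrated with a small sketch. Everything in it is an assumption made for illustration: the method names, the observed and forecast values, the choice of MAE, RMSE and MAPE as error measures, and the use of the mean rank as the consensus rule. The paper's own Epi-features, error measures and Consensus Ranking schema are not reproduced here.

```python
import math

# Hypothetical observed values of a single Epi-feature (e.g. peak value)
# over four seasons. These numbers are made up for illustration.
observed = [100.0, 120.0, 80.0, 150.0]

# Hypothetical forecasts of the same Epi-feature from three made-up methods.
forecasts = {
    "method_a": [110.0, 115.0, 90.0, 140.0],
    "method_b": [100.0, 120.0, 80.0, 120.0],
    "method_c": [95.0, 130.0, 70.0, 185.0],
}


def mae(obs, pred):
    """Mean absolute error."""
    return sum(abs(o - p) for o, p in zip(obs, pred)) / len(obs)


def rmse(obs, pred):
    """Root mean square error."""
    return math.sqrt(sum((o - p) ** 2 for o, p in zip(obs, pred)) / len(obs))


def mape(obs, pred):
    """Mean absolute percentage error (assumes no observed value is zero)."""
    return 100.0 * sum(abs(o - p) / abs(o) for o, p in zip(obs, pred)) / len(obs)


# Assumed set of error measures; lower is better for all three.
ERROR_MEASURES = {"MAE": mae, "RMSE": rmse, "MAPE": mape}


def rank_methods(obs, preds, measure):
    """Rank methods under one error measure (rank 1 = smallest error)."""
    scores = {name: measure(obs, pred) for name, pred in preds.items()}
    ordered = sorted(scores, key=scores.get)
    return {name: position + 1 for position, name in enumerate(ordered)}


def consensus_ranking(obs, preds, measures):
    """Combine per-measure rankings by mean rank (an assumed consensus rule)."""
    per_measure = {
        label: rank_methods(obs, preds, fn) for label, fn in measures.items()
    }
    mean_rank = {
        name: sum(ranks[name] for ranks in per_measure.values()) / len(per_measure)
        for name in preds
    }
    return per_measure, sorted(mean_rank, key=mean_rank.get)


per_measure, consensus = consensus_ranking(observed, forecasts, ERROR_MEASURES)
for label, ranks in per_measure.items():
    print(label, ranks)
print("consensus order:", consensus)
```

With these made-up numbers, RMSE penalizes the single large miss of method_b more heavily than MAE or MAPE do, so the top-ranked method differs between measures; the mean-rank rule then condenses the three rankings into one ordering.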
Citations
A New Typology Design of Performance Metrics to Measure Errors in Machine Learning Regression Algorithms
TL;DR: In this article, the authors proposed a new typology of performance metrics, based on the analysis of the structure and properties of various performance metrics and proposed a framework of metrics which includes four (4) categories: primary metrics, extended metrics, composite metrics, and hybrid sets of metrics.
Mathematical Models for COVID-19 Pandemic: A Comparative Analysis.
Aniruddha Adiga,Devdatt Dubhashi,Bryan Lewis,Madhav V. Marathe,Srinivasan Venkatramanan,Anil Vullikanti +5 more
TL;DR: This article reviews some of the important mathematical models used to support the ongoing planning and response efforts in the COVID-19 pandemic and discusses their use, their mathematical form and their scope.
Rational evaluation of various epidemic models based on the COVID-19 data of China.
TL;DR: In this paper, a rational evaluation of various epidemic models/methods, including seven empirical functions, four statistical inference methods and five dynamical models, on their forecasting abilities is carried out.
83
Rational evaluation of various epidemic models based on the COVID-19 data of China
TL;DR: This paper systematically investigates the forecasting ability of eight empirical functions, four statistical inference methods and five dynamical models widely used in the literature, and introduces the Akaike information criterion, root mean square errors and robustness index to quantify these three golden means and to evaluate various epidemic models/methods.
COVID-19: Forecasting confirmed cases and deaths with a simple time series model
TL;DR: In this paper, a statistical time series approach is proposed to model and predict the short-term behavior of COVID-19 outbreaks, which assumes a multiplicative trend, aiming to capture the continuation of the two variables (global confirmed cases and deaths) as well as their uncertainty.
73
References
Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions
Sung-Hyuk Cha
- 01 Jan 2007
TL;DR: Various distance/similarity measures that are applicable to compare two probability density functions (pdf in short) are reviewed and categorized in both syntactic and semantic relationships to reveal similarities among numerous distance/similarity measures.
1.9K
The M3-Competition: results, conclusions and implications
Spyros Makridakis,Michèle Hibon +1 more
TL;DR: In this paper, the M3-Competition, the latest edition of the M-Competitions, is described and its results and conclusions are compared with those of the previous two M-competitions as well as with other major empirical studies.
1.7K
•Book
Encyclopedia of Distances
Michel Deza,Elena Deza +1 more
- 06 Jun 2009
TL;DR: This book begins with several metrics in classical geometry, then proceeds to applications of distance in fields like algebra and probability, eventually working through applied mathematics, computer science, physics and chemistry, social science, and even art and religion.
1.7K
Error measures for generalizing about forecasting methods: Empirical comparisons
J. Scott Armstrong,Fred Collopy +1 more
TL;DR: In this article, the authors evaluated measures for making comparisons of errors across time series and found that the Root Mean Square Error (RMSE) is not reliable, and is therefore inappropriate for comparing accuracy across series.
1.4K
•Book
Statistical inference based on divergence measures
Leandro Pardo
- 01 Jan 2006
TL;DR: This book discusses Phi-divergence Test Statistics under Sparseness Assumptions, as well as Independence, Symmetry, Marginal Homogeneity, Quasi-symmetry, Homogeneity, and more.
719