A framework for evaluating epidemic forecasts

doi:10.1186/S12879-017-2365-1

Open AccessJournal Article10.1186/S12879-017-2365-1

A framework for evaluating epidemic forecasts

Farzaneh Sadat Tabataba, +9 more

- 15 May 2017

- BMC Infectious Diseases

- Vol. 17, Iss: 1, pp 345-345

62

TL;DR: This paper presents an evaluation framework which allows for combining different features, error measures, and ranking schema to evaluate forecasts, and demonstrates the utility of the framework by evaluating six forecasting methods for predicting influenza in the United States.

Abstract: Over the past few decades, numerous forecasting methods have been proposed in the field of epidemic forecasting. Such methods can be classified into different categories such as deterministic vs. probabilistic, comparative methods vs. generative methods, and so on. In some of the more popular comparative methods, researchers compare observed epidemiological data from the early stages of an outbreak with the output of proposed models to forecast the future trend and prevalence of the pandemic. A significant problem in this area is the lack of standard well-defined evaluation measures to select the best algorithm among different ones, as well as for selecting the best possible configuration for a particular algorithm. In this paper we present an evaluation framework which allows for combining different features, error measures, and ranking schema to evaluate forecasts. We describe the various epidemic features (Epi-features) included to characterize the output of forecasting methods and provide suitable error measures that could be used to evaluate the accuracy of the methods with respect to these Epi-features. We focus on long-term predictions rather than short-term forecasting and demonstrate the utility of the framework by evaluating six forecasting methods for predicting influenza in the United States. Our results demonstrate that different error measures lead to different rankings even for a single Epi-feature. Further, our experimental analyses show that no single method dominates the rest in predicting all Epi-features when evaluated across error measures. As an alternative, we provide various Consensus Ranking schema that summarize individual rankings, thus accounting for different error measures. Since each Epi-feature presents a different aspect of the epidemic, multiple methods need to be combined to provide a comprehensive forecast. Thus we call for a more nuanced approach while evaluating epidemic forecasts and we believe that a comprehensive evaluation framework, as presented in this paper, will add value to the computational epidemiology community.

Chat with Paper

AI Agents for this Paper

Find similar papers on Google Scholar, PubMed and Arxiv
Write a critical review of this paper
Analyze citations of this paper to find unaddressed research gaps

Citations

•Journal Article•10.28945/4184

A New Typology Design of Performance Metrics to Measure Errors in Machine Learning Regression Algorithms

Alexei Botchkarev

- 24 Jan 2019

- Interdisciplinary Journal of Information...

TL;DR: In this article, the authors proposed a new typology of performance metrics, based on the analysis of the structure and properties of various performance metrics and proposed a framework of metrics which includes four (4) categories: primary metrics, extended metrics, composite metrics, and hybrid sets of metrics.

...read moreread less

592

•Journal Article•10.1007/S41745-020-00200-6

Mathematical Models for COVID-19 Pandemic: A Comparative Analysis.

Aniruddha Adiga, +5 more

- 30 Oct 2020

- Journal of the Indian Institute of Scien...

TL;DR: This article reviews some of the important mathematical models used to support the ongoing planning and response efforts in the COVID-19 pandemic and discusses their use, their mathematical form and their scope.

...read moreread less

186

•Journal Article•10.1016/J.EPIDEM.2021.100501

Rational evaluation of various epidemic models based on the COVID-19 data of China.

Wuyue Yang, +4 more

- 25 Sep 2021

- Epidemics

TL;DR: In this paper, a rational evaluation of various epidemic models/methods, including seven empirical functions, four statistical inference methods and five dynamical models, on their forecasting abilities is carried out.

...read moreread less

83

•Posted Content•10.1101/2020.03.12.20034595

Rational evaluation of various epidemic models based on the COVID-19 data of China

Wuyue Yang, +4 more

- 16 Mar 2020

- medRxiv

TL;DR: This paper makes a systematical investigation on the forecast ability of eight widely used empirical functions, four statistical inference methods and five dynamical models widely used in the literature, and introduces the Akaike information criterion, root mean square errors and robustness index to quantify these three golden means and to evaluate various epidemic models/methods.

...read moreread less

75

•Journal Article•10.1016/j.ijforecast.2020.11.010

COVID-19: Forecasting confirmed cases and deaths with a simple time series model

None Sebastian

- 01 Apr 2022

- International Journal of Forecasting

TL;DR: In this paper , a statistical, time series approach is proposed to model and predict the short-term behavior of COVID-19 outbreaks, which assumes a multiplicative trend, aiming to capture the continuation of the two variables (global confirmed cases and deaths) as well as their uncertainty.

...read moreread less

73

...

Expand

References

Comprehensive Survey on Distance/Similarity Measures between Probability Density Functions

Sung-Hyuk Cha

- 01 Jan 2007

TL;DR: Various distance/similarity measures that are applicable to compare two probability density functions, pdf in short, are reviewed and categorized in both syntactic and semantic relationships to reveal similarities among numerous distance/Similarity measures.

...read moreread less

1.9K

Journal Article•10.1016/S0169-2070(00)00057-1

The M3-Competition: results, conclusions and implications

Spyros Makridakis, +1 more

- 01 Oct 2000

- International Journal of Forecasting

TL;DR: In this paper, the M3-Competition, the latest edition of the M-Competitions, is described and its results and conclusions are compared with those of the previous two M-competitions as well as with other major empirical studies.

...read moreread less

1.7K

•Book

Encyclopedia of Distances

Michel Deza, +1 more

- 06 Jun 2009

TL;DR: This book begins with several metrics in classical geometry, then proceeds to applications of distance in fields like algebra and probability, eventually working through applied mathematics, computer science, physics and chemistry, social science, and even art and religion.

...read moreread less

1.7K

•Journal Article•10.1016/0169-2070(92)90008-W

Error measures for generalizing about forecasting methods: Empirical comparisons

J. Scott Armstrong, +1 more

- 01 Jun 1992

- International Journal of Forecasting

TL;DR: In this article, the authors evaluated measures for making comparisons of errors across time series and found that the median absolute error of a given method to that from the random walk forecast is not reliable, and therefore inappropriate for comparing accuracy across series.

...read moreread less

1.4K

•Book

Statistical inference based on divergence measures

Leandro Pardo

- 01 Jan 2006

TL;DR: This book discusses Phi-divergence Test Statistics under Sparseness Assumptions, as well as Independence Symmetry Marginal Homogeneity Quasi-symmetry Homogeneity, and more.

...read moreread less

719