A Dirichlet process model for classifying and forecasting epidemic curves

dc.contributor.authorNsoesie, Elaine O.en
dc.contributor.authorLeman, Scotland C.en
dc.contributor.authorMarathe, M. V.en
dc.date.accessioned2017-02-19T01:32:51Zen
dc.date.available2017-02-19T01:32:51Zen
dc.date.issued2014-01-09en
dc.description.abstractBackground: A forecast can be defined as an endeavor to quantitatively estimate a future event or probabilities assigned to a future occurrence. Forecasting stochastic processes such as epidemics is challenging since there are several biological, behavioral, and environmental factors that influence the number of cases observed at each point during an epidemic. However, accurate forecasts of epidemics would impact timely and effective implementation of public health interventions. In this study, we introduce a Dirichlet process (DP) model for classifying and forecasting influenza epidemic curves. Methods: The DP model is a nonparametric Bayesian approach that enables the matching of current influenza activity to simulated and historical patterns, identifies epidemic curves different from those observed in the past and enables prediction of the expected epidemic peak time. The method was validated using simulated influenza epidemics from an individual-based model and the accuracy was compared to that of the tree-based classification technique, Random Forest (RF), which has been shown to achieve high accuracy in the early prediction of epidemic curves using a classification approach. We also applied the method to forecasting influenza outbreaks in the United States from 1997–2013 using influenza-like illness (ILI) data from the Centers for Disease Control and Prevention (CDC). Results: We made the following observations. First, the DP model performed as well as RF in identifying several of the simulated epidemics. Second, the DP model correctly forecasted the peak time several days in advance for most of the simulated epidemics. Third, the accuracy of identifying epidemics different from those already observed improved with additional data, as expected. Fourth, both methods correctly classified epidemics with higher reproduction numbers (R) with a higher accuracy compared to epidemics with lower R values. Lastly, in the classification of seasonal influenza epidemics based on ILI data from the CDC, the methods’ performance was comparable. Conclusions: Although RF requires less computational time compared to the DP model, the algorithm is fully supervised implying that epidemic curves different from those previously observed will always be misclassified. In contrast, the DP model can be unsupervised, semi-supervised or fully supervised. Since both methods have their relative merits, an approach that uses both RF and the DP model could be beneficial.en
dc.description.versionPublished versionen
dc.format.extent12 pagesen
dc.format.mimetypeapplication/pdfen
dc.identifier.doihttps://doi.org/10.1186/1471-2334-14-12en
dc.identifier.issn1471-2334en
dc.identifier.urihttp://hdl.handle.net/10919/75063en
dc.identifier.volume14en
dc.language.isoenen
dc.publisherBiomed Centralen
dc.relation.urihttp://gateway.webofknowledge.com/gateway/Gateway.cgi?GWVersion=2&SrcApp=PARTNER_APP&SrcAuth=LinksAMR&KeyUT=WOS:000331200700001&DestLinkType=FullRecord&DestApp=ALL_WOS&UsrCustomerID=930d57c9ac61a043676db62af60056c1en
dc.rightsCreative Commons Attribution 2.0 Genericen
dc.rights.urihttp://creativecommons.org/licenses/by/2.0/en
dc.subjectInfectious Diseasesen
dc.subjectDirichlet process modelen
dc.subjectInfluenza epidemicen
dc.subjectSimulationsen
dc.subjectIndividual-based modelen
dc.subjectEpidemic forecastingen
dc.subjectINFLUENZAen
dc.subjectPOPULATIONSen
dc.subjectLINEen
dc.titleA Dirichlet process model for classifying and forecasting epidemic curvesen
dc.title.serialBMC Infectious Diseasesen
dc.typeArticle - Refereeden
dc.type.dcmitypeTexten
pubs.organisational-group/Virginia Techen
pubs.organisational-group/Virginia Tech/All T&R Facultyen
pubs.organisational-group/Virginia Tech/Scienceen
pubs.organisational-group/Virginia Tech/Science/COS T&R Facultyen
pubs.organisational-group/Virginia Tech/Science/Statisticsen
pubs.organisational-group/Virginia Tech/University Research Institutesen
pubs.organisational-group/Virginia Tech/University Research Institutes/Biocomplexity Instituteen
pubs.organisational-group/Virginia Tech/University Research Institutes/Biocomplexity Institute/Researchersen
pubs.organisational-group/Virginia Tech/University Research Institutes/Biocomplexity Institute/SelectedFaculty1en

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
A Dirichlet process model for classifying and forecasting epidemic curves.pdf
Size:
1.69 MB
Format:
Adobe Portable Document Format
Description:
Accepted Version
License bundle
Now showing 1 - 1 of 1
Name:
VTUL_Distribution_License_2016_05_09.pdf
Size:
18.09 KB
Format:
Adobe Portable Document Format
Description: