skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Forecasting influenza-like illness dynamics for military populations using neural networks and social media

Journal Article · · PLoS ONE

This work is the first to take advantage of recurrent neural networks to predict influenza-like-illness (ILI) dynamics from various linguistic signals extracted from social media data. Unlike other approaches that rely on timeseries analysis of historical ILI data [1, 2] and the state-of-the-art machine learning models [3, 4], we build and evaluate the predictive power of Long Short Term Memory (LSTMs) architectures capable of nowcasting (predicting in \real-time") and forecasting (predicting the future) ILI dynamics in the 2011 { 2014 influenza seasons. To build our models we integrate information people post in social media e.g., topics, stylistic and syntactic patterns, emotions and opinions, and communication behavior. We then quantitatively evaluate the predictive power of different social media signals and contrast the performance of the-state-of-the-art regression models with neural networks. Finally, we combine ILI and social media signals to build joint neural network models for ILI dynamics prediction. Unlike the majority of the existing work, we specifically focus on developing models for local rather than national ILI surveillance [1], specifically for military rather than general populations [3] in 26 U.S. and six international locations. Our approach demonstrates several advantages: (a) Neural network models learned from social media data yield the best performance compared to previously used regression models. (b) Previously under-explored language and communication behavior features are more predictive of ILI dynamics than syntactic and stylistic signals expressed in social media. (c) Neural network models learned exclusively from social media signals yield comparable or better performance to the models learned from ILI historical data, thus, signals from social media can be potentially used to accurately forecast ILI dynamics for the regions where ILI historical data is not available. (d) Neural network models learned from combined ILI and social media signals significantly outperform models that rely solely on ILI historical data, which adds to a great potential of alternative public sources for ILI dynamics prediction. (e) Location-specific models outperform previously used location-independent models e.g., U.S. only. (f) Prediction results significantly vary across geolocations depending on the amount of social media data available and ILI activity patterns.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
Grant/Contract Number:
AC05-76RL01830
OSTI ID:
1426366
Report Number(s):
PNNL-SA-124148; 453040142
Journal Information:
PLoS ONE, Vol. 12, Issue 12; ISSN 1932-6203
Publisher:
Public Library of ScienceCopyright Statement
Country of Publication:
United States
Language:
English
Citation Metrics:
Cited by: 43 works
Citation information provided by
Web of Science

References (23)

National and Local Influenza Surveillance through Twitter: An Analysis of the 2012-2013 Influenza Epidemic journal December 2013
Global Disease Monitoring and Forecasting with Wikipedia journal November 2014
Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance journal October 2015
Text and Structural Data Mining of Influenza Mentions in Web and Social Media journal February 2010
Wikipedia Usage Estimates Prevalence of Influenza-Like Illness in the United States in Near Real-Time journal April 2014
Multiple Estimates of Transmissibility for the 2009 Influenza Pandemic Based on Influenza-like-Illness Data from Small US Military Populations journal May 2013
Long Short-Term Memory journal November 1997
Social and News Media Enable Estimation of Epidemiological Patterns Early in the 2010 Haitian Cholera Outbreak journal January 2012
Guess Who’s Not Coming to Dinner? Evaluating Online Restaurant Reservations for Disease Surveillance journal January 2014
Uncovering the relationships between military community health and affects expressed in social media journal June 2017
Using Social Media for Actionable Disease Surveillance and Outbreak Management: A Systematic Literature Review journal October 2015
Monitoring the Impact of Influenza by Age: Emergency Department Fever and Respiratory Complaint Surveillance in New York City journal August 2007
Social Media and Internet-Based Data in Global Systems for Public Health Surveillance: A Systematic Review: Social Media and Internet-Based Data for Public Health Surveillance journal March 2014
Enhancing disease surveillance with novel data streams: challenges and opportunities journal October 2015
Account Deletion Prediction on RuNet: A Case Study of Suspicious Twitter Accounts Active During the Russian-Ukrainian Crisis conference January 2016
Tracking Twitter for epidemic intelligence: case study: EHEC/HUS outbreak in Germany, 2011 conference January 2012
Forecasting ILI Twitter Data dataset January 2017
Combining Search, Social Media, and Traditional Data Sources to Improve Influenza Surveillance text January 2015
Global Disease Monitoring and Forecasting with Wikipedia journal March 2016
Ebola — A Growing Threat? journal October 2014
HealthMap: Global Infectious Disease Monitoring through Automated Classification and Visualization of Internet Media Reports journal March 2008
Twitter Improves Influenza Forecasting journal January 2014
Forecasting ILI Twitter Data dataset January 2017

Cited By (10)

Development and evaluation of a deep learning approach for modeling seasonality and trends in hand-foot-mouth disease incidence in mainland China journal May 2019
Even a good influenza forecasting model can benefit from internet-based nowcasts, but those benefits are limited journal February 2019
Deep Learning Intervention for Health Care Challenges: Some Biomedical Domain Considerations journal January 2019
Predicting Infectious Disease Using Deep Learning and Big Data journal July 2018
Improving probabilistic infectious disease forecasting through coherence journal January 2021
Computational Forecasting Methodology for Acute Respiratory Infectious Disease Dynamics
  • Gónzalez-Bandala, Daniel Alejandro; Cuevas-Tello, Juan Carlos; Noyola, Daniel E.
  • International Journal of Environmental Research and Public Health, Vol. 17, Issue 12 https://doi.org/10.3390/ijerph17124540
journal June 2020
Predicting the Flu from Instagram preprint January 2018
Multiple Document Representations from News Alerts for Automated Bio-surveillance Event Detection preprint January 2019
TDEFSI: Theory Guided Deep Learning Based Epidemic Forecasting with Synthetic Information preprint January 2020
Incorporating Expert Guidance in Epidemic Forecasting preprint January 2021