DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Soda Pop: A Time-Series Clustering, Alarming and Disease Forecasting Application

Abstract

To introduce Soda Pop, an R/Shiny application designed to be a disease agnostic time-series clustering, alarming, and forecasting tool to assist in disease surveillance “triage, analysis and reporting” workflows within the Biosurveillance Ecosystem (BSVE). In this poster, we highlight the new capabilities that are brought to the BSVE by Soda Pop with an emphasis on the impact of methodological decisions. The Biosurveillance Ecosystem (BSVE) is a biological and chemical threat surveillance system sponsored by the Defense Threat Reduction Agency (DTRA). BSVE is intended to be user-friendly, multi-agency, cooperative, modular and threat agnostic platform for biosurveillance. In BSVE, a web-based workbench presents the analyst with applications (apps) developed by various DTRA funded researchers, which are deployed on-demand in the cloud(e.g., Amazon Web Services). These apps aim to address emerging needs and refine capabilities to enable early warning of chemical and biological threats for multiple users across local, state, and federal agencies. Soda Pop is an app developed by Pacific Northwest National Laboratory (PNNL) to meet the current needs of the BSVE for early warning and detection of disease outbreaks. Aimed for use bya diverse set of analysts, the application is agnostic to data source and spatial scale enabling it to bemore » generalizable across many diseases and locations. To achieve this, we placed a particular emphasis on clustering and alerting of disease signals within Soda Pop without strong prior assumptions on the nature of observed diseased counts. Although designed to be agnostic to the data source, Soda Pop was initially developed and tested on data summarizing Influenza-Like Illness in military hospitals from collaboration with the Armed Forces Health Surveillance Branch. Currently, the data incorporated also includes the CDC’s National Notifiable Diseases Surveillance System(NNDSS) tables and the WHO’s Influenza A/B Influenza Data(Flunet). These data sources are now present in BSVE’s Postgres data storage for direct access. Soda Pop is designed to automate time-series tasks of data summarization, exploration, clustering, alarming and forecasting. Built as an R/Shiny application, Soda Pop is founded on the powerful statistical tool R.. Where applicable, Soda Pop facilitates nonparametric seasonal decomposition of time-series; hierarchical agglomerative clustering across reporting areas and between diseases within reporting areas; and a variety of alarming techniques including Exponential Weighted Moving Average alarms and Early Aberration Detection. Soda Pop embeds these techniques within a user-interface designed to enhance an analyst’s understanding of emerging trends in their data and enables the inclusion of its graphical elements into their dossier for further tracking and reporting. The ultimate goal of this software is to facilitate the discovery of unknown disease signals along with increasing the speed of detection of unusual patterns within these signals. Soda Pop organizes common statistical disease surveillance tasks in a manner integrated with BSVE data source inputs and outputs. The app analyzes time-series disease data and supports a robust set of clustering and alarming routines that avoid strong assumptions on the nature of observed disease counts. This attribute allows for flexibility in the data source, spatial scale, and disease types making it useful to a wide range of analysts.« less

Authors:
 [1];  [1];  [1]
  1. Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Publication Date:
Research Org.:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Org.:
USDOE Office of Science (SC); Defense Threat Reduction Agency (DTRA)
OSTI Identifier:
1629204
Grant/Contract Number:  
AC05-76RL01830; CB10082
Resource Type:
Accepted Manuscript
Journal Name:
Online Journal of Public Health Informatics
Additional Journal Information:
Journal Volume: 9; Journal Issue: 1; Journal ID: ISSN 1947-2579
Publisher:
University of Illinois at Chicago
Country of Publication:
United States
Language:
English
Subject:
97 MATHEMATICS AND COMPUTING; BSVE; biosurveillance; R/shiny; clustering; alarming

Citation Formats

Rounds, Jeremiah, Charles-Smith, Lauren, and Corley, Courtney D. Soda Pop: A Time-Series Clustering, Alarming and Disease Forecasting Application. United States: N. p., 2017. Web. doi:10.5210/ojphi.v9i1.7582.
Rounds, Jeremiah, Charles-Smith, Lauren, & Corley, Courtney D. Soda Pop: A Time-Series Clustering, Alarming and Disease Forecasting Application. United States. https://doi.org/10.5210/ojphi.v9i1.7582
Rounds, Jeremiah, Charles-Smith, Lauren, and Corley, Courtney D. Tue . "Soda Pop: A Time-Series Clustering, Alarming and Disease Forecasting Application". United States. https://doi.org/10.5210/ojphi.v9i1.7582. https://www.osti.gov/servlets/purl/1629204.
@article{osti_1629204,
title = {Soda Pop: A Time-Series Clustering, Alarming and Disease Forecasting Application},
author = {Rounds, Jeremiah and Charles-Smith, Lauren and Corley, Courtney D.},
abstractNote = {To introduce Soda Pop, an R/Shiny application designed to be a disease agnostic time-series clustering, alarming, and forecasting tool to assist in disease surveillance “triage, analysis and reporting” workflows within the Biosurveillance Ecosystem (BSVE). In this poster, we highlight the new capabilities that are brought to the BSVE by Soda Pop with an emphasis on the impact of methodological decisions. The Biosurveillance Ecosystem (BSVE) is a biological and chemical threat surveillance system sponsored by the Defense Threat Reduction Agency (DTRA). BSVE is intended to be user-friendly, multi-agency, cooperative, modular and threat agnostic platform for biosurveillance. In BSVE, a web-based workbench presents the analyst with applications (apps) developed by various DTRA funded researchers, which are deployed on-demand in the cloud(e.g., Amazon Web Services). These apps aim to address emerging needs and refine capabilities to enable early warning of chemical and biological threats for multiple users across local, state, and federal agencies. Soda Pop is an app developed by Pacific Northwest National Laboratory (PNNL) to meet the current needs of the BSVE for early warning and detection of disease outbreaks. Aimed for use bya diverse set of analysts, the application is agnostic to data source and spatial scale enabling it to be generalizable across many diseases and locations. To achieve this, we placed a particular emphasis on clustering and alerting of disease signals within Soda Pop without strong prior assumptions on the nature of observed diseased counts. Although designed to be agnostic to the data source, Soda Pop was initially developed and tested on data summarizing Influenza-Like Illness in military hospitals from collaboration with the Armed Forces Health Surveillance Branch. Currently, the data incorporated also includes the CDC’s National Notifiable Diseases Surveillance System(NNDSS) tables and the WHO’s Influenza A/B Influenza Data(Flunet). These data sources are now present in BSVE’s Postgres data storage for direct access. Soda Pop is designed to automate time-series tasks of data summarization, exploration, clustering, alarming and forecasting. Built as an R/Shiny application, Soda Pop is founded on the powerful statistical tool R.. Where applicable, Soda Pop facilitates nonparametric seasonal decomposition of time-series; hierarchical agglomerative clustering across reporting areas and between diseases within reporting areas; and a variety of alarming techniques including Exponential Weighted Moving Average alarms and Early Aberration Detection. Soda Pop embeds these techniques within a user-interface designed to enhance an analyst’s understanding of emerging trends in their data and enables the inclusion of its graphical elements into their dossier for further tracking and reporting. The ultimate goal of this software is to facilitate the discovery of unknown disease signals along with increasing the speed of detection of unusual patterns within these signals. Soda Pop organizes common statistical disease surveillance tasks in a manner integrated with BSVE data source inputs and outputs. The app analyzes time-series disease data and supports a robust set of clustering and alarming routines that avoid strong assumptions on the nature of observed disease counts. This attribute allows for flexibility in the data source, spatial scale, and disease types making it useful to a wide range of analysts.},
doi = {10.5210/ojphi.v9i1.7582},
journal = {Online Journal of Public Health Informatics},
number = 1,
volume = 9,
place = {United States},
year = {Tue May 02 00:00:00 EDT 2017},
month = {Tue May 02 00:00:00 EDT 2017}
}

Works referenced in this record:

Monitoring Count Time Series in R: Aberration Detection in Public Health Surveillance
text, January 2014


Works referencing / citing this record:

Period of Measurement in Time-Series Predictions of Disease Counts from 2007 to 2017 in Northern Nevada: Analytics Experiment
journal, January 2019

  • Talaei-Khoei, Amir; Wilson, James M.; Kazemi, Seyed-Farzan
  • JMIR Public Health and Surveillance, Vol. 5, Issue 1
  • DOI: 10.2196/11357