DOE PAGES
U.S. Department of Energy
Office of Scientific and Technical Information

Title: Big Data Analytics for Long-Term Meteorological Observations at Hanford Site

Journal Article · Atmosphere (Basel)

The growing number of physical objects with embedded sensors, which typically produce high-volume and frequently updated data sets, has accentuated the need to develop methodologies for extracting useful information from big data to support decision making. This study applies a suite of data analytics and core principles of data science to characterize near real-time meteorological data with a focus on extreme weather events. To highlight the applicability of this work and make it more accessible from a risk management perspective, a foundation for a software platform with an intuitive Graphical User Interface (GUI) was developed to access and analyze data from a decommissioned nuclear production complex operated by the U.S. Department of Energy (DOE, Richland, USA). Exploratory data analysis (EDA), involving classical non-parametric statistics, and machine learning (ML) techniques were used to develop statistical summaries and learn characteristic features of key weather patterns and signatures. The new approach and GUI provide key insights into using big data and ML to assist site operations related to safety management strategies for extreme weather events. Specifically, this work offers a practical guide to analyzing long-term meteorological data and highlights the integration of ML and classical statistics into applied risk and decision science.

Sponsoring Organization:
USDOE
OSTI ID:
1840080
Journal Information:
Journal Name: Atmosphere (Basel); Journal Volume: 13; Journal Issue: 1; ISSN 2073-4433; ISSN ATMOCZ
Publisher:
MDPI AG
Country of Publication:
Switzerland
Language:
English
