Streamflow simulation in data-scarce basins using Bayesian and physics-informed machine learning models
Journal Article · Journal of Hydrometeorology
- Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)
Hydrologic predictions in rural watersheds are important but challenging because data are scarce. Long short-term memory (LSTM) networks are a promising machine learning approach and have demonstrated good performance in streamflow prediction. However, because LSTM is data-hungry, most applications focus on well-monitored catchments with abundant, high-quality observations. In this work, we investigate the predictive capabilities of LSTM in poorly monitored watersheds with short observation records. To address three main challenges of LSTM applications in data-scarce locations, i.e., overfitting, uncertainty quantification (UQ), and out-of-distribution prediction, we evaluate different regularization techniques to prevent overfitting, apply a Bayesian LSTM for UQ, and introduce a physics-informed hybrid LSTM to enhance out-of-distribution prediction. Through case studies in two diverse sets of catchments, with and without snow influence, we demonstrate that 1) when hydrologic variability in the prediction period is similar to that in the calibration period, LSTM models can reasonably predict daily streamflow with Nash–Sutcliffe efficiency above 0.8, even with only 2 years of calibration data; 2) when the hydrologic variability in the prediction and calibration periods differs dramatically, LSTM alone does not predict well, but the hybrid model can improve the out-of-distribution prediction with acceptable generalization accuracy; 3) an L2 norm penalty and dropout can mitigate overfitting, and the Bayesian and hybrid LSTMs show no overfitting; and 4) the Bayesian LSTM provides useful uncertainty information that improves the understanding and credibility of predictions. These insights have vital implications for streamflow simulation in watersheds where data quality and availability are critical issues.
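The abstract reports skill as Nash–Sutcliffe efficiency (NSE). As a minimal sketch of how that score is commonly computed, the Python function below uses the standard definition, one minus the ratio of the squared simulation error to the variance of the observations about their mean. The function name and the sample flow values are hypothetical and chosen for illustration only; they are not taken from the article or its data.

```python
from typing import Sequence


def nash_sutcliffe_efficiency(observed: Sequence[float],
                              simulated: Sequence[float]) -> float:
    """Return the Nash-Sutcliffe efficiency of a simulated series.

    NSE = 1 - sum((obs - sim)^2) / sum((obs - mean(obs))^2)
    A value of 1 is a perfect match; 0 means the simulation is no
    better than always predicting the mean of the observations.
    """
    if len(observed) != len(simulated):
        raise ValueError("observed and simulated must have the same length")
    if len(observed) == 0:
        raise ValueError("series must not be empty")

    mean_obs = sum(observed) / len(observed)
    # Squared error between observations and simulation
    residual = sum((o - s) ** 2 for o, s in zip(observed, simulated))
    # Squared deviation of the observations from their own mean
    variance = sum((o - mean_obs) ** 2 for o in observed)
    if variance == 0:
        raise ValueError("NSE is undefined when observations are constant")
    return 1.0 - residual / variance


if __name__ == "__main__":
    # Hypothetical daily streamflow values, for illustration only
    obs = [1.0, 2.0, 3.0, 4.0, 5.0]
    sim = [1.1, 1.9, 3.2, 3.8, 5.1]
    print(f"NSE = {nash_sutcliffe_efficiency(obs, sim):.3f}")
```

Under this definition, the threshold of 0.8 quoted in the abstract means the squared simulation error is less than 20% of the variance of the observed flows.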
- Research Organization:
- Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Biological and Environmental Research (BER)
- Grant/Contract Number:
- AC05-00OR22725
- OSTI ID:
- 1808403
- Journal Information:
- Journal Name: Journal of Hydrometeorology; Journal Volume: 22; Journal Issue: 6; ISSN 1525-755X
- Publisher:
- American Meteorological Society
- Country of Publication:
- United States
- Language:
- English
Similar Records
Machine learning assisted hybrid models can improve streamflow simulation in diverse catchments across the conterminous US
Journal Article · Thu Jul 23 20:00:00 EDT 2020 · Environmental Research Letters · OSTI ID:1651326
Uncertainty quantification of machine learning models to improve streamflow prediction under changing climate and environmental conditions
Journal Article · Thu Apr 20 20:00:00 EDT 2023 · Frontiers in Water · OSTI ID:1972559
Novel Deep Learning Transformer Model for Short to Sub‐Seasonal Streamflow Forecast
Journal Article · Wed Jul 23 20:00:00 EDT 2025 · Geophysical Research Letters · OSTI ID:3002326