DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Quantifying model structural error: Efficient Bayesian calibration of a regional groundwater flow model using surrogates and a data-driven error model

Abstract

Groundwater model structural error is ubiquitous, due to simplification and/or misrepresentation of real aquifer systems. During model calibration, the basic hydrogeological parameters may be adjusted to compensate for structural error. This may result in biased predictions when such calibrated models are used to forecast aquifer responses to new forcing. Here, we investigate the impact of model structural error on calibration and prediction of a real-world groundwater flow model, using a Bayesian method with a data-driven error model to explicitly account for model structural error. The error-explicit Bayesian method jointly infers model parameters and structural error and thereby reduces parameter compensation. In this study, Bayesian inference is facilitated using high performance computing and fast surrogate models (based on machine learning techniques) as a substitute for the computationally expensive groundwater model. We demonstrate that with explicit treatment of model structural error, the Bayesian method yields parameter posterior distributions that are substantially different from those derived using classical Bayesian calibration that does not account for model structural error. We also found that the error-explicit Bayesian method gives signficantly more accurate prediction along with reasonable credible intervals. Finally, through variance decomposition, we provide a comprehensive assessment of prediction uncertainty contributed from parameter, model structure,more » and measurement uncertainty. The results suggest that the error-explicit Bayesian approach provides a solution to real-world modeling applications for which data support the presence of model structural error, yet model deficiency cannot be specifically identified or corrected.« less

Authors:
ORCiD logo [1]; ORCiD logo [2]; ORCiD logo [3];  [4]
  1. Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Dept. of Civil and Environmental Engineering; Michigan State Univ., East Lansing, MI (United States). Dept. of Earth and Environmental Sciences
  2. Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Dept. of Civil and Environmental Engineering
  3. Florida State Univ., Tallahassee, FL (United States). Dept. of Scientific Computing
  4. Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Dept. of Statistics
Publication Date:
Research Org.:
Florida State Univ., Tallahassee, FL (United States)
Sponsoring Org.:
USDOE Office of Science (SC)
OSTI Identifier:
1532997
Grant/Contract Number:  
SC0008272
Resource Type:
Accepted Manuscript
Journal Name:
Water Resources Research
Additional Journal Information:
Journal Volume: 53; Journal Issue: 5; Journal ID: ISSN 0043-1397
Publisher:
American Geophysical Union (AGU)
Country of Publication:
United States
Language:
English
Subject:
54 ENVIRONMENTAL SCIENCES; environmental sciences & ecology; marine & freshwater biology; water resources; Bayesian calibration; uncertainty decomposition; model structural error; surrogate modeling; groundwater

Citation Formats

Xu, Tianfang, Valocchi, Albert J., Ye, Ming, and Liang, Feng. Quantifying model structural error: Efficient Bayesian calibration of a regional groundwater flow model using surrogates and a data-driven error model. United States: N. p., 2017. Web. doi:10.1002/2016wr019831.
Xu, Tianfang, Valocchi, Albert J., Ye, Ming, & Liang, Feng. Quantifying model structural error: Efficient Bayesian calibration of a regional groundwater flow model using surrogates and a data-driven error model. United States. https://doi.org/10.1002/2016wr019831
Xu, Tianfang, Valocchi, Albert J., Ye, Ming, and Liang, Feng. Mon . "Quantifying model structural error: Efficient Bayesian calibration of a regional groundwater flow model using surrogates and a data-driven error model". United States. https://doi.org/10.1002/2016wr019831. https://www.osti.gov/servlets/purl/1532997.
@article{osti_1532997,
title = {Quantifying model structural error: Efficient Bayesian calibration of a regional groundwater flow model using surrogates and a data-driven error model},
author = {Xu, Tianfang and Valocchi, Albert J. and Ye, Ming and Liang, Feng},
abstractNote = {Groundwater model structural error is ubiquitous, due to simplification and/or misrepresentation of real aquifer systems. During model calibration, the basic hydrogeological parameters may be adjusted to compensate for structural error. This may result in biased predictions when such calibrated models are used to forecast aquifer responses to new forcing. Here, we investigate the impact of model structural error on calibration and prediction of a real-world groundwater flow model, using a Bayesian method with a data-driven error model to explicitly account for model structural error. The error-explicit Bayesian method jointly infers model parameters and structural error and thereby reduces parameter compensation. In this study, Bayesian inference is facilitated using high performance computing and fast surrogate models (based on machine learning techniques) as a substitute for the computationally expensive groundwater model. We demonstrate that with explicit treatment of model structural error, the Bayesian method yields parameter posterior distributions that are substantially different from those derived using classical Bayesian calibration that does not account for model structural error. We also found that the error-explicit Bayesian method gives signficantly more accurate prediction along with reasonable credible intervals. Finally, through variance decomposition, we provide a comprehensive assessment of prediction uncertainty contributed from parameter, model structure, and measurement uncertainty. The results suggest that the error-explicit Bayesian approach provides a solution to real-world modeling applications for which data support the presence of model structural error, yet model deficiency cannot be specifically identified or corrected.},
doi = {10.1002/2016wr019831},
journal = {Water Resources Research},
number = 5,
volume = 53,
place = {United States},
year = {Mon May 08 00:00:00 EDT 2017},
month = {Mon May 08 00:00:00 EDT 2017}
}

Journal Article:
Free Publicly Available Full Text
Publisher's Version of Record

Citation Metrics:
Cited by: 55 works
Citation information provided by
Web of Science

Save / Share:

Works referenced in this record:

A unified approach for process‐based hydrologic modeling: 1. Modeling concept
journal, April 2015

  • Clark, Martyn P.; Nijssen, Bart; Lundquist, Jessica D.
  • Water Resources Research, Vol. 51, Issue 4
  • DOI: 10.1002/2015WR017198

Maximum likelihood Bayesian averaging of spatial variability models in unsaturated fractured tuff: MAXIMUM LIKELIHOOD BAYESIAN MODEL AVERAGING
journal, May 2004

  • Ye, Ming; Neuman, Shlomo P.; Meyer, Philip D.
  • Water Resources Research, Vol. 40, Issue 5
  • DOI: 10.1029/2003WR002557

Bayesian Calibration and Uncertainty Analysis for Computationally Expensive Models Using Optimization and Radial Basis Function Approximation
journal, June 2008

  • Bliznyuk, Nikolay; Ruppert, David; Shoemaker, Christine
  • Journal of Computational and Graphical Statistics, Vol. 17, Issue 2
  • DOI: 10.1198/106186008X320681

Uncertainty in the spatial prediction of soil texture
journal, January 2012


Inference from Iterative Simulation Using Multiple Sequences
journal, November 1992


Treatment of input uncertainty in hydrologic modeling: Doing hydrology backward with Markov chain Monte Carlo simulation: FORCING DATA ERROR USING MCMC SAMPLING
journal, December 2008

  • Vrugt, Jasper A.; ter Braak, Cajo J. F.; Clark, Martyn P.
  • Water Resources Research, Vol. 44, Issue 12
  • DOI: 10.1029/2007WR006720

Learning about physical parameters: the importance of model discrepancy
journal, October 2014


Accelerating Markov Chain Monte Carlo Simulation by Differential Evolution with Self-Adaptive Randomized Subspace Sampling
journal, January 2009

  • Vrugt, J. A.; ter Braak, C. J. F.; Diks, C. G. H.
  • International Journal of Nonlinear Sciences and Numerical Simulation, Vol. 10, Issue 3
  • DOI: 10.1515/IJNSNS.2009.10.3.273

Uncertainty in water quality modelling: The applicability of Variance Decomposition Approach
journal, November 2010


Linking statistical bias description to multiobjective model calibration: STATISTICAL DESCRIPTION OF BIAS
journal, September 2012


A Bayesian approach to improved calibration and prediction of groundwater models with structural error
journal, November 2015

  • Xu, Tianfang; Valocchi, Albert J.
  • Water Resources Research, Vol. 51, Issue 11
  • DOI: 10.1002/2015WR017912

A framework for dealing with uncertainty due to model structure error
journal, November 2006


Markov Chain Monte Carlo Convergence Diagnostics: A Comparative Review
journal, June 1996


A short exploration of structural noise: A SHORT EXPLORATION OF STRUCTURAL NOISE
journal, May 2010


A computationally efficient parallel Levenberg-Marquardt algorithm for highly parameterized inverse model analyses: PARALLEL LEVENBERG-MARQUARDT FOR INVERSE MODELING
journal, September 2016

  • Lin, Youzuo; O'Malley, Daniel; Vesselinov, Velimir V.
  • Water Resources Research, Vol. 52, Issue 9
  • DOI: 10.1002/2016WR019028

Effective Groundwater Model Calibration
book, January 2007


Review of surrogate modeling in water resources: REVIEW
journal, July 2012

  • Razavi, Saman; Tolson, Bryan A.; Burn, Donald H.
  • Water Resources Research, Vol. 48, Issue 7
  • DOI: 10.1029/2011WR011527

Assessment of parametric uncertainty for groundwater reactive transport modeling
journal, May 2014

  • Shi, Xiaoqing; Ye, Ming; Curtis, Gary P.
  • Water Resources Research, Vol. 50, Issue 5
  • DOI: 10.1002/2013WR013755

A theory for modeling ground-water flow in heterogeneous media
report, January 2004


A Stochastic Collocation Approach to Bayesian Inference in Inverse Problems
journal, January 2009


Towards a comprehensive assessment of model structural adequacy: ASSESSMENT OF MODEL STRUCTURAL ADEQUACY
journal, August 2012

  • Gupta, Hoshin V.; Clark, Martyn P.; Vrugt, Jasper A.
  • Water Resources Research, Vol. 48, Issue 8
  • DOI: 10.1029/2011WR011044

Bayesian Kernel Methods
book, January 2003


A philosophical basis for hydrological uncertainty
journal, May 2016


Random Forest:  A Classification and Regression Tool for Compound Classification and QSAR Modeling
journal, November 2003

  • Svetnik, Vladimir; Liaw, Andy; Tong, Christopher
  • Journal of Chemical Information and Computer Sciences, Vol. 43, Issue 6
  • DOI: 10.1021/ci034160g

Daily streamflow forecasting by machine learning methods with weather and climate inputs
journal, January 2012


Use of paired simple and complex models to reduce predictive bias and quantify uncertainty: PAIRED SIMPLE AND COMPLEX MODELS
journal, December 2011

  • Doherty, John; Christensen, Steen
  • Water Resources Research, Vol. 47, Issue 12
  • DOI: 10.1029/2011WR010763

Efficient nonlinear predictive error variance for highly parameterized models: EFFICIENT NONLINEAR PREDICTIVE ERROR
journal, July 2007

  • Tonkin, Matthew; Doherty, John; Moore, Catherine
  • Water Resources Research, Vol. 43, Issue 7
  • DOI: 10.1029/2006WR005348

Bayesian calibration of computer models
journal, August 2001

  • Kennedy, Marc C.; O'Hagan, Anthony
  • Journal of the Royal Statistical Society: Series B (Statistical Methodology), Vol. 63, Issue 3
  • DOI: 10.1111/1467-9868.00294

Evaluating forecasts of extreme events for hydrological applications: an approach for screening unfamiliar performance measures
journal, January 2008

  • Cloke, Hannah L.; Pappenberger, Florian
  • Meteorological Applications, Vol. 15, Issue 1
  • DOI: 10.1002/met.58

Maximum likelihood Bayesian averaging of uncertain model predictions
journal, November 2003

  • Neuman, S. P.
  • Stochastic Environmental Research and Risk Assessment (SERRA), Vol. 17, Issue 5
  • DOI: 10.1007/s00477-003-0151-7

Data-driven methods to improve baseflow prediction of a regional groundwater model
journal, December 2015


A review of surrogate models and their application to groundwater modeling: SURROGATES OF GROUNDWATER MODELS
journal, August 2015

  • Asher, M. J.; Croke, B. F. W.; Jakeman, A. J.
  • Water Resources Research, Vol. 51, Issue 8
  • DOI: 10.1002/2015WR016967

The Nature of Statistical Learning Theory
book, January 1995


Practical selection of SVM parameters and noise estimation for SVM regression
journal, January 2004


LIBSVM: A library for support vector machines
journal, April 2011

  • Chang, Chih-Chung; Lin, Chih-Jen
  • ACM Transactions on Intelligent Systems and Technology, Vol. 2, Issue 3
  • DOI: 10.1145/1961189.1961199

Typology of hydrologic predictability: OPINION
journal, March 2011


Support vector machines (SVMs) for monitoring network design
journal, May 2005


Environmental data mining and modeling based on machine learning algorithms and geostatistics
journal, September 2004


A statistical concept to assess the uncertainty in Bayesian model weights and its impact on model ranking: ASSESSING THE UNCERTAINTY IN BAYESIAN MODEL WEIGHTS
journal, September 2015

  • Schöniger, Anneli; Wöhling, Thomas; Nowak, Wolfgang
  • Water Resources Research, Vol. 51, Issue 9
  • DOI: 10.1002/2015WR016918

Uncertainty in hydrologic modeling: Toward an integrated data assimilation framework: HYDROLOGIC DATA ASSIMILATION
journal, July 2007

  • Liu, Yuqiong; Gupta, Hoshin V.
  • Water Resources Research, Vol. 43, Issue 7
  • DOI: 10.1029/2006WR005756

An evaluation of adaptive surrogate modeling based optimization with two benchmark problems
journal, October 2014


Use of Machine Learning Methods to Reduce Predictive Error of Groundwater Models
journal, May 2013

  • Xu, Tianfang; Valocchi, Albert J.; Choi, Jaesik
  • Groundwater, Vol. 52, Issue 3
  • DOI: 10.1111/gwat.12061

Comparison of joint versus postprocessor approaches for hydrological uncertainty estimation accounting for error autocorrelation and heteroscedasticity
journal, March 2014

  • Evin, Guillaume; Thyer, Mark; Kavetski, Dmitri
  • Water Resources Research, Vol. 50, Issue 3
  • DOI: 10.1002/2013WR014185

Efficient Bayesian inference of subsurface flow models using nested sampling and sparse polynomial chaos surrogates
journal, February 2014

  • Elsheikh, Ahmed H.; Hoteit, Ibrahim; Wheeler, Mary F.
  • Computer Methods in Applied Mechanics and Engineering, Vol. 269
  • DOI: 10.1016/j.cma.2013.11.001

Evaluating two sparse grid surrogates and two adaptation criteria for groundwater Bayesian uncertainty quantification
journal, April 2016


Random Forests
journal, January 2001


Assessing the impacts of parameter uncertainty for computationally expensive groundwater models: UNCERTAINTY ASSESSMENT
journal, October 2006

  • Mugunthan, Pradeep; Shoemaker, Christine A.
  • Water Resources Research, Vol. 42, Issue 10
  • DOI: 10.1029/2005WR004640

Measuring the impact of COVID-19 vaccine misinformation on vaccination intent in the UK and USA
journal, February 2021

  • Loomba, Sahil; de Figueiredo, Alexandre; Piatek, Simon J.
  • Nature Human Behaviour, Vol. 5, Issue 3
  • DOI: 10.1038/s41562-021-01056-1

miRNALoc: predicting miRNA subcellular localizations based on principal component scores of physico-chemical properties and pseudo compositions of di-nucleotides
journal, September 2020

  • Meher, Prabina Kumar; Satpathy, Subhrajit; Rao, Atmakuri Ramakrishna
  • Scientific Reports, Vol. 10, Issue 1
  • DOI: 10.1038/s41598-020-71381-4

Bayesian Kernel Methods
book, January 2018

  • Schölkopf, Bernhard; Smola, Alexander J.
  • Learning with Kernels: Support Vector Machines, Regularization, Optimization, and Beyond
  • DOI: 10.7551/mitpress/4175.003.0021

The Nature of Statistical Learning Theory
book, January 2000


The Nature Of Statistical Learning Theory~
journal, November 1997


Bayesian Kernel Methods: Applications in Medical Diagnosis Decision-Making Processes (A Case Study)
journal, January 2021

  • Saxena, Arti; Kumar, Vijay
  • International Journal of Big Data and Analytics in Healthcare, Vol. 6, Issue 1
  • DOI: 10.4018/ijbdah.20210101.oa3

Works referencing / citing this record:

Improving Robustness of Hydrologic Ensemble Predictions Through Probabilistic Pre‐ and Post‐Processing in Sequential Data Assimilation
journal, March 2018

  • Wang, S.; Ancell, B. C.; Huang, G. H.
  • Water Resources Research, Vol. 54, Issue 3
  • DOI: 10.1002/2018wr022546

What We Talk About When We Talk About Uncertainty. Toward a Unified, Data-Driven Framework for Uncertainty Characterization in Hydrogeology
journal, June 2019


A Brief Review of Random Forests for Water Scientists and Practitioners and Their Recent History in Water Resources
journal, April 2019

  • Tyralis, Hristos; Papacharalampous, Georgia; Langousis, Andreas
  • Water, Vol. 11, Issue 5
  • DOI: 10.3390/w11050910

Evaluation of terrestrial pan-Arctic carbon cycling using a data-assimilation system
journal, January 2019

  • López-Blanco, Efrén; Exbrayat, Jean-François; Lund, Magnus
  • Earth System Dynamics, Vol. 10, Issue 2
  • DOI: 10.5194/esd-10-233-2019

Estimating time-dependent vegetation biases in the SMAP soil moisture product
journal, January 2018

  • Zwieback, Simon; Colliander, Andreas; Cosh, Michael H.
  • Hydrology and Earth System Sciences, Vol. 22, Issue 8
  • DOI: 10.5194/hess-22-4473-2018

Estimating time-dependent vegetation biases in the SMAP soil moisture product
text, January 2018