Quantifying model structural error: Efficient Bayesian calibration of a regional groundwater flow model using surrogates and a data-driven error model
Abstract
Groundwater model structural error is ubiquitous, due to simplification and/or misrepresentation of real aquifer systems. During model calibration, the basic hydrogeological parameters may be adjusted to compensate for structural error. This may result in biased predictions when such calibrated models are used to forecast aquifer responses to new forcing. Here, we investigate the impact of model structural error on calibration and prediction of a real-world groundwater flow model, using a Bayesian method with a data-driven error model to explicitly account for model structural error. The error-explicit Bayesian method jointly infers model parameters and structural error and thereby reduces parameter compensation. In this study, Bayesian inference is facilitated using high performance computing and fast surrogate models (based on machine learning techniques) as a substitute for the computationally expensive groundwater model. We demonstrate that with explicit treatment of model structural error, the Bayesian method yields parameter posterior distributions that are substantially different from those derived using classical Bayesian calibration that does not account for model structural error. We also found that the error-explicit Bayesian method gives significantly more accurate prediction along with reasonable credible intervals. Finally, through variance decomposition, we provide a comprehensive assessment of prediction uncertainty contributed from parameter, model structure, and measurement uncertainty. The results suggest that the error-explicit Bayesian approach provides a solution to real-world modeling applications for which data support the presence of model structural error, yet model deficiency cannot be specifically identified or corrected.
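As a reading aid, the error-explicit formulation and the variance decomposition described in the abstract can be sketched as follows. This is a generic additive-discrepancy form under stated assumptions, not the paper's own equations, which the abstract does not give. The symbols $y$, $f$, $\theta$, $\delta$, $\phi$, $\varepsilon$, and $\sigma^2$ are introduced here for illustration only.

```latex
% Assumed additive form: observation = simulator (or surrogate) output
% + data-driven structural error term + measurement noise.
y(x) = f(x, \theta) + \delta(x; \phi) + \varepsilon,
\qquad \varepsilon \sim \mathcal{N}(0, \sigma^2)

% Joint inference of physical parameters \theta and error-model parameters \phi,
% in contrast to classical calibration, which sets \delta \equiv 0.
p(\theta, \phi \mid y) \propto p(y \mid \theta, \phi)\, p(\theta)\, p(\phi)

% Law of total variance for a prediction y^* at a new location/forcing x^*,
% assuming the measurement noise is independent of \delta given \theta.
\operatorname{Var}(y^* \mid y)
  = \underbrace{\operatorname{Var}_{\theta \mid y}\!\big[ f(x^*, \theta)
      + \mathbb{E}(\delta^* \mid \theta, y) \big]}_{\text{parameter}}
  + \underbrace{\mathbb{E}_{\theta \mid y}\!\big[ \operatorname{Var}(\delta^* \mid \theta, y) \big]}_{\text{model structure}}
  + \underbrace{\mathbb{E}(\sigma^2 \mid y)}_{\text{measurement}}
```

Under these assumptions the three terms correspond to the parameter, model structure, and measurement contributions named in the abstract. The specific error model and decomposition used by the authors may differ.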
- Authors:
- Xu, Tianfang; Valocchi, Albert J.; Ye, Ming; Liang, Feng
- Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Dept. of Civil and Environmental Engineering; Michigan State Univ., East Lansing, MI (United States). Dept. of Earth and Environmental Sciences
- Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Dept. of Civil and Environmental Engineering
- Florida State Univ., Tallahassee, FL (United States). Dept. of Scientific Computing
- Univ. of Illinois at Urbana-Champaign, Urbana, IL (United States). Dept. of Statistics
- Publication Date:
- 2017-05-08
- Research Org.:
- Florida State Univ., Tallahassee, FL (United States)
- Sponsoring Org.:
- USDOE Office of Science (SC)
- OSTI Identifier:
- 1532997
- Grant/Contract Number:
- SC0008272
- Resource Type:
- Accepted Manuscript
- Journal Name:
- Water Resources Research
- Additional Journal Information:
- Journal Volume: 53; Journal Issue: 5; Journal ID: ISSN 0043-1397
- Publisher:
- American Geophysical Union (AGU)
- Country of Publication:
- United States
- Language:
- English
- Subject:
- 54 ENVIRONMENTAL SCIENCES; environmental sciences & ecology; marine & freshwater biology; water resources; Bayesian calibration; uncertainty decomposition; model structural error; surrogate modeling; groundwater
Citation Formats
Xu, Tianfang, Valocchi, Albert J., Ye, Ming, and Liang, Feng. Quantifying model structural error: Efficient Bayesian calibration of a regional groundwater flow model using surrogates and a data-driven error model. United States: N. p., 2017. Web. doi:10.1002/2016wr019831.
Xu, Tianfang, Valocchi, Albert J., Ye, Ming, & Liang, Feng. Quantifying model structural error: Efficient Bayesian calibration of a regional groundwater flow model using surrogates and a data-driven error model. United States. https://doi.org/10.1002/2016wr019831
Xu, Tianfang, Valocchi, Albert J., Ye, Ming, and Liang, Feng. 2017. "Quantifying model structural error: Efficient Bayesian calibration of a regional groundwater flow model using surrogates and a data-driven error model". United States. https://doi.org/10.1002/2016wr019831. https://www.osti.gov/servlets/purl/1532997.
@article{osti_1532997,
title = {Quantifying model structural error: Efficient Bayesian calibration of a regional groundwater flow model using surrogates and a data-driven error model},
author = {Xu, Tianfang and Valocchi, Albert J. and Ye, Ming and Liang, Feng},
abstractNote = {Groundwater model structural error is ubiquitous, due to simplification and/or misrepresentation of real aquifer systems. During model calibration, the basic hydrogeological parameters may be adjusted to compensate for structural error. This may result in biased predictions when such calibrated models are used to forecast aquifer responses to new forcing. Here, we investigate the impact of model structural error on calibration and prediction of a real-world groundwater flow model, using a Bayesian method with a data-driven error model to explicitly account for model structural error. The error-explicit Bayesian method jointly infers model parameters and structural error and thereby reduces parameter compensation. In this study, Bayesian inference is facilitated using high performance computing and fast surrogate models (based on machine learning techniques) as a substitute for the computationally expensive groundwater model. We demonstrate that with explicit treatment of model structural error, the Bayesian method yields parameter posterior distributions that are substantially different from those derived using classical Bayesian calibration that does not account for model structural error. We also found that the error-explicit Bayesian method gives significantly more accurate prediction along with reasonable credible intervals. Finally, through variance decomposition, we provide a comprehensive assessment of prediction uncertainty contributed from parameter, model structure, and measurement uncertainty. The results suggest that the error-explicit Bayesian approach provides a solution to real-world modeling applications for which data support the presence of model structural error, yet model deficiency cannot be specifically identified or corrected.},
doi = {10.1002/2016wr019831},
journal = {Water Resources Research},
number = 5,
volume = 53,
place = {United States},
year = {2017},
month = {5}
}