Bayesian sequential optimal experimental design for nonlinear models using policy gradient reinforcement learning

Journal Article · Computer Methods in Applied Mechanics and Engineering
 [1];  [2]
  1. University of Michigan, Ann Arbor, MI (United States)
  2. University of Michigan, Ann Arbor, MI (United States)
We present a mathematical framework and computational methods for optimally designing a finite sequence of experiments. This sequential optimal experimental design (sOED) problem is formulated as a finite-horizon partially observable Markov decision process (POMDP) under a Bayesian setting and with information-theoretic utilities. The formulation is general and can accommodate continuous random variables, non-Gaussian posteriors, and nonlinear forward models. The sOED design policy incorporates elements of feedback and lookahead simultaneously, and we show that it generalizes the commonly used batch and greedy design strategies. We solve for the sOED policy using the policy gradient (PG) method from reinforcement learning, and derive the PG expression in the sOED context. Adopting an actor-critic approach, the policy and value functions are parameterized using deep neural networks and improved via PG estimates produced from simulated episodes of designs and observations. The new PG-sOED algorithm is first validated on a linear-Gaussian benchmark, and then compared against other design baselines on a sensor movement problem for contaminant source inversion in a convection-diffusion field. We also explain the resulting policy behaviors using knowledge of the underlying physical process.
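To make the idea of an information-theoretic utility over a sequence of experiments concrete, the following minimal sketch simulates one episode of a scalar linear-Gaussian model, where the Bayesian update and the Kullback-Leibler (KL) information gain are available in closed form. It is not the PG-sOED algorithm of the article: there is no policy or value network, the designs are fixed in advance, and the model y = theta * d + noise, the prior and noise variances, and all function names are assumptions chosen for illustration only.

```python
import math
import random


def posterior_update(mean, var, d, y, noise_var):
    """Conjugate update for theta ~ N(mean, var) after observing y = theta*d + noise."""
    precision = 1.0 / var + d * d / noise_var
    new_var = 1.0 / precision
    new_mean = new_var * (mean / var + d * y / noise_var)
    return new_mean, new_var


def kl_gaussian(m1, v1, m0, v0):
    """KL divergence KL( N(m1, v1) || N(m0, v0) ) for scalar Gaussians."""
    return 0.5 * (math.log(v0 / v1) + (v1 + (m1 - m0) ** 2) / v0 - 1.0)


def expected_information_gain(var, d, noise_var):
    """Closed-form expected KL gain of one experiment with design d (linear-Gaussian only)."""
    return 0.5 * math.log(1.0 + d * d * var / noise_var)


def simulate_episode(designs, prior_mean=0.0, prior_var=9.0, noise_var=1.0, rng=None):
    """Run a fixed sequence of designs; return the terminal KL gain and final belief."""
    if rng is None:
        rng = random.Random(0)
    # Draw a "true" parameter from the prior (hypothetical setup for this sketch).
    theta = rng.gauss(prior_mean, math.sqrt(prior_var))
    mean, var = prior_mean, prior_var
    for d in designs:
        y = theta * d + rng.gauss(0.0, math.sqrt(noise_var))
        mean, var = posterior_update(mean, var, d, y, noise_var)
    # Terminal utility: KL divergence from the prior to the final posterior.
    return kl_gaussian(mean, var, prior_mean, prior_var), (mean, var)


if __name__ == "__main__":
    gain, (post_mean, post_var) = simulate_episode([1.0, 2.0])
    print("terminal KL gain:", gain)
    print("posterior mean and variance:", post_mean, post_var)
```

In this linear-Gaussian sketch the posterior variance does not depend on the observed data, so feedback brings no benefit and the per-step expected gains simply add up to the total. Sequential policies with feedback and lookahead become valuable once the forward model is nonlinear or the posterior is non-Gaussian, which is the setting the article targets.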
Research Organization:
University of Michigan, Ann Arbor, MI (United States)
Sponsoring Organization:
USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
Grant/Contract Number:
SC0021398
OSTI ID:
1996019
Journal Information:
Computer Methods in Applied Mechanics and Engineering, Vol. 416; ISSN 0045-7825
Publisher:
Elsevier
Country of Publication:
United States
Language:
English
