Bayesian sequential optimal experimental design for nonlinear models using policy gradient reinforcement learning
Journal Article
·
· Computer Methods in Applied Mechanics and Engineering
- University of Michigan, Ann Arbor, MI (United States); University of Michigan
- University of Michigan, Ann Arbor, MI (United States)
We present a mathematical framework and computational methods for optimally designing a finite sequence of experiments. This sequential optimal experimental design (sOED) problem is formulated as a finite-horizon partially observable Markov decision process (POMDP) under a Bayesian setting and with information-theoretic utilities. The formulation is general and may accommodate continuous random variables, non-Gaussian posteriors, and nonlinear forward models. The sOED design policy incorporates elements of feedback and lookahead simultaneously, and we show it to generalize the commonly-used batch and greedy design strategies. We solve for the sOED policy using the policy gradient (PG) method from reinforcement learning, and provide a derivation for the PG expression in the sOED context. Adopting an actor-critic approach, the policy and value functions are parameterized using deep neural networks and improved via PG estimates produced from simulated episodes of designs and observations. The new PG-sOED algorithm is first validated on a linear-Gaussian benchmark, and then compared against other design baselines on a sensor movement problem for contaminant source inversion in a convection-diffusion field. As a result, we provide explanation for the policy behaviors using knowledge of the underlying physical process.
- Research Organization:
- University of Michigan, Ann Arbor, MI (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC), Advanced Scientific Computing Research (ASCR)
- Grant/Contract Number:
- SC0021398
- OSTI ID:
- 1996019
- Journal Information:
- Computer Methods in Applied Mechanics and Engineering, Journal Name: Computer Methods in Applied Mechanics and Engineering Vol. 416; ISSN 0045-7825
- Publisher:
- ElsevierCopyright Statement
- Country of Publication:
- United States
- Language:
- English
Similar Records
Practical and Optimal Sequential Bayesian Experimental Design for Complex Systems Incorporating Human Experimenter Preferences (Final Scientific/Technical Report)
Computing optimal policies for partially observable decision processes using compact representations
Sequential ensemble transform for Bayesian inverse problems
Technical Report
·
Wed Dec 25 23:00:00 EST 2024
·
OSTI ID:2483855
Computing optimal policies for partially observable decision processes using compact representations
Conference
·
Mon Dec 30 23:00:00 EST 1996
·
OSTI ID:430799
Sequential ensemble transform for Bayesian inverse problems
Journal Article
·
Sun Dec 06 19:00:00 EST 2020
· Journal of Computational Physics
·
OSTI ID:1852513