Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Learning Stochastic Parametric Differentiable Predictive Control Policies

Conference ·
OSTI ID:1907869
We present a scalable unsupervised learning-based method for obtaining explicit control policies for model predictive control problems for stochastic linear systems with additive uncertainties subject to nonlinear chance constraints. We call the proposed method stochastic parametric differentiable predictive control (SP-DPC), which extends the recently proposed deterministic DPC policy optimization algorithm. We formulate the SP-DPC as a deterministic approximation to the stochastic parametric constrained optimal control problem via independent sampling of the problem's parameters and uncertainties. This formulation allows us to directly compute the policy gradients via automatic differentiation of the problem's value function, evaluated over sampled parameters and uncertainties. In particular, the computed expectation of the problem's value function is backpropagated through the finite-time closed-loop system rollouts parametrized by a known nominal system dynamics model and neural control policy. We also provide theoretical probabilistic guarantees on closed-loop stability and chance constraints satisfaction for systems controlled by learned neural policies. We demonstrate the computational efficiency and scalability of the proposed policy optimization algorithm in three numerical examples, including systems with a large number of states or subject to nonlinear constraints.
Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE
DOE Contract Number:
AC05-76RL01830
OSTI ID:
1907869
Report Number(s):
PNNL-SA-170144
Country of Publication:
Netherlands
Language:
English

Similar Records

Learning Constrained Parametric Differentiable Predictive Control Policies With Guarantees
Journal Article · Mon Mar 25 20:00:00 EDT 2024 · IEEE Transactions on Systems, Man, and Cybernetics: Systems · OSTI ID:2438488

Deep Learning Explicit Differentiable Predictive Control Laws for Buildings
Conference · Sun Jul 11 00:00:00 EDT 2021 · OSTI ID:1831144

Related Subjects