DOE PAGES title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Two-Stage Reinforcement Learning Policy Search for Grid-Interactive Building Control

Journal Article · · IEEE Transactions on Smart Grid

This paper develops an intelligent grid-interactive building controller, which optimizes building operation during both normal hours and demand response (DR) events. To avoid costly on-demand computation and to adapt to non-linear building models, the controller utilizes reinforcement learning (RL) and makes real-time decisions based on a near-optimal control policy. Learning such a policy typically amounts to solving a hard non-convex optimization problem. We propose to address this problem with a novel global-local policy search method. In the first stage, an RL algorithm based on zero-order gradient estimation is leveraged to search for the optimal policy globally, due to its scalability and the potential to escape some poor performing local optima. The obtained policy is then fine-tuned locally to bring the first-stage solution closer to that of the original unsmoothed problem. Experiments on a simulated five-zone commercial building demonstrate the advantages of the proposed method over existing learning approaches. They also show that the learned control policy outperforms a pragmatic linear model predictive controller (MPC) and approaches the performance of an oracle MPC in testing scenarios. Using a state-of-the-art advanced computing system, we demonstrate that the controller can be learned and deployed within hours of training.

Research Organization:
National Renewable Energy Laboratory (NREL), Golden, CO (United States)
Sponsoring Organization:
USDOE Office of Energy Efficiency and Renewable Energy (EERE), Energy Efficiency Office. Building Technologies Office
Grant/Contract Number:
AC36-08GO28308
OSTI ID:
1841140
Report Number(s):
NREL/JA-2C00-79559; MainId:35780; UUID:ae6b3461-9d61-4782-8331-aa0d27f35eab; MainAdminID:63638
Journal Information:
IEEE Transactions on Smart Grid, Journal Name: IEEE Transactions on Smart Grid Journal Issue: 3 Vol. 13; ISSN 1949-3053
Publisher:
IEEECopyright Statement
Country of Publication:
United States
Language:
English

References (31)

Random Gradient-Free Minimization of Convex Functions journal November 2015
A self-learning algorithm for coordinated control of rooftop units in small- and medium-sized commercial buildings journal November 2017
Coordinating the operations of smart buildings in smart grids journal October 2018
Model predictive control for thermal energy storage and thermal comfort optimization of building demand response in smart grids journal May 2019
All you need to know about model predictive control for buildings journal January 2020
Theory and applications of HVAC control systems – A review of model predictive control (MPC) journal February 2014
Reinforcement learning for whole-building HVAC control and demand response journal November 2020
Energy Efficient Building HVAC Control Algorithm with Real-time Occupancy Prediction journal March 2017
Development and experimental demonstration of a plug-and-play multiple RTU coordination control algorithm for small/medium commercial buildings journal November 2015
Whole building energy model for HVAC optimal control: A practical framework based on deep reinforcement learning journal September 2019
A summary of demand response in electricity markets journal November 2008
Residual Reinforcement Learning for Robot Control conference May 2019
An IoT-Based Thermal Model Learning Framework for Smart Buildings journal January 2020
Tube-Based Model Predictive Controller for Building’s Heating Ventilation and Air Conditioning (HVAC) System journal December 2021
Using a Transactive Energy Framework: Providing Grid Services from Smart Buildings journal December 2016
Distributed Model-Predictive Control Strategy for Distribution Network Volt/VAR Control: A Smart-Building-Based Approach journal November 2019
Mobile Storage for Demand Charge Reduction journal January 2021
Ancillary Service to the Grid Through Control of Fans in Commercial Building HVAC Systems journal July 2014
Residential Demand Response of Thermostatically Controlled Loads Using Batch Reinforcement Learning journal September 2017
Reinforcement Learning Applied to an Electric Water Heater: From Theory to Practice journal July 2018
On-Line Building Energy Optimization Using Deep Reinforcement Learning journal July 2019
Linearized Price-Responsive HVAC Controller for Optimal Scheduling of Smart Building Loads journal July 2020
Real-Time Residential Demand Response journal September 2020
An Edge-Cloud Integrated Solution for Buildings Demand Response Using Reinforcement Learning journal January 2021
Automated Multi-Zone Linear Parametric Black Box Modeling Approach for Building HVAC Systems
  • Chintala, Rohit H.; Rasmussen, Bryan P.
  • ASME 2015 Dynamic Systems and Control Conference, Volume 2: Diagnostics and Detection; Drilling; Dynamics and Control of Wind Energy Systems; Energy Harvesting; Estimation and Identification; Flexible and Smart Structure Control; Fuels Cells/Energy Storage; Human Robot Interaction; HVAC Building Energy Management; Industrial Applications; Intelligent Transportation Systems; Manufacturing; Mechatronics; Modelling and Validation; Motion and Vibration Control Applications https://doi.org/10.1115/DSCC2015-9933
conference January 2016
Deep Reinforcement Learning for Building HVAC Control
  • Wei, Tianshu; Wang, Yanzhi; Zhu, Qi
  • DAC '17: The 54th Annual Design Automation Conference 2017, Proceedings of the 54th Annual Design Automation Conference 2017 https://doi.org/10.1145/3061639.3062224
conference June 2017
Transferable Reinforcement Learning for Smart Homes
  • Zhang, Xiangyu; Jin, Xin; Tripp, Charles
  • BuildSys '20: The 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Proceedings of the 1st International Workshop on Reinforcement Learning for Energy Management in Buildings & Cities https://doi.org/10.1145/3427773.3427865
conference November 2020
A Comparison of Model-Free and Model Predictive Control for Price Responsive Water Heaters
  • Biagioni, David J.; Zhang, Xiangyu; Graf, Peter
  • BuildSys '20: The 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Proceedings of the 1st International Workshop on Reinforcement Learning for Energy Management in Buildings & Cities https://doi.org/10.1145/3427773.3427872
conference November 2020
U.S. Department of Energy Commercial Reference Building Models of the National Building Stock report February 2011
Predictive Control of Building Thermal Loads for Participation in Energy and Regulation Markets conference July 2020
Reinforcement Learning for Control of Building HVAC Systems conference July 2020