Skip to main content
U.S. Department of Energy
Office of Scientific and Technical Information

Ten questions concerning reinforcement learning for building energy management

Journal Article · · Building and Environment
As buildings account for approximately 40% of global energy consumption and associated greenhouse gas emissions, their role in decarbonizing the power grid is crucial. The increased integration of variable energy sources, such as renewables, introduces uncertainties and unprecedented flexibilities, necessitating buildings to adapt their energy demand to enhance grid resiliency. Consequently, buildings must transition from passive energy consumers to active grid assets, providing demand flexibility and energy elasticity while maintaining occupant comfort and health. This fundamental shift demands advanced optimal control methods to manage escalating energy demand and avert power outages. Reinforcement learning (RL) emerges as a promising method to address these challenges. Here, in this paper, we explore ten questions related to the application of RL in buildings, specifically targeting flexible energy management. We consider the growing availability of data, advancements in machine learning algorithms, open-source tools, and the practical deployment aspects associated with software and hardware requirements. Our objective is to deliver a comprehensive introduction to RL, present an overview of existing research and accomplishments, underscore the challenges and opportunities, and propose potential future research directions to expedite the adoption of RL for building energy management.
Research Organization:
Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States); National Renewable Energy Laboratory (NREL), Golden, CO (United States); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States); Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
Sponsoring Organization:
USDOE Office of Energy Efficiency and Renewable Energy (EERE), Energy Efficiency Office. Building Technologies Office
Grant/Contract Number:
AC02-05CH11231; AC05-00OR22725; AC05-76RL01830; AC36-08GO28308
OSTI ID:
1984043
Alternate ID(s):
OSTI ID: 2000373
OSTI ID: 1986487
OSTI ID: 1986509
Report Number(s):
NREL/JA-5D00-84137; PNNL-SA-186171; ark:/13030/qt7945510n
Journal Information:
Building and Environment, Journal Name: Building and Environment Vol. 241; ISSN 0360-1323
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (80)

Transfer Learning Applied to Reinforcement Learning-Based HVAC Control journal April 2020
Model predictive heuristic control journal September 1978
AlphaBuilding ResCommunity: A multi-agent virtual testbed for community-level load coordination journal November 2021
Reinforcement learning for optimal control of low exergy buildings journal October 2015
Approximate model predictive building control via machine learning journal May 2018
Reinforcement learning for demand response: A review of algorithms and modeling techniques journal February 2019
Controlling distributed energy resources via deep reinforcement learning for load flexibility and energy efficiency journal December 2021
Reinforced model predictive control (RL-MPC) for building energy management journal March 2022
Reinforcement Learning for proactive operation of residential energy systems by learning stochastic occupant behavior and fluctuating solar energy: Balancing comfort, hygiene and energy use journal July 2022
A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings journal October 2022
Multi-agent deep reinforcement learning-based coordination control for grid-aware multi-buildings journal December 2022
All you need to know about model predictive control for buildings journal January 2020
Challenges of urban digital twins: A systematic review and a Delphi expert survey journal March 2023
Reinforcement learning for energy conservation and comfort in buildings journal July 2007
Investigation on the impacts of different genders and ages on satisfaction with thermal environments in office buildings journal June 2010
On occupant-centric building performance metrics journal September 2017
Personal comfort models – A new paradigm in thermal comfort for occupant-centric environmental control journal March 2018
LightLearn: An adaptive and occupant centered controller for lighting based on reinforcement learning journal January 2019
A critical review of field implementations of occupant-centric building controls journal November 2019
Ten questions concerning agent-based modeling of occupant behavior for energy and environmental performance of buildings journal June 2022
Ten questions concerning human-building interaction research for improving the quality of life journal December 2022
Towards self-learning control of HVAC systems with the consideration of dynamic occupancy patterns: Application of model-free deep reinforcement learning journal December 2022
Reinforcement learning for whole-building HVAC control and demand response journal November 2020
Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings journal November 2022
A predictive and adaptive control strategy to optimize the management of integrated energy systems in buildings journal November 2022
Experimental analysis of simulated reinforcement learning control for active and passive building thermal storage inventory journal February 2006
Experimental analysis of simulated reinforcement learning control for active and passive building thermal storage inventory journal February 2006
Satisfaction based Q-learning for integrated lighting and blind control journal September 2016
IEA EBC Annex 66: Definition and simulation of occupant behavior in buildings journal December 2017
Artificial neural network models using thermal sensations and occupants’ behavior for predicting thermal comfort journal September 2018
Whole building energy model for HVAC optimal control: A practical framework based on deep reinforcement learning journal September 2019
Data-driven building energy modeling with feature selection and active learning for data predictive control journal December 2021
Benchmarking high performance HVAC Rule-Based controls with advanced intelligent Controllers: A case study in a Multi-Zone system in Modelica journal April 2023
An automated FX trading system using adaptive reinforcement learning journal April 2006
Deep Learning Explicit Differentiable Predictive Control Laws for Buildings journal January 2021
Comprehensive analysis of the relationship between thermal comfort and building control research - A data-driven literature review journal February 2018
Experimental analysis of data-driven control for a building heating system journal June 2016
Human-level control through deep reinforcement learning journal February 2015
Mastering the game of Go with deep neural networks and tree search journal January 2016
Mastering Atari, Go, chess and shogi by planning with a learned model journal December 2020
Outracing champion Gran Turismo drivers with deep reinforcement learning journal February 2022
A synthetic building operation dataset journal August 2021
A three-year dataset supporting research on building energy management and occupancy analytics journal April 2022
A Global Building Occupant Behavior Database journal June 2022
The National Human Activity Pattern Survey (NHAPS): a resource for assessing exposure to environmental pollutants journal July 2001
Evaluation of Reinforcement Learning Control for Thermal Energy Storage Systems journal July 2003
Functional mock-up unit for co-simulation import in EnergyPlus journal June 2013
Identification of multi-zone grey-box building models for use in model predictive control journal May 2020
Building optimization testing framework (BOPTEST) for simulation-based benchmarking of control strategies in buildings journal September 2021
A high-fidelity building performance simulation test bed for the development and evaluation of advanced controls journal April 2022
Sample Efficient Reinforcement Learning With Domain Randomization for Automated Demand Response in Low-Voltage Grids journal October 2022
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey conference December 2020
Real-Time Residential Demand Response journal September 2020
An Edge-Cloud Integrated Solution for Buildings Demand Response Using Reinforcement Learning journal January 2021
Domain Randomization for Demand Response of an Electric Water Heater journal March 2021
Two-Stage Reinforcement Learning Policy Search for Grid-Interactive Building Control journal May 2022
Evaluation of Reinforcement Learning for Optimal Control of Building Active and Passive Thermal Storage Inventory journal October 2006
Exploring fairness in participatory thermal comfort control in smart buildings conference November 2017
Practical implementation and evaluation of deep reinforcement learning control for a radiant heating system
  • Zhang, Zhiang; Lam, Khee Poh
  • BuildSys '18: The 5th ACM International Conference on Systems for Built Environments, Proceedings of the 5th Conference on Systems for Built Environments https://doi.org/10.1145/3276774.3276775
conference November 2018
Introduction to Human-Building Interaction (HBI): Interfacing HCI with Architecture and Urban Design
  • Alavi, Hamed S.; Churchill, Elizabeth F.; Wiberg, Mikael
  • ACM Transactions on Computer-Human Interaction, Vol. 26, Issue 2 https://doi.org/10.1145/3309714
journal April 2019
Gnu-RL: A Precocial Reinforcement Learning Solution for Building HVAC Control Using a Differentiable MPC Policy
  • Chen, Bingqing; Cai, Zicheng; Bergés, Mario
  • BuildSys '19: The 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3360322.3360849
conference November 2019
CityLearn v1.0: An OpenAI Gym Environment for Demand Response with Deep Reinforcement Learning
  • Vázquez-Canteli, José R.; Kämpf, Jérôme; Henze, Gregor
  • BuildSys '19: The 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3360322.3360998
conference November 2019
HVACLearn conference June 2020
Marlisa
  • Vazquez-Canteli, Jose R.; Henze, Gregor; Nagy, Zoltan
  • Proceedings of the 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3408308.3427604
conference November 2020
One for Many
  • Xu, Shichao; Wang, Yixuan; Wang, Yanzhi
  • Proceedings of the 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3408308.3427617
conference November 2020
Enforcing Policy Feasibility Constraints through Differentiable Projection for Energy Optimization conference June 2021
I want it that way
  • von Frankenberg, Nadine; Loftness, Vivian; Bruegge, Bernd
  • Proceedings of the 8th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3486611.3486672
conference November 2021
ComfortLearn
  • Quintana, Matias; Nagy, Zoltan; Tartarini, Federico
  • Proceedings of the 9th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3563357.3566167
conference November 2022
Learning Tetris Using the Noisy Cross-Entropy Method journal December 2006
Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection journal June 2017
Deep Reinforcement Learning with Double Q-Learning journal March 2016
Counterfactual Multi-Agent Policy Gradients journal April 2018
Rainbow: Combining Improvements in Deep Reinforcement Learning journal April 2018
An Open-AI gym environment for the Building Optimization Testing (BOPTEST) framework conference September 2021
Gnu-RL: A Practical and Scalable Reinforcement Learning Solution for Building HVAC Control Using a Differentiable MPC Policy journal November 2020
The Information Gap in Occupant-Centric Building Operations: Lessons Learned from Interviews with Building Operators in Germany journal May 2022
Energym: A Building Model Library for Controller Benchmarking journal April 2021
Metadata Schemas and Ontologies for Building Energy Applications: A Critical Review and Use Case Analysis journal April 2021
Evaluating the Adaptability of Reinforcement Learning Based HVAC Control for Residential Houses journal September 2020
A Relearning Approach to Reinforcement Learning for control of Smart Buildings journal November 2020