skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Ten questions concerning reinforcement learning for building energy management

Journal Article · · Building and Environment
ORCiD logo [1];  [2];  [3];  [4];  [5];  [6];  [7]; ORCiD logo [8];  [8];  [6]; ORCiD logo [8];  [9];  [10];  [11];  [11];  [12];  [12];  [13];  [14];  [13] more »;  [9];  [15];  [1];  [6];  [6] « less
  1. University of Texas, Austin, TX (United States)
  2. University of Colorado, Boulder, CO (United States); National Renewable Energy Laboratory (NREL), Golden, CO (United States)
  3. University of Colorado, Boulder, CO (United States)
  4. Katholieke University Leuven (Belgium)
  5. Katholieke University. Leuven (Belgium)
  6. National Renewable Energy Laboratory (NREL), Golden, CO (United States)
  7. Bosch Research North America, Pittsburgh, PA (United States)
  8. Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
  9. Pacific Northwest National Laboratory (PNNL), Richland, WA (United States)
  10. National University of Singapore (Singapore); Eidgenoessische Technische Hochschule (ETH) (Singapore). Singapore-ETH Centre
  11. University of Texas, Arlington, TX (United States)
  12. Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States)
  13. Polytechnic University of Turin (Italy)
  14. PassiveLogic, Salt Lake City, UT (United States); Polytechnic University of Turin (Italy)
  15. Carnegie Mellon University, Pittsburgh, PA (United States)

As buildings account for approximately 40% of global energy consumption and associated greenhouse gas emissions, their role in decarbonizing the power grid is crucial. The increased integration of variable energy sources, such as renewables, introduces uncertainties and unprecedented flexibilities, necessitating buildings to adapt their energy demand to enhance grid resiliency. Consequently, buildings must transition from passive energy consumers to active grid assets, providing demand flexibility and energy elasticity while maintaining occupant comfort and health. This fundamental shift demands advanced optimal control methods to manage escalating energy demand and avert power outages. Reinforcement learning (RL) emerges as a promising method to address these challenges. Here, in this paper, we explore ten questions related to the application of RL in buildings, specifically targeting flexible energy management. We consider the growing availability of data, advancements in machine learning algorithms, open-source tools, and the practical deployment aspects associated with software and hardware requirements. Our objective is to deliver a comprehensive introduction to RL, present an overview of existing research and accomplishments, underscore the challenges and opportunities, and propose potential future research directions to expedite the adoption of RL for building energy management.

Research Organization:
Pacific Northwest National Laboratory (PNNL), Richland, WA (United States); Lawrence Berkeley National Laboratory (LBNL), Berkeley, CA (United States); National Renewable Energy Laboratory (NREL), Golden, CO (United States); Oak Ridge National Laboratory (ORNL), Oak Ridge, TN (United States)
Sponsoring Organization:
USDOE Office of Energy Efficiency and Renewable Energy (EERE), Energy Efficiency Office. Building Technologies Office
Grant/Contract Number:
AC05-76RL01830; AC02-05CH11231; AC36-08GO28308; AC05-00OR22725
OSTI ID:
1986487
Alternate ID(s):
OSTI ID: 1984043; OSTI ID: 1986509; OSTI ID: 2000373
Report Number(s):
PNNL-SA-186171; NREL/JA-5D00-84137
Journal Information:
Building and Environment, Vol. 241; ISSN 0360-1323
Publisher:
ElsevierCopyright Statement
Country of Publication:
United States
Language:
English

References (80)

Benchmarking high performance HVAC Rule-Based controls with advanced intelligent Controllers: A case study in a Multi-Zone system in Modelica journal April 2023
Transfer Learning Applied to Reinforcement Learning-Based HVAC Control journal April 2020
Deep Learning Explicit Differentiable Predictive Control Laws for Buildings journal January 2021
HVACLearn conference June 2020
Learning hand-eye coordination for robotic grasping with deep learning and large-scale data collection journal June 2017
Deep Reinforcement Learning with Double Q-Learning journal March 2016
Real-world challenges for multi-agent reinforcement learning in grid-interactive buildings journal November 2022
Mastering the game of Go with deep neural networks and tree search journal January 2016
Ten questions concerning agent-based modeling of occupant behavior for energy and environmental performance of buildings journal June 2022
Gnu-RL: A Practical and Scalable Reinforcement Learning Solution for Building HVAC Control Using a Differentiable MPC Policy journal November 2020
All you need to know about model predictive control for buildings journal January 2020
Reinforcement learning for demand response: A review of algorithms and modeling techniques journal February 2019
Identification of multi-zone grey-box building models for use in model predictive control journal May 2020
Data-driven building energy modeling with feature selection and active learning for data predictive control journal December 2021
Approximate model predictive building control via machine learning journal May 2018
Evaluation of Reinforcement Learning Control for Thermal Energy Storage Systems journal July 2003
Two-Stage Reinforcement Learning Policy Search for Grid-Interactive Building Control journal May 2022
A critical review of field implementations of occupant-centric building controls journal November 2019
A high-fidelity building performance simulation test bed for the development and evaluation of advanced controls journal April 2022
Functional mock-up unit for co-simulation import in EnergyPlus journal June 2013
LightLearn: An adaptive and occupant centered controller for lighting based on reinforcement learning journal January 2019
A Global Building Occupant Behavior Database journal June 2022
Sample Efficient Reinforcement Learning With Domain Randomization for Automated Demand Response in Low-Voltage Grids journal October 2022
Multi-agent deep reinforcement learning-based coordination control for grid-aware multi-buildings journal December 2022
Exploring fairness in participatory thermal comfort control in smart buildings conference November 2017
One for Many
  • Xu, Shichao; Wang, Yixuan; Wang, Yanzhi
  • Proceedings of the 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3408308.3427617
conference November 2020
Building optimization testing framework (BOPTEST) for simulation-based benchmarking of control strategies in buildings journal September 2021
A Relearning Approach to Reinforcement Learning for control of Smart Buildings journal November 2020
The Information Gap in Occupant-Centric Building Operations: Lessons Learned from Interviews with Building Operators in Germany journal May 2022
Investigation on the impacts of different genders and ages on satisfaction with thermal environments in office buildings journal June 2010
Introduction to Human-Building Interaction (HBI): Interfacing HCI with Architecture and Urban Design
  • Alavi, Hamed S.; Churchill, Elizabeth F.; Wiberg, Mikael
  • ACM Transactions on Computer-Human Interaction, Vol. 26, Issue 2 https://doi.org/10.1145/3309714
journal April 2019
Learning Tetris Using the Noisy Cross-Entropy Method journal December 2006
Towards self-learning control of HVAC systems with the consideration of dynamic occupancy patterns: Application of model-free deep reinforcement learning journal December 2022
Experimental analysis of simulated reinforcement learning control for active and passive building thermal storage inventory journal February 2006
Experimental analysis of simulated reinforcement learning control for active and passive building thermal storage inventory journal February 2006
Comprehensive analysis of the relationship between thermal comfort and building control research - A data-driven literature review journal February 2018
Gnu-RL: A Precocial Reinforcement Learning Solution for Building HVAC Control Using a Differentiable MPC Policy
  • Chen, Bingqing; Cai, Zicheng; Bergés, Mario
  • BuildSys '19: The 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3360322.3360849
conference November 2019
A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings journal October 2022
Outracing champion Gran Turismo drivers with deep reinforcement learning journal February 2022
An Edge-Cloud Integrated Solution for Buildings Demand Response Using Reinforcement Learning journal January 2021
A three-year dataset supporting research on building energy management and occupancy analytics journal April 2022
ComfortLearn
  • Quintana, Matias; Nagy, Zoltan; Tartarini, Federico
  • Proceedings of the 9th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3563357.3566167
conference November 2022
IEA EBC Annex 66: Definition and simulation of occupant behavior in buildings journal December 2017
Satisfaction based Q-learning for integrated lighting and blind control journal September 2016
Practical implementation and evaluation of deep reinforcement learning control for a radiant heating system
  • Zhang, Zhiang; Lam, Khee Poh
  • BuildSys '18: The 5th ACM International Conference on Systems for Built Environments, Proceedings of the 5th Conference on Systems for Built Environments https://doi.org/10.1145/3276774.3276775
conference November 2018
Mastering Atari, Go, chess and shogi by planning with a learned model journal December 2020
Reinforcement learning for whole-building HVAC control and demand response journal November 2020
Human-level control through deep reinforcement learning journal February 2015
Evaluation of Reinforcement Learning for Optimal Control of Building Active and Passive Thermal Storage Inventory journal October 2006
I want it that way
  • von Frankenberg, Nadine; Loftness, Vivian; Bruegge, Bernd
  • Proceedings of the 8th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3486611.3486672
conference November 2021
Marlisa
  • Vazquez-Canteli, Jose R.; Henze, Gregor; Nagy, Zoltan
  • Proceedings of the 7th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3408308.3427604
conference November 2020
Metadata Schemas and Ontologies for Building Energy Applications: A Critical Review and Use Case Analysis journal April 2021
An automated FX trading system using adaptive reinforcement learning journal April 2006
Enforcing Policy Feasibility Constraints through Differentiable Projection for Energy Optimization conference June 2021
Energym: A Building Model Library for Controller Benchmarking journal April 2021
Domain Randomization for Demand Response of an Electric Water Heater journal March 2021
Reinforced model predictive control (RL-MPC) for building energy management journal March 2022
Rainbow: Combining Improvements in Deep Reinforcement Learning journal April 2018
Whole building energy model for HVAC optimal control: A practical framework based on deep reinforcement learning journal September 2019
Counterfactual Multi-Agent Policy Gradients journal April 2018
Controlling distributed energy resources via deep reinforcement learning for load flexibility and energy efficiency journal December 2021
On occupant-centric building performance metrics journal September 2017
Reinforcement learning for optimal control of low exergy buildings journal October 2015
An Open-AI gym environment for the Building Optimization Testing (BOPTEST) framework conference September 2021
CityLearn v1.0: An OpenAI Gym Environment for Demand Response with Deep Reinforcement Learning
  • Vázquez-Canteli, José R.; Kämpf, Jérôme; Henze, Gregor
  • BuildSys '19: The 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation, Proceedings of the 6th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation https://doi.org/10.1145/3360322.3360998
conference November 2019
AlphaBuilding ResCommunity: A multi-agent virtual testbed for community-level load coordination journal November 2021
Artificial neural network models using thermal sensations and occupants’ behavior for predicting thermal comfort journal September 2018
Personal comfort models – A new paradigm in thermal comfort for occupant-centric environmental control journal March 2018
Ten questions concerning human-building interaction research for improving the quality of life journal December 2022
A synthetic building operation dataset journal August 2021
Challenges of urban digital twins: A systematic review and a Delphi expert survey journal March 2023
Experimental analysis of data-driven control for a building heating system journal June 2016
Evaluating the Adaptability of Reinforcement Learning Based HVAC Control for Residential Houses journal September 2020
Reinforcement learning for energy conservation and comfort in buildings journal July 2007
A predictive and adaptive control strategy to optimize the management of integrated energy systems in buildings journal November 2022
Sim-to-Real Transfer in Deep Reinforcement Learning for Robotics: a Survey conference December 2020
Model predictive heuristic control journal September 1978
The National Human Activity Pattern Survey (NHAPS): a resource for assessing exposure to environmental pollutants journal July 2001
Real-Time Residential Demand Response journal September 2020
Reinforcement Learning for proactive operation of residential energy systems by learning stochastic occupant behavior and fluctuating solar energy: Balancing comfort, hygiene and energy use journal July 2022