A Cooperative Multi-Agent Deep Reinforcement Learning Framework for Real-Time Residential Load Scheduling
- University of Southern California
- US Army Research Lab-West
Internet-of-Things (IoT) enabled monitoring and control capabilities are enabling increasing numbers of household users with controllable loads to actively participate in smart grid energy management. Realizing an efficient real-time energy management system that takes advantage of these developments requires novel techniques for managing the increased complexity of the control action space while resolving multiple challenges, such as uncertainty in energy prices and renewable energy output, along with the need to satisfy physical grid constraints such as transformer capacity. Addressing these challenges, we develop a multi-household energy management framework for residential units connected to the same transformer and containing distributed energy resources (DERs) such as photovoltaic (PV) generation, energy storage systems (ESS), and controllable loads. The goal of our framework is to schedule controllable household appliances and ESS such that the cost of procuring electricity from the utility over a horizon is minimized while physical grid constraints are satisfied at each scheduling step. Traditional energy management frameworks either perform global optimization to satisfy grid constraints but suffer from high computational complexity (for example, integer programming and mixed-integer programming frameworks and centralized reinforcement learning) or perform decentralized real-time energy management without satisfying global grid constraints (for example, multi-agent reinforcement learning with no cooperation). In contrast, we propose a cooperative multi-agent reinforcement learning (MARL) framework that i) operates in real-time, and ii) performs explicit collaboration to satisfy global grid constraints. The novelty in our framework is twofold. First, our framework trains multiple independent learners (ILs), one for each household, in parallel using historical data and performs real-time inference of control actions using the most recent system state.
Second, our framework contains a low-complexity knapsack-based cooperation agent which combines the outputs of the ILs to minimize cost while satisfying grid constraints. Simulation results show that our cooperative MARL approach achieves significant cost improvement over centralized reinforcement learning and day-ahead planning baselines. Moreover, our approach strictly satisfies physical constraints with no a priori knowledge of system dynamics, while the baseline approaches have occasional violations. We also measure the training and inference time while varying the number of households from 1 to 25. Results show that our cooperative MARL approach scales best among the approaches considered.
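The abstract does not give implementation details for the cooperation agent, so the following is a minimal sketch of how such a knapsack-based cooperation step might look. The assumptions here (each IL proposing a single action with an integer power draw and an estimated cost saving, a 0/1 knapsack over household proposals, and a default deferred action for rejected households) are illustrative, not taken from the paper:

```python
def cooperate(proposals, capacity):
    """Knapsack-style cooperation over household IL proposals (illustrative).

    proposals: list of (power_units, cost_saving) pairs, one per household IL,
               with power_units a non-negative integer in some discretized unit.
    capacity:  transformer capacity in the same integer units.
    Returns the set of household indices whose proposed actions are accepted;
    the remaining households would fall back to a default (deferred) action.
    """
    # dp[c] = (best total saving, accepted household set) using capacity <= c
    dp = [(0.0, frozenset())] * (capacity + 1)
    for i, (w, v) in enumerate(proposals):
        new = dp[:]
        for c in range(w, capacity + 1):
            candidate = (dp[c - w][0] + v, dp[c - w][1] | {i})
            if candidate[0] > new[c][0]:
                new[c] = candidate
        dp = new
    # Best achievable saving within the transformer capacity
    return max(dp, key=lambda entry: entry[0])[1]

# Example: three households propose actions drawing 2, 3, and 4 units with
# savings 3.0, 4.0, and 5.0; the transformer allows 5 units in total.
accepted = cooperate([(2, 3.0), (3, 4.0), (4, 5.0)], capacity=5)
# Households 0 and 1 together use 5 units for a saving of 7.0.
```

The dynamic program runs in O(n * capacity) time, consistent with the "low-complexity" claim for a modest number of households and a discretized capacity.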
- Research Organization:
- Univ. of Southern California, Los Angeles, CA (United States)
- Sponsoring Organization:
- USDOE Office of Energy Efficiency and Renewable Energy (EERE), Renewable Power Office, Solar Energy Technologies Office
- DOE Contract Number:
- EE0008003
- OSTI ID:
- 1607511
- Report Number(s):
- EE0008003-3
- Resource Relation:
- Conference: The 4th ACM/IEEE International Conference on Internet of Things Design and Implementation (IoTDI), 2019
- Country of Publication:
- United States
- Language:
- English