skip to main content
OSTI.GOV title logo U.S. Department of Energy
Office of Scientific and Technical Information

Title: Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV Based Random Access IoT Networks With NOMA

Journal Article · · IEEE Journal on Selected Areas in Communications

In this paper, we apply the Non-Orthogonal Multiple Access (NOMA) technique to improve the massive channel access of a wireless IoT network where solar-powered Unmanned Aerial Vehicles (UAVs) relay data from IoT devices to remote servers. Specifically, IoT devices contend for accessing the shared wireless channel using an adaptive p-persistent slotted Aloha protocol; and the solar-powered UAVs adopt Successive Interference Cancellation (SIC) to decode multiple received data from IoT devices to improve access efficiency. To enable an energy-sustainable capacity-optimal network, we study the joint problem of dynamic multi-UAV altitude control and multi-cell wireless channel access management of IoT devices as a stochastic control problem with multiple energy constraints. We first formulate this problem as a Constrained Markov Decision Process (CMDP), and propose an online model-free Constrained Deep Reinforcement Learning (CDRL) algorithm based on Lagrangian primal-dual policy optimization to solve the CMDP. Extensive simulations demonstrate that our proposed algorithm learns a cooperative policy in which the altitude of UAVs and channel access probability of IoT devices are dynamically controlled to attain the maximal long-term network capacity while ensuring energy sustainability of UAVs, outperforming baseline schemes. The proposed CDRL agent can be trained on a small network, yet the learned policy can efficiently manage networks with a massive number of IoT devices and varying initial states, which can amortize the cost of training the CDRL agent.

Research Organization:
Argonne National Laboratory (ANL), Argonne, IL (United States)
Sponsoring Organization:
USDOE Office of Science (SC); National Science Foundation (NSF)
Grant/Contract Number:
AC02-06CH11357
OSTI ID:
1776838
Journal Information:
IEEE Journal on Selected Areas in Communications, Vol. 39, Issue 4; ISSN 0733-8716
Publisher:
IEEECopyright Statement
Country of Publication:
United States
Language:
English

References (33)

Playing Atari with Deep Reinforcement Learning preprint January 2013
LoRa technology MAC layer operations and Research issues journal January 2018
An actor-critic algorithm for constrained Markov decision processes journal March 2005
Optimal 3D-Trajectory Design and Resource Allocation for Solar-Powered UAV Communication Systems journal June 2019
Optimal Path Planning of Solar-Powered UAV Using Gravitational Potential Energy journal June 2017
Quadrotor Helicopter Flight Dynamics and Control: Theory and Experiment conference June 2007
An actor–critic algorithm with function approximation for discounted cost constrained Markov decision processes journal December 2010
An Online Actor–Critic Algorithm with Function Approximation for Constrained Markov Decision Processes journal January 2012
A Survey of Motion Planning Algorithms from the Perspective of Autonomous UAV Guidance journal November 2009
Optimizing Non-Orthogonal Multiple Access in Random Access Networks conference May 2020
Thrust Control for Multirotor Aerial Vehicles journal April 2017
A Renewal Theory Based Analytical Model for Multi-Channel Random Access in IEEE 802.11ac/ax journal May 2019
Solar powered UAV: Design and experiments conference September 2015
Perpetual flight with a small solar-powered UAV: Flight results, performance analysis and model validation conference March 2016
UAV-Enabled Communication Using NOMA journal July 2019
Joint Trajectory and Precoding Optimization for UAV-Assisted NOMA Networks journal May 2019
Least squares quantization in PCM journal March 1982
Efficient Deployment of Multiple Unmanned Aerial Vehicles for Optimal Wireless Coverage journal August 2016
Deployment Algorithms for UAV Airborne Networks Toward On-Demand Coverage journal September 2018
Risk-Sensitive Reinforcement Learning Applied to Control under Constraints journal July 2005
A Method for Optimized Deployment of a Network of Surveillance Aerial Drones journal December 2019
Joint Trajectory and Communication Design for Multi-UAV Enabled Wireless Networks journal March 2018
Throughput Maximization in Multi-UAV Enabled Communication Systems With Difference Consideration journal January 2018
Deep Reinforcement Learning for Minimizing Age-of-Information in UAV-Assisted Networks conference December 2019
Trajectory Design and Power Control for Multi-UAV Assisted Wireless Networks: A Machine Learning Approach journal August 2019
Reinforcement Learning for Decentralized Trajectory Design in Cellular UAV Networks With Sense-and-Send Protocol journal August 2019
Ultra-Reliable IoT Communications with UAVs: A Swarm Use Case journal December 2018
A Tutorial on UAVs for Wireless Networks: Applications, Challenges, and Open Problems journal January 2019
Sustainable Wireless IoT Networks With RF Energy Charging Over Wi-Fi (CoWiFi) journal December 2019
NOMA-Based Random Access With Multichannel ALOHA journal December 2017
A Game-Theoretic Approach for NOMA-ALOHA conference June 2018
Nonorthogonal Random Access for 5G Mobile Communication Systems journal August 2018
Placement Optimization of UAV-Mounted Mobile Base Stations journal March 2017

Cited By (1)

Data-Driven Random Access Optimization in Multi-Cell IoT Networks Using NOMA journal July 2022