Constrained Deep Reinforcement Learning for Energy Sustainable Multi-UAV Based Random Access IoT Networks With NOMA
- Illinois Institute of Technology, Chicago, IL (United States)
- Argonne National Lab. (ANL), Argonne, IL (United States)
In this paper, we apply the Non-Orthogonal Multiple Access (NOMA) technique to improve the massive channel access of a wireless IoT network where solar-powered Unmanned Aerial Vehicles (UAVs) relay data from IoT devices to remote servers. Specifically, IoT devices contend for accessing the shared wireless channel using an adaptive p-persistent slotted Aloha protocol; and the solar-powered UAVs adopt Successive Interference Cancellation (SIC) to decode multiple received data from IoT devices to improve access efficiency. To enable an energy-sustainable capacity-optimal network, we study the joint problem of dynamic multi-UAV altitude control and multi-cell wireless channel access management of IoT devices as a stochastic control problem with multiple energy constraints. We first formulate this problem as a Constrained Markov Decision Process (CMDP), and propose an online model-free Constrained Deep Reinforcement Learning (CDRL) algorithm based on Lagrangian primal-dual policy optimization to solve the CMDP. Extensive simulations demonstrate that our proposed algorithm learns a cooperative policy in which the altitude of UAVs and channel access probability of IoT devices are dynamically controlled to attain the maximal long-term network capacity while ensuring energy sustainability of UAVs, outperforming baseline schemes. The proposed CDRL agent can be trained on a small network, yet the learned policy can efficiently manage networks with a massive number of IoT devices and varying initial states, which can amortize the cost of training the CDRL agent.
- Research Organization:
- Argonne National Laboratory (ANL), Argonne, IL (United States)
- Sponsoring Organization:
- USDOE Office of Science (SC); National Science Foundation (NSF)
- Grant/Contract Number:
- AC02-06CH11357
- OSTI ID:
- 1776838
- Journal Information:
- IEEE Journal on Selected Areas in Communications, Vol. 39, Issue 4; ISSN 0733-8716
- Publisher:
- IEEECopyright Statement
- Country of Publication:
- United States
- Language:
- English
Data-Driven Random Access Optimization in Multi-Cell IoT Networks Using NOMA
|
journal | July 2022 |
Similar Records
Autonomous Wireless Technology Detection in Seamless IoT Applications
Seamless Cross-Technology Communication Platform for IoT Applications (Final Report)