Abstract
This paper studies multi-agent deep reinforcement learning (MADRL) based resource allocation methods for multi-cell wireless powered communication networks (WPCNs) where multiple hybrid access points (H-APs) wirelessly charge energy-limited users to collect data from them. We design a distributed reinforcement learning strategy where H-APs individually determine time and power allocation variables. Unlike traditional centralized optimization algorithms which require global information collected at a central unit, the proposed MADRL technique models an H-AP as an agent producing its action based only on its own locally observable states. Numerical results verify that the proposed approach can achieve comparable performance of the centralized algorithms.
Original language | English |
---|---|
Article number | 9217951 |
Pages (from-to) | 14055-14060 |
Number of pages | 6 |
Journal | IEEE Transactions on Vehicular Technology |
Volume | 69 |
Issue number | 11 |
DOIs | |
Publication status | Published - 2020 Nov |
Keywords
- Wireless powered communication networks
- actor-critic method
- multi-agent deep reinforcement learning
ASJC Scopus subject areas
- Automotive Engineering
- Aerospace Engineering
- Electrical and Electronic Engineering
- Applied Mathematics