Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
This paper investigates the problem of distributed resource management in two-tier heterogeneous networks, where each cell selects its joint device association, spectrum allocation, and power allocation strategy based only on locally-observed information without any central controller. As the optimi...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2023
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/166422 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-166422 |
---|---|
record_format |
dspace |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Heterogeneous Wireless Networks Distributed Resource Management |
spellingShingle |
Engineering::Computer science and engineering Heterogeneous Wireless Networks Distributed Resource Management Yang, Helin Zhao, Jun Lam, Kwok-Yan Xiong, Zehui Wu, Qingqing Xiao, Liang Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks |
description |
This paper investigates the problem of distributed resource management in two-tier heterogeneous networks, where each cell selects its joint device association, spectrum allocation, and power allocation strategy based only on locally-observed information without any central controller. As the optimization problem with devices' quality-of-service (QoS) constraints is non-convex and NP-hard, we model it as a Markov decision process (MDP). Considering the fact that the network is highly complex with large state and action spaces, a multi-agent dueling deep-Q network-based algorithm combined with distributed coordinated learning is proposed to effectively learn the optimized intelligent resource management policy, where the algorithm adopts dueling deep network to learn the action-value distribution by estimating both the state-value and action advantage functions. Under the distributed coordinated learning manner and dueling architecture, the learning algorithm can rapidly converge to the optimized policy. Simulation results demonstrate that the proposed distributed coordinated learning algorithm outperforms other existing learning algorithms in terms of learning efficiency, network data rate, and QoS satisfaction probability. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Yang, Helin Zhao, Jun Lam, Kwok-Yan Xiong, Zehui Wu, Qingqing Xiao, Liang |
format |
Article |
author |
Yang, Helin Zhao, Jun Lam, Kwok-Yan Xiong, Zehui Wu, Qingqing Xiao, Liang |
author_sort |
Yang, Helin |
title |
Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks |
title_short |
Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks |
title_full |
Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks |
title_fullStr |
Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks |
title_full_unstemmed |
Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks |
title_sort |
distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks |
publishDate |
2023 |
url |
https://hdl.handle.net/10356/166422 |
_version_ |
1765213842676121600 |
spelling |
sg-ntu-dr.10356-1664222023-04-28T15:37:34Z Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks Yang, Helin Zhao, Jun Lam, Kwok-Yan Xiong, Zehui Wu, Qingqing Xiao, Liang School of Computer Science and Engineering Strategic Centre for Research in Privacy-Preserving Technologies and Systems Engineering::Computer science and engineering Heterogeneous Wireless Networks Distributed Resource Management This paper investigates the problem of distributed resource management in two-tier heterogeneous networks, where each cell selects its joint device association, spectrum allocation, and power allocation strategy based only on locally-observed information without any central controller. As the optimization problem with devices' quality-of-service (QoS) constraints is non-convex and NP-hard, we model it as a Markov decision process (MDP). Considering the fact that the network is highly complex with large state and action spaces, a multi-agent dueling deep-Q network-based algorithm combined with distributed coordinated learning is proposed to effectively learn the optimized intelligent resource management policy, where the algorithm adopts dueling deep network to learn the action-value distribution by estimating both the state-value and action advantage functions. Under the distributed coordinated learning manner and dueling architecture, the learning algorithm can rapidly converge to the optimized policy. Simulation results demonstrate that the proposed distributed coordinated learning algorithm outperforms other existing learning algorithms in terms of learning efficiency, network data rate, and QoS satisfaction probability. Ministry of Education (MOE) Nanyang Technological University National Research Foundation (NRF) Submitted/Accepted version This work was supported in part by the National Research Foundation (NRF), Singapore, under its Strategic Capability Research Centres Funding Initiative; in part by the Nanyang Technological University (NTU) Startup Grant; in part by the Alibaba-NTU Singapore Joint Research Institute; in part by the Singapore Ministry of Education Academic Research Fund under Grant Tier 1 RG97/20, Grant Tier 1 RG24/20, Grant Tier 1 RT07/19, Grant Tier 1 RT01/19, and Grant Tier 2 MOE2019-T2-1-176; in part by the NTU-Wallenberg AI, Autonomous Systems and Software Program (WASP) Joint Project; in part by the Energy Research Institute @ NTU; in part by the Singapore NRF National Satellite of Excellence, Design Science and Technology for Secure Critical Infrastructure under Grant NSoE DeST-SCI2019-0012; in part by the Artificial Intelligence (AI) Singapore 100 Experiments (100E) Programme; in part by the NTU Project for Large Vertical Take-Off and Landing Research Platform; in part by the Singapore University of Technology and Design (SUTD) under Grant SRG-ISTD-2021-165; in part by the SUTD-Zhejiang University (ZJU) IDEA Grant under Grant SUTD-ZJU (VP) 202102; in part by the SUTD-ZJU IDEA Seed Grant under Grant SUTD-ZJU (SD) 202101; and in part by the Natural Science Foundation of China under Grant 61971366 and Grant U21A20444. 2023-04-25T01:36:57Z 2023-04-25T01:36:57Z 2022 Journal Article Yang, H., Zhao, J., Lam, K., Xiong, Z., Wu, Q. & Xiao, L. (2022). Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks. IEEE Transactions On Wireless Communications, 21(9), 6935-6948. https://dx.doi.org/10.1109/TWC.2022.3153175 1536-1276 https://hdl.handle.net/10356/166422 10.1109/TWC.2022.3153175 2-s2.0-85125705570 9 21 6935 6948 en RG97/20 RG24/20 RT07/19 RT01/19 MOE2019-T2-1-176 NSoE DeST-SCI2019-0012 SRG-ISTD-2021-165 SUTD-ZJU (VP) 202102 SUTD-ZJU (SD) 202101 IEEE Transactions on Wireless Communications © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/TWC.2022.3153175. application/pdf |