Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks

This paper investigates the problem of distributed resource management in two-tier heterogeneous networks, where each cell selects its joint device association, spectrum allocation, and power allocation strategy based only on locally-observed information without any central controller. As the optimi...

Full description

Saved in:
Bibliographic Details
Main Authors: Yang, Helin, Zhao, Jun, Lam, Kwok-Yan, Xiong, Zehui, Wu, Qingqing, Xiao, Liang
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2023
Subjects:
Online Access:https://hdl.handle.net/10356/166422
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-166422
record_format dspace
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Heterogeneous Wireless Networks
Distributed Resource Management
spellingShingle Engineering::Computer science and engineering
Heterogeneous Wireless Networks
Distributed Resource Management
Yang, Helin
Zhao, Jun
Lam, Kwok-Yan
Xiong, Zehui
Wu, Qingqing
Xiao, Liang
Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
description This paper investigates the problem of distributed resource management in two-tier heterogeneous networks, where each cell selects its joint device association, spectrum allocation, and power allocation strategy based only on locally-observed information without any central controller. As the optimization problem with devices' quality-of-service (QoS) constraints is non-convex and NP-hard, we model it as a Markov decision process (MDP). Considering the fact that the network is highly complex with large state and action spaces, a multi-agent dueling deep-Q network-based algorithm combined with distributed coordinated learning is proposed to effectively learn the optimized intelligent resource management policy, where the algorithm adopts dueling deep network to learn the action-value distribution by estimating both the state-value and action advantage functions. Under the distributed coordinated learning manner and dueling architecture, the learning algorithm can rapidly converge to the optimized policy. Simulation results demonstrate that the proposed distributed coordinated learning algorithm outperforms other existing learning algorithms in terms of learning efficiency, network data rate, and QoS satisfaction probability.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Yang, Helin
Zhao, Jun
Lam, Kwok-Yan
Xiong, Zehui
Wu, Qingqing
Xiao, Liang
format Article
author Yang, Helin
Zhao, Jun
Lam, Kwok-Yan
Xiong, Zehui
Wu, Qingqing
Xiao, Liang
author_sort Yang, Helin
title Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
title_short Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
title_full Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
title_fullStr Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
title_full_unstemmed Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
title_sort distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
publishDate 2023
url https://hdl.handle.net/10356/166422
_version_ 1765213842676121600
spelling sg-ntu-dr.10356-1664222023-04-28T15:37:34Z Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks Yang, Helin Zhao, Jun Lam, Kwok-Yan Xiong, Zehui Wu, Qingqing Xiao, Liang School of Computer Science and Engineering Strategic Centre for Research in Privacy-Preserving Technologies and Systems Engineering::Computer science and engineering Heterogeneous Wireless Networks Distributed Resource Management This paper investigates the problem of distributed resource management in two-tier heterogeneous networks, where each cell selects its joint device association, spectrum allocation, and power allocation strategy based only on locally-observed information without any central controller. As the optimization problem with devices' quality-of-service (QoS) constraints is non-convex and NP-hard, we model it as a Markov decision process (MDP). Considering the fact that the network is highly complex with large state and action spaces, a multi-agent dueling deep-Q network-based algorithm combined with distributed coordinated learning is proposed to effectively learn the optimized intelligent resource management policy, where the algorithm adopts dueling deep network to learn the action-value distribution by estimating both the state-value and action advantage functions. Under the distributed coordinated learning manner and dueling architecture, the learning algorithm can rapidly converge to the optimized policy. Simulation results demonstrate that the proposed distributed coordinated learning algorithm outperforms other existing learning algorithms in terms of learning efficiency, network data rate, and QoS satisfaction probability. Ministry of Education (MOE) Nanyang Technological University National Research Foundation (NRF) Submitted/Accepted version This work was supported in part by the National Research Foundation (NRF), Singapore, under its Strategic Capability Research Centres Funding Initiative; in part by the Nanyang Technological University (NTU) Startup Grant; in part by the Alibaba-NTU Singapore Joint Research Institute; in part by the Singapore Ministry of Education Academic Research Fund under Grant Tier 1 RG97/20, Grant Tier 1 RG24/20, Grant Tier 1 RT07/19, Grant Tier 1 RT01/19, and Grant Tier 2 MOE2019-T2-1-176; in part by the NTU-Wallenberg AI, Autonomous Systems and Software Program (WASP) Joint Project; in part by the Energy Research Institute @ NTU; in part by the Singapore NRF National Satellite of Excellence, Design Science and Technology for Secure Critical Infrastructure under Grant NSoE DeST-SCI2019-0012; in part by the Artificial Intelligence (AI) Singapore 100 Experiments (100E) Programme; in part by the NTU Project for Large Vertical Take-Off and Landing Research Platform; in part by the Singapore University of Technology and Design (SUTD) under Grant SRG-ISTD-2021-165; in part by the SUTD-Zhejiang University (ZJU) IDEA Grant under Grant SUTD-ZJU (VP) 202102; in part by the SUTD-ZJU IDEA Seed Grant under Grant SUTD-ZJU (SD) 202101; and in part by the Natural Science Foundation of China under Grant 61971366 and Grant U21A20444. 2023-04-25T01:36:57Z 2023-04-25T01:36:57Z 2022 Journal Article Yang, H., Zhao, J., Lam, K., Xiong, Z., Wu, Q. & Xiao, L. (2022). Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks. IEEE Transactions On Wireless Communications, 21(9), 6935-6948. https://dx.doi.org/10.1109/TWC.2022.3153175 1536-1276 https://hdl.handle.net/10356/166422 10.1109/TWC.2022.3153175 2-s2.0-85125705570 9 21 6935 6948 en RG97/20 RG24/20 RT07/19 RT01/19 MOE2019-T2-1-176 NSoE DeST-SCI2019-0012 SRG-ISTD-2021-165 SUTD-ZJU (VP) 202102 SUTD-ZJU (SD) 202101 IEEE Transactions on Wireless Communications © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/TWC.2022.3153175. application/pdf