Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks

This paper investigates the problem of distributed resource management in two-tier heterogeneous networks, where each cell selects its joint device association, spectrum allocation, and power allocation strategy based only on locally-observed information without any central controller. As the optimi...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yang, Helin, Zhao, Jun, Lam, Kwok-Yan, Xiong, Zehui, Wu, Qingqing, Xiao, Liang
Other Authors:	School of Computer Science and Engineering
Format:	Article
Language:	English
Published:	2023
Subjects:	Engineering::Computer science and engineering Heterogeneous Wireless Networks Distributed Resource Management
Online Access:	https://hdl.handle.net/10356/166422
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-166422
record_format	dspace
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering Heterogeneous Wireless Networks Distributed Resource Management
spellingShingle	Engineering::Computer science and engineering Heterogeneous Wireless Networks Distributed Resource Management Yang, Helin Zhao, Jun Lam, Kwok-Yan Xiong, Zehui Wu, Qingqing Xiao, Liang Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
description	This paper investigates the problem of distributed resource management in two-tier heterogeneous networks, where each cell selects its joint device association, spectrum allocation, and power allocation strategy based only on locally-observed information without any central controller. As the optimization problem with devices' quality-of-service (QoS) constraints is non-convex and NP-hard, we model it as a Markov decision process (MDP). Considering the fact that the network is highly complex with large state and action spaces, a multi-agent dueling deep-Q network-based algorithm combined with distributed coordinated learning is proposed to effectively learn the optimized intelligent resource management policy, where the algorithm adopts dueling deep network to learn the action-value distribution by estimating both the state-value and action advantage functions. Under the distributed coordinated learning manner and dueling architecture, the learning algorithm can rapidly converge to the optimized policy. Simulation results demonstrate that the proposed distributed coordinated learning algorithm outperforms other existing learning algorithms in terms of learning efficiency, network data rate, and QoS satisfaction probability.
author2	School of Computer Science and Engineering
author_facet	School of Computer Science and Engineering Yang, Helin Zhao, Jun Lam, Kwok-Yan Xiong, Zehui Wu, Qingqing Xiao, Liang
format	Article
author	Yang, Helin Zhao, Jun Lam, Kwok-Yan Xiong, Zehui Wu, Qingqing Xiao, Liang
author_sort	Yang, Helin
title	Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
title_short	Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
title_full	Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
title_fullStr	Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
title_full_unstemmed	Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
title_sort	distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks
publishDate	2023
url	https://hdl.handle.net/10356/166422
_version_	1765213842676121600
spelling	sg-ntu-dr.10356-1664222023-04-28T15:37:34Z Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks Yang, Helin Zhao, Jun Lam, Kwok-Yan Xiong, Zehui Wu, Qingqing Xiao, Liang School of Computer Science and Engineering Strategic Centre for Research in Privacy-Preserving Technologies and Systems Engineering::Computer science and engineering Heterogeneous Wireless Networks Distributed Resource Management This paper investigates the problem of distributed resource management in two-tier heterogeneous networks, where each cell selects its joint device association, spectrum allocation, and power allocation strategy based only on locally-observed information without any central controller. As the optimization problem with devices' quality-of-service (QoS) constraints is non-convex and NP-hard, we model it as a Markov decision process (MDP). Considering the fact that the network is highly complex with large state and action spaces, a multi-agent dueling deep-Q network-based algorithm combined with distributed coordinated learning is proposed to effectively learn the optimized intelligent resource management policy, where the algorithm adopts dueling deep network to learn the action-value distribution by estimating both the state-value and action advantage functions. Under the distributed coordinated learning manner and dueling architecture, the learning algorithm can rapidly converge to the optimized policy. Simulation results demonstrate that the proposed distributed coordinated learning algorithm outperforms other existing learning algorithms in terms of learning efficiency, network data rate, and QoS satisfaction probability. Ministry of Education (MOE) Nanyang Technological University National Research Foundation (NRF) Submitted/Accepted version This work was supported in part by the National Research Foundation (NRF), Singapore, under its Strategic Capability Research Centres Funding Initiative; in part by the Nanyang Technological University (NTU) Startup Grant; in part by the Alibaba-NTU Singapore Joint Research Institute; in part by the Singapore Ministry of Education Academic Research Fund under Grant Tier 1 RG97/20, Grant Tier 1 RG24/20, Grant Tier 1 RT07/19, Grant Tier 1 RT01/19, and Grant Tier 2 MOE2019-T2-1-176; in part by the NTU-Wallenberg AI, Autonomous Systems and Software Program (WASP) Joint Project; in part by the Energy Research Institute @ NTU; in part by the Singapore NRF National Satellite of Excellence, Design Science and Technology for Secure Critical Infrastructure under Grant NSoE DeST-SCI2019-0012; in part by the Artificial Intelligence (AI) Singapore 100 Experiments (100E) Programme; in part by the NTU Project for Large Vertical Take-Off and Landing Research Platform; in part by the Singapore University of Technology and Design (SUTD) under Grant SRG-ISTD-2021-165; in part by the SUTD-Zhejiang University (ZJU) IDEA Grant under Grant SUTD-ZJU (VP) 202102; in part by the SUTD-ZJU IDEA Seed Grant under Grant SUTD-ZJU (SD) 202101; and in part by the Natural Science Foundation of China under Grant 61971366 and Grant U21A20444. 2023-04-25T01:36:57Z 2023-04-25T01:36:57Z 2022 Journal Article Yang, H., Zhao, J., Lam, K., Xiong, Z., Wu, Q. & Xiao, L. (2022). Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks. IEEE Transactions On Wireless Communications, 21(9), 6935-6948. https://dx.doi.org/10.1109/TWC.2022.3153175 1536-1276 https://hdl.handle.net/10356/166422 10.1109/TWC.2022.3153175 2-s2.0-85125705570 9 21 6935 6948 en RG97/20 RG24/20 RT07/19 RT01/19 MOE2019-T2-1-176 NSoE DeST-SCI2019-0012 SRG-ISTD-2021-165 SUTD-ZJU (VP) 202102 SUTD-ZJU (SD) 202101 IEEE Transactions on Wireless Communications © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/TWC.2022.3153175. application/pdf

Distributed deep reinforcement learning-based spectrum and power allocation for heterogeneous networks

Similar Items