Dynamic spectrum access for Internet-of-Things based on federated deep reinforcement learning
The explosive growth of Internet-of-Things (IoT) applications such as smart cities and Industry 4.0 have led to drastic increase in demand for wireless bandwidth, hence motivating the rapid development of new techniques for enhancing spectrum utilization needed by new generation wireless communicati...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/163822 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-163822 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1638222023-05-26T15:36:28Z Dynamic spectrum access for Internet-of-Things based on federated deep reinforcement learning Li, Feng Shen, Bowen Guo, Jiale Lam, Kwok-Yan Wei, Guiyi Wang, Li School of Computer Science and Engineering Engineering::Electrical and electronic engineering Internet of Things Collaborative Work The explosive growth of Internet-of-Things (IoT) applications such as smart cities and Industry 4.0 have led to drastic increase in demand for wireless bandwidth, hence motivating the rapid development of new techniques for enhancing spectrum utilization needed by new generation wireless communication technologies. Among others, dynamic spectrum access (DSA) is one of the most widely accepted approaches. In this paper, as an enhancement of existing works, we take into consideration of inter-node collaborations in a dynamic spectrum environment. Typically, in such distributed circumstances, intelligent dynamic spectrum access almost invariably relies on self-learning to achieve dynamic spectrum access improvement. Whereas, this paper proposes a DSA scheme based on deep reinforcement learning to enhance spectrum and access efficiency. Unlike traditional Q-learning-based DSA, we introduce the following to enhance the spectrum efficiency in dynamic IoT spectrum environments. First, deep double Q-learning is adopted to perform local self-spectrum-learning for IoT terminals in order to achieve better dynamic access accuracy. Second, to accelerate learning convergence, federated learning (FL) in edge nodes is used to improve the self-learning. Third, multiple secondary users, who do not interfere with each other and have similar operation condition, are clustered for federated learning to enhance the efficiency of deep reinforcement learning. Comparing with the traditional distributed DSA with deep learning, the proposed scheme has faster access convergence speed due to the characteristic of global optimization for federated learning. Based on this, a framework of federated deep reinforcement learning (FDRL) for DSA is proposed. Furthermore, this scheme preserves privacy of IoT users in that FDRL only requires model parameters to be uploaded to edge servers. Simulations are performed to show the effectiveness of theproposed FDRL-based DSA framework. National Research Foundation (NRF) Submitted/Accepted version This work was supported in part by the National Research Foundation, Singapore under its Strategic Capability Research Centres Funding Initiative any opinions, findings and conclusions or recommendations expressed in this material are those of the author(s) and do not reflect the views of National Research Foundation, Singapore and in part by the Fundamental Research Funds for the Central Universities under Grant 3132021335. 2022-12-19T05:22:52Z 2022-12-19T05:22:52Z 2022 Journal Article Li, F., Shen, B., Guo, J., Lam, K., Wei, G. & Wang, L. (2022). Dynamic spectrum access for Internet-of-Things based on federated deep reinforcement learning. IEEE Transactions On Vehicular Technology, 71(7), 7952-7956. https://dx.doi.org/10.1109/TVT.2022.3166535 0018-9545 https://hdl.handle.net/10356/163822 10.1109/TVT.2022.3166535 2-s2.0-85128692697 7 71 7952 7956 en IEEE Transactions on Vehicular Technology © 2022 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: https://doi.org/10.1109/TVT.2022.3166535. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering Internet of Things Collaborative Work |
spellingShingle |
Engineering::Electrical and electronic engineering Internet of Things Collaborative Work Li, Feng Shen, Bowen Guo, Jiale Lam, Kwok-Yan Wei, Guiyi Wang, Li Dynamic spectrum access for Internet-of-Things based on federated deep reinforcement learning |
description |
The explosive growth of Internet-of-Things (IoT) applications such as smart cities and Industry 4.0 have led to drastic increase in demand for wireless bandwidth, hence motivating the rapid development of new techniques for enhancing spectrum utilization needed by new generation wireless communication technologies. Among others, dynamic spectrum access (DSA) is one of the most widely accepted approaches. In this paper, as an enhancement of existing works, we take into consideration of inter-node collaborations in a dynamic spectrum environment. Typically, in such distributed circumstances, intelligent dynamic spectrum access almost invariably relies on self-learning to achieve dynamic spectrum access improvement. Whereas, this paper proposes a DSA scheme based on deep reinforcement learning to enhance spectrum and access efficiency. Unlike traditional Q-learning-based DSA, we introduce the following to enhance the spectrum efficiency in dynamic IoT spectrum environments. First, deep double Q-learning is adopted to perform local self-spectrum-learning for IoT terminals in order to achieve better dynamic access accuracy. Second, to accelerate learning convergence, federated learning (FL) in edge nodes is used to improve the self-learning. Third, multiple secondary users, who do not interfere with each other and have similar operation condition, are clustered for federated learning to enhance the efficiency of deep reinforcement learning. Comparing with the traditional distributed DSA with deep learning, the proposed scheme has faster access convergence speed due to the characteristic of global optimization for federated learning. Based on this, a framework of federated deep reinforcement learning (FDRL) for DSA is proposed. Furthermore, this scheme preserves privacy of IoT users in that FDRL only requires model parameters to be uploaded to edge servers. Simulations are performed to show the effectiveness of theproposed FDRL-based DSA framework. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Li, Feng Shen, Bowen Guo, Jiale Lam, Kwok-Yan Wei, Guiyi Wang, Li |
format |
Article |
author |
Li, Feng Shen, Bowen Guo, Jiale Lam, Kwok-Yan Wei, Guiyi Wang, Li |
author_sort |
Li, Feng |
title |
Dynamic spectrum access for Internet-of-Things based on federated deep reinforcement learning |
title_short |
Dynamic spectrum access for Internet-of-Things based on federated deep reinforcement learning |
title_full |
Dynamic spectrum access for Internet-of-Things based on federated deep reinforcement learning |
title_fullStr |
Dynamic spectrum access for Internet-of-Things based on federated deep reinforcement learning |
title_full_unstemmed |
Dynamic spectrum access for Internet-of-Things based on federated deep reinforcement learning |
title_sort |
dynamic spectrum access for internet-of-things based on federated deep reinforcement learning |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/163822 |
_version_ |
1772826434345107456 |