VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints
The radio frequency spectrum crunch has triggered the harnessing of other sources of bandwidth, for which visible light is a promising candidate. Even though visible light communication (VLC) ensures high capacity, coverage is limited. This necessitates the integration of VLC and device-To-device (D...
Saved in:
Main Authors: | , , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2021
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/150746 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-150746 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1507462021-06-14T04:49:10Z VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints Raveendran, Neetu Zhang, Huaqing Niyato, Dusit Yang, Fang Song, Jian Han, Zhu School of Computer Science and Engineering Engineering::Computer science and engineering Visible Light Communication Device-to-device The radio frequency spectrum crunch has triggered the harnessing of other sources of bandwidth, for which visible light is a promising candidate. Even though visible light communication (VLC) ensures high capacity, coverage is limited. This necessitates the integration of VLC and device-To-device (D2D) technologies into heterogeneous networks. In particular, mobile users which are accessible by the VLC transmitters can relay data to mobile users which are not, by means of D2D communication. However, due to the distributed behaviors of mobile users, determining optimal data transmission routes from VLC transmitters to end mobile devices is a major challenge. In this paper, we propose a reinforcement learning (RL)-based approach to determine multi-hop data transmission routes in an indoor VLC-D2D heterogeneous network. We obtain the rewards for the RL-based method dynamically, by formulating the interactions between the mobile users relaying the data as an equilibrium problem with equilibrium constraints and using alternating direction method of multipliers to solve it. The proposed technique can achieve optimal data transmission routes in a distributed manner. The simulation results demonstrate the effectiveness of the proposed approach, showing that transmission routes with low delays and high capacities can be achieved through the learning algorithm. Energy Market Authority (EMA) Ministry of Education (MOE) Nanyang Technological University National Research Foundation (NRF) This work was supported in part by US MURI AFOSR MURI under Grant 18RT0073, in part by the NSF under Grant CNS-1717454, Grant CNS-1731424, Grant CNS-1702850, Grant CNS-1646607, and Grant ECCS-1547201, in part by the National Natural Science Foundation of China under Grant 61871255, in part by the Natural Science Foundation of Guangdong Province under Grant 2015A030312006, in part by the Guangdong Key Laboratory Project under Grant 2017B030314147, in part by WASP/NTU under Grant M4082187 (4080), in part by the Singapore MOE Tier 1 under Grant 2017-T1-002-007 RG122/17, in part by the MOE Tier 2 under Grant MOE2014-T2-2-015 ARC4/15 and Grant NRF2015-NRF-ISF001-2277, and in part by EMA Energy Resilience under Grant NRF2017EWT-EP003-041. 2021-06-14T04:49:10Z 2021-06-14T04:49:10Z 2019 Journal Article Raveendran, N., Zhang, H., Niyato, D., Yang, F., Song, J. & Han, Z. (2019). VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints. IEEE Transactions On Wireless Communications, 18(2), 1115-1127. https://dx.doi.org/10.1109/TWC.2018.2890057 1536-1276 0000-0002-4660-8893 0000-0001-8791-1672 0000-0002-7442-7416 0000-0003-3575-5086 https://hdl.handle.net/10356/150746 10.1109/TWC.2018.2890057 2-s2.0-85061728200 2 18 1115 1127 en M4082187 (4080) 2017-T1-002-007 RG122/17 MOE2014-T2-2-015 ARC4/15 NRF2015-NRF-ISF001-2277 NRF2017EWT-EP003-041 IEEE Transactions on Wireless Communications © 2019 IEEE. All rights reserved. |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering Visible Light Communication Device-to-device |
spellingShingle |
Engineering::Computer science and engineering Visible Light Communication Device-to-device Raveendran, Neetu Zhang, Huaqing Niyato, Dusit Yang, Fang Song, Jian Han, Zhu VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints |
description |
The radio frequency spectrum crunch has triggered the harnessing of other sources of bandwidth, for which visible light is a promising candidate. Even though visible light communication (VLC) ensures high capacity, coverage is limited. This necessitates the integration of VLC and device-To-device (D2D) technologies into heterogeneous networks. In particular, mobile users which are accessible by the VLC transmitters can relay data to mobile users which are not, by means of D2D communication. However, due to the distributed behaviors of mobile users, determining optimal data transmission routes from VLC transmitters to end mobile devices is a major challenge. In this paper, we propose a reinforcement learning (RL)-based approach to determine multi-hop data transmission routes in an indoor VLC-D2D heterogeneous network. We obtain the rewards for the RL-based method dynamically, by formulating the interactions between the mobile users relaying the data as an equilibrium problem with equilibrium constraints and using alternating direction method of multipliers to solve it. The proposed technique can achieve optimal data transmission routes in a distributed manner. The simulation results demonstrate the effectiveness of the proposed approach, showing that transmission routes with low delays and high capacities can be achieved through the learning algorithm. |
author2 |
School of Computer Science and Engineering |
author_facet |
School of Computer Science and Engineering Raveendran, Neetu Zhang, Huaqing Niyato, Dusit Yang, Fang Song, Jian Han, Zhu |
format |
Article |
author |
Raveendran, Neetu Zhang, Huaqing Niyato, Dusit Yang, Fang Song, Jian Han, Zhu |
author_sort |
Raveendran, Neetu |
title |
VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints |
title_short |
VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints |
title_full |
VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints |
title_fullStr |
VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints |
title_full_unstemmed |
VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints |
title_sort |
vlc and d2d heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints |
publishDate |
2021 |
url |
https://hdl.handle.net/10356/150746 |
_version_ |
1703971181827194880 |