VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints

The radio frequency spectrum crunch has triggered the harnessing of other sources of bandwidth, for which visible light is a promising candidate. Even though visible light communication (VLC) ensures high capacity, coverage is limited. This necessitates the integration of VLC and device-To-device (D...

Full description

Saved in:
Bibliographic Details
Main Authors: Raveendran, Neetu, Zhang, Huaqing, Niyato, Dusit, Yang, Fang, Song, Jian, Han, Zhu
Other Authors: School of Computer Science and Engineering
Format: Article
Language:English
Published: 2021
Subjects:
Online Access:https://hdl.handle.net/10356/150746
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-150746
record_format dspace
spelling sg-ntu-dr.10356-1507462021-06-14T04:49:10Z VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints Raveendran, Neetu Zhang, Huaqing Niyato, Dusit Yang, Fang Song, Jian Han, Zhu School of Computer Science and Engineering Engineering::Computer science and engineering Visible Light Communication Device-to-device The radio frequency spectrum crunch has triggered the harnessing of other sources of bandwidth, for which visible light is a promising candidate. Even though visible light communication (VLC) ensures high capacity, coverage is limited. This necessitates the integration of VLC and device-To-device (D2D) technologies into heterogeneous networks. In particular, mobile users which are accessible by the VLC transmitters can relay data to mobile users which are not, by means of D2D communication. However, due to the distributed behaviors of mobile users, determining optimal data transmission routes from VLC transmitters to end mobile devices is a major challenge. In this paper, we propose a reinforcement learning (RL)-based approach to determine multi-hop data transmission routes in an indoor VLC-D2D heterogeneous network. We obtain the rewards for the RL-based method dynamically, by formulating the interactions between the mobile users relaying the data as an equilibrium problem with equilibrium constraints and using alternating direction method of multipliers to solve it. The proposed technique can achieve optimal data transmission routes in a distributed manner. The simulation results demonstrate the effectiveness of the proposed approach, showing that transmission routes with low delays and high capacities can be achieved through the learning algorithm. Energy Market Authority (EMA) Ministry of Education (MOE) Nanyang Technological University National Research Foundation (NRF) This work was supported in part by US MURI AFOSR MURI under Grant 18RT0073, in part by the NSF under Grant CNS-1717454, Grant CNS-1731424, Grant CNS-1702850, Grant CNS-1646607, and Grant ECCS-1547201, in part by the National Natural Science Foundation of China under Grant 61871255, in part by the Natural Science Foundation of Guangdong Province under Grant 2015A030312006, in part by the Guangdong Key Laboratory Project under Grant 2017B030314147, in part by WASP/NTU under Grant M4082187 (4080), in part by the Singapore MOE Tier 1 under Grant 2017-T1-002-007 RG122/17, in part by the MOE Tier 2 under Grant MOE2014-T2-2-015 ARC4/15 and Grant NRF2015-NRF-ISF001-2277, and in part by EMA Energy Resilience under Grant NRF2017EWT-EP003-041. 2021-06-14T04:49:10Z 2021-06-14T04:49:10Z 2019 Journal Article Raveendran, N., Zhang, H., Niyato, D., Yang, F., Song, J. & Han, Z. (2019). VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints. IEEE Transactions On Wireless Communications, 18(2), 1115-1127. https://dx.doi.org/10.1109/TWC.2018.2890057 1536-1276 0000-0002-4660-8893 0000-0001-8791-1672 0000-0002-7442-7416 0000-0003-3575-5086 https://hdl.handle.net/10356/150746 10.1109/TWC.2018.2890057 2-s2.0-85061728200 2 18 1115 1127 en M4082187 (4080) 2017-T1-002-007 RG122/17 MOE2014-T2-2-015 ARC4/15 NRF2015-NRF-ISF001-2277 NRF2017EWT-EP003-041 IEEE Transactions on Wireless Communications © 2019 IEEE. All rights reserved.
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
Visible Light Communication
Device-to-device
spellingShingle Engineering::Computer science and engineering
Visible Light Communication
Device-to-device
Raveendran, Neetu
Zhang, Huaqing
Niyato, Dusit
Yang, Fang
Song, Jian
Han, Zhu
VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints
description The radio frequency spectrum crunch has triggered the harnessing of other sources of bandwidth, for which visible light is a promising candidate. Even though visible light communication (VLC) ensures high capacity, coverage is limited. This necessitates the integration of VLC and device-To-device (D2D) technologies into heterogeneous networks. In particular, mobile users which are accessible by the VLC transmitters can relay data to mobile users which are not, by means of D2D communication. However, due to the distributed behaviors of mobile users, determining optimal data transmission routes from VLC transmitters to end mobile devices is a major challenge. In this paper, we propose a reinforcement learning (RL)-based approach to determine multi-hop data transmission routes in an indoor VLC-D2D heterogeneous network. We obtain the rewards for the RL-based method dynamically, by formulating the interactions between the mobile users relaying the data as an equilibrium problem with equilibrium constraints and using alternating direction method of multipliers to solve it. The proposed technique can achieve optimal data transmission routes in a distributed manner. The simulation results demonstrate the effectiveness of the proposed approach, showing that transmission routes with low delays and high capacities can be achieved through the learning algorithm.
author2 School of Computer Science and Engineering
author_facet School of Computer Science and Engineering
Raveendran, Neetu
Zhang, Huaqing
Niyato, Dusit
Yang, Fang
Song, Jian
Han, Zhu
format Article
author Raveendran, Neetu
Zhang, Huaqing
Niyato, Dusit
Yang, Fang
Song, Jian
Han, Zhu
author_sort Raveendran, Neetu
title VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints
title_short VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints
title_full VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints
title_fullStr VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints
title_full_unstemmed VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints
title_sort vlc and d2d heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints
publishDate 2021
url https://hdl.handle.net/10356/150746
_version_ 1703971181827194880