VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints

The radio frequency spectrum crunch has triggered the harnessing of other sources of bandwidth, for which visible light is a promising candidate. Even though visible light communication (VLC) ensures high capacity, coverage is limited. This necessitates the integration of VLC and device-to-device (D...

Bibliographic Details
Main Authors:	Raveendran, Neetu; Zhang, Huaqing; Niyato, Dusit; Yang, Fang; Song, Jian; Han, Zhu
Other Authors:	School of Computer Science and Engineering
Format:	Article
Language:	English
Published:	2021
Subjects:	Engineering::Computer science and engineering; Visible Light Communication; Device-to-device
Online Access:	https://hdl.handle.net/10356/150746

id	sg-ntu-dr.10356-150746
record_format	dspace
spelling	sg-ntu-dr.10356-150746 2021-06-14T04:49:10Z Energy Market Authority (EMA); Ministry of Education (MOE); Nanyang Technological University; National Research Foundation (NRF). This work was supported in part by US MURI AFOSR MURI under Grant 18RT0073, in part by the NSF under Grant CNS-1717454, Grant CNS-1731424, Grant CNS-1702850, Grant CNS-1646607, and Grant ECCS-1547201, in part by the National Natural Science Foundation of China under Grant 61871255, in part by the Natural Science Foundation of Guangdong Province under Grant 2015A030312006, in part by the Guangdong Key Laboratory Project under Grant 2017B030314147, in part by WASP/NTU under Grant M4082187 (4080), in part by the Singapore MOE Tier 1 under Grant 2017-T1-002-007 RG122/17, in part by the MOE Tier 2 under Grant MOE2014-T2-2-015 ARC4/15 and Grant NRF2015-NRF-ISF001-2277, and in part by EMA Energy Resilience under Grant NRF2017EWT-EP003-041.
2021-06-14T04:49:10Z 2021-06-14T04:49:10Z 2019 Journal Article Raveendran, N., Zhang, H., Niyato, D., Yang, F., Song, J. & Han, Z. (2019). VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints. IEEE Transactions on Wireless Communications, 18(2), 1115-1127. https://dx.doi.org/10.1109/TWC.2018.2890057 1536-1276 0000-0002-4660-8893 0000-0001-8791-1672 0000-0002-7442-7416 0000-0003-3575-5086 https://hdl.handle.net/10356/150746 10.1109/TWC.2018.2890057 2-s2.0-85061728200 2 18 1115 1127 en M4082187 (4080) 2017-T1-002-007 RG122/17 MOE2014-T2-2-015 ARC4/15 NRF2015-NRF-ISF001-2277 NRF2017EWT-EP003-041 IEEE Transactions on Wireless Communications © 2019 IEEE. All rights reserved.
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering Visible Light Communication Device-to-device
description	The radio frequency spectrum crunch has triggered the harnessing of other sources of bandwidth, for which visible light is a promising candidate. Even though visible light communication (VLC) ensures high capacity, its coverage is limited. This necessitates the integration of VLC and device-to-device (D2D) technologies into heterogeneous networks. In particular, mobile users that are accessible by the VLC transmitters can relay data, by means of D2D communication, to mobile users that are not. However, due to the distributed behaviors of mobile users, determining optimal data transmission routes from VLC transmitters to end mobile devices is a major challenge. In this paper, we propose a reinforcement learning (RL)-based approach to determine multi-hop data transmission routes in an indoor VLC-D2D heterogeneous network. We obtain the rewards for the RL-based method dynamically, by formulating the interactions between the mobile users relaying the data as an equilibrium problem with equilibrium constraints and solving it with the alternating direction method of multipliers. The proposed technique can achieve optimal data transmission routes in a distributed manner. The simulation results demonstrate the effectiveness of the proposed approach, showing that transmission routes with low delays and high capacities can be achieved through the learning algorithm.
author2	School of Computer Science and Engineering
author_facet	School of Computer Science and Engineering Raveendran, Neetu Zhang, Huaqing Niyato, Dusit Yang, Fang Song, Jian Han, Zhu
format	Article
author	Raveendran, Neetu Zhang, Huaqing Niyato, Dusit Yang, Fang Song, Jian Han, Zhu
author_sort	Raveendran, Neetu
title	VLC and D2D heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints
title_sort	vlc and d2d heterogeneous network optimization : a reinforcement learning approach based on equilibrium problems with equilibrium constraints
publishDate	2021
url	https://hdl.handle.net/10356/150746
_version_	1703971181827194880
