Learning Temporal–Spatial Spectrum Reuse

We formulate and study a multi-user multi-armed bandit problem that exploits the temporal-spatial opportunistic spectrum access (OSA) of primary user channels, so that secondary users (SUs) who do not interfere with each other can make use of the same PU channel. We first propose a centralized chann...

Full description

Saved in:
Bibliographic Details
Main Authors: Zhang, Yi, Tay, Wee Peng, Li, Kwok Hung, Esseghir, Moez, Gaiti, Dominique
Other Authors: School of Electrical and Electronic Engineering
Format: Article
Language:English
Published: 2017
Subjects:
Online Access:https://hdl.handle.net/10356/81415
http://hdl.handle.net/10220/43462
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-81415
record_format dspace
spelling sg-ntu-dr.10356-814152020-03-07T13:57:26Z Learning Temporal–Spatial Spectrum Reuse Zhang, Yi Tay, Wee Peng Li, Kwok Hung Esseghir, Moez Gaiti, Dominique School of Electrical and Electronic Engineering Cognitive radio Spectrum reuse We formulate and study a multi-user multi-armed bandit problem that exploits the temporal-spatial opportunistic spectrum access (OSA) of primary user channels, so that secondary users (SUs) who do not interfere with each other can make use of the same PU channel. We first propose a centralized channel allocation policy that has logarithmic regret, but requires a central processor to solve an NP-complete optimization problem at exponentially increasing time intervals. To overcome the high computation complexity at the central processor, we also propose heuristic distributed policies that, however, have linear regrets. Our first distributed policy utilizes a distributed graph coloring and consensus algorithm to determine SUs' channel access ranks, while our second distributed policy incorporates channel access rank learning in a local procedure at each SU at the cost of a higher regret. We compare the performance of our proposed policies with other distributed policies recently proposed for temporal (but not spatial) OSA. We show that all these policies have linear regrets in our temporal-spatial OSA framework. Simulations suggest that our proposed policies have significantly smaller regrets than the other policies when spectrum temporal-spatial reuse is allowed. MOE (Min. of Education, S’pore) Accepted version 2017-07-27T06:44:22Z 2019-12-06T14:30:30Z 2017-07-27T06:44:22Z 2019-12-06T14:30:30Z 2016 Journal Article Zhang, Y., Tay, W. P., Li, K. H., Esseghir, M., & Gaiti, D. (2016). Learning Temporal–Spatial Spectrum Reuse. IEEE Transactions on Communications, 64(7), 3092-3103. 0090-6778 https://hdl.handle.net/10356/81415 http://hdl.handle.net/10220/43462 10.1109/TCOMM.2016.2569093 en IEEE Transactions on Communications © 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/TCOMM.2016.2569093]. 11 p. application/pdf
institution Nanyang Technological University
building NTU Library
country Singapore
collection DR-NTU
language English
topic Cognitive radio
Spectrum reuse
spellingShingle Cognitive radio
Spectrum reuse
Zhang, Yi
Tay, Wee Peng
Li, Kwok Hung
Esseghir, Moez
Gaiti, Dominique
Learning Temporal–Spatial Spectrum Reuse
description We formulate and study a multi-user multi-armed bandit problem that exploits the temporal-spatial opportunistic spectrum access (OSA) of primary user channels, so that secondary users (SUs) who do not interfere with each other can make use of the same PU channel. We first propose a centralized channel allocation policy that has logarithmic regret, but requires a central processor to solve an NP-complete optimization problem at exponentially increasing time intervals. To overcome the high computation complexity at the central processor, we also propose heuristic distributed policies that, however, have linear regrets. Our first distributed policy utilizes a distributed graph coloring and consensus algorithm to determine SUs' channel access ranks, while our second distributed policy incorporates channel access rank learning in a local procedure at each SU at the cost of a higher regret. We compare the performance of our proposed policies with other distributed policies recently proposed for temporal (but not spatial) OSA. We show that all these policies have linear regrets in our temporal-spatial OSA framework. Simulations suggest that our proposed policies have significantly smaller regrets than the other policies when spectrum temporal-spatial reuse is allowed.
author2 School of Electrical and Electronic Engineering
author_facet School of Electrical and Electronic Engineering
Zhang, Yi
Tay, Wee Peng
Li, Kwok Hung
Esseghir, Moez
Gaiti, Dominique
format Article
author Zhang, Yi
Tay, Wee Peng
Li, Kwok Hung
Esseghir, Moez
Gaiti, Dominique
author_sort Zhang, Yi
title Learning Temporal–Spatial Spectrum Reuse
title_short Learning Temporal–Spatial Spectrum Reuse
title_full Learning Temporal–Spatial Spectrum Reuse
title_fullStr Learning Temporal–Spatial Spectrum Reuse
title_full_unstemmed Learning Temporal–Spatial Spectrum Reuse
title_sort learning temporal–spatial spectrum reuse
publishDate 2017
url https://hdl.handle.net/10356/81415
http://hdl.handle.net/10220/43462
_version_ 1681047909199511552