Learning Temporal–Spatial Spectrum Reuse
We formulate and study a multi-user multi-armed bandit problem that exploits the temporal-spatial opportunistic spectrum access (OSA) of primary user channels, so that secondary users (SUs) who do not interfere with each other can make use of the same PU channel. We first propose a centralized chann...
Saved in:
Main Authors: | , , , , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2017
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/81415 http://hdl.handle.net/10220/43462 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-81415 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-814152020-03-07T13:57:26Z Learning Temporal–Spatial Spectrum Reuse Zhang, Yi Tay, Wee Peng Li, Kwok Hung Esseghir, Moez Gaiti, Dominique School of Electrical and Electronic Engineering Cognitive radio Spectrum reuse We formulate and study a multi-user multi-armed bandit problem that exploits the temporal-spatial opportunistic spectrum access (OSA) of primary user channels, so that secondary users (SUs) who do not interfere with each other can make use of the same PU channel. We first propose a centralized channel allocation policy that has logarithmic regret, but requires a central processor to solve an NP-complete optimization problem at exponentially increasing time intervals. To overcome the high computation complexity at the central processor, we also propose heuristic distributed policies that, however, have linear regrets. Our first distributed policy utilizes a distributed graph coloring and consensus algorithm to determine SUs' channel access ranks, while our second distributed policy incorporates channel access rank learning in a local procedure at each SU at the cost of a higher regret. We compare the performance of our proposed policies with other distributed policies recently proposed for temporal (but not spatial) OSA. We show that all these policies have linear regrets in our temporal-spatial OSA framework. Simulations suggest that our proposed policies have significantly smaller regrets than the other policies when spectrum temporal-spatial reuse is allowed. MOE (Min. of Education, S’pore) Accepted version 2017-07-27T06:44:22Z 2019-12-06T14:30:30Z 2017-07-27T06:44:22Z 2019-12-06T14:30:30Z 2016 Journal Article Zhang, Y., Tay, W. P., Li, K. H., Esseghir, M., & Gaiti, D. (2016). Learning Temporal–Spatial Spectrum Reuse. IEEE Transactions on Communications, 64(7), 3092-3103. 0090-6778 https://hdl.handle.net/10356/81415 http://hdl.handle.net/10220/43462 10.1109/TCOMM.2016.2569093 en IEEE Transactions on Communications © 2016 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works. The published version is available at: [http://dx.doi.org/10.1109/TCOMM.2016.2569093]. 11 p. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
country |
Singapore |
collection |
DR-NTU |
language |
English |
topic |
Cognitive radio Spectrum reuse |
spellingShingle |
Cognitive radio Spectrum reuse Zhang, Yi Tay, Wee Peng Li, Kwok Hung Esseghir, Moez Gaiti, Dominique Learning Temporal–Spatial Spectrum Reuse |
description |
We formulate and study a multi-user multi-armed bandit problem that exploits the temporal-spatial opportunistic spectrum access (OSA) of primary user channels, so that secondary users (SUs) who do not interfere with each other can make use of the same PU channel. We first propose a centralized channel allocation policy that has logarithmic regret, but requires a central processor to solve an NP-complete optimization problem at exponentially increasing time intervals. To overcome the high computation complexity at the central processor, we also propose heuristic distributed policies that, however, have linear regrets. Our first distributed policy utilizes a distributed graph coloring and consensus algorithm to determine SUs' channel access ranks, while our second distributed policy incorporates channel access rank learning in a local procedure at each SU at the cost of a higher regret. We compare the performance of our proposed policies with other distributed policies recently proposed for temporal (but not spatial) OSA. We show that all these policies have linear regrets in our temporal-spatial OSA framework. Simulations suggest that our proposed policies have significantly smaller regrets than the other policies when spectrum temporal-spatial reuse is allowed. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Zhang, Yi Tay, Wee Peng Li, Kwok Hung Esseghir, Moez Gaiti, Dominique |
format |
Article |
author |
Zhang, Yi Tay, Wee Peng Li, Kwok Hung Esseghir, Moez Gaiti, Dominique |
author_sort |
Zhang, Yi |
title |
Learning Temporal–Spatial Spectrum Reuse |
title_short |
Learning Temporal–Spatial Spectrum Reuse |
title_full |
Learning Temporal–Spatial Spectrum Reuse |
title_fullStr |
Learning Temporal–Spatial Spectrum Reuse |
title_full_unstemmed |
Learning Temporal–Spatial Spectrum Reuse |
title_sort |
learning temporal–spatial spectrum reuse |
publishDate |
2017 |
url |
https://hdl.handle.net/10356/81415 http://hdl.handle.net/10220/43462 |
_version_ |
1681047909199511552 |