Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning
Parrondo's paradox refers to the counterintuitive phenomenon whereby two losing strategies, when alternated in a certain manner, can result in a winning outcome. Understanding the optimal sequence in Parrondo's games is of significant importance for maximizing profits in various contexts....
Saved in:
Main Authors: | , |
---|---|
Other Authors: | |
Format: | Article |
Language: | English |
Published: |
2024
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/181728 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-181728 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1817282024-12-16T15:35:56Z Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning Cheong, Kang Hao Zhao, Jie School of Physical and Mathematical Sciences Mathematical Sciences Adaptive strategy Parrondo paradox Parrondo's paradox refers to the counterintuitive phenomenon whereby two losing strategies, when alternated in a certain manner, can result in a winning outcome. Understanding the optimal sequence in Parrondo's games is of significant importance for maximizing profits in various contexts. However, the current predefined sequences may not adapt well to changing environments, limiting their potential for achieving the best performance. We posit that the optimal strategy that determines which game to play should be learnable through experience. In this Letter, we propose an efficient and robust approach that leverages Q learning to adaptively learn the optimal sequence in Parrondo's games. Through extensive simulations of coin-tossing games, we demonstrate that the learned switching strategy in Parrondo's games outperforms other predefined sequences in terms of profit. Furthermore, the experimental results show that our proposed method can be easily adjusted to adapt to different cases of capital-dependent games and history-dependent games. Ministry of Education (MOE) Published version This work was supported by the Singapore Ministry of Education (MOE) Academic Research Fund (AcRF) Tier 2 Grant No. MOET2EP50120-0021. 2024-12-16T04:41:31Z 2024-12-16T04:41:31Z 2024 Journal Article Cheong, K. H. & Zhao, J. (2024). Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning. Physical Review Research, 6(3), L032009-. https://dx.doi.org/10.1103/PhysRevResearch.6.L032009 2643-1564 https://hdl.handle.net/10356/181728 10.1103/PhysRevResearch.6.L032009 2-s2.0-85198852807 3 6 L032009 en MOET2EP50120-0021 Physical Review Research © 2024 the Authors. Published by the American Physical Society under the terms of the Creative Commons Attribution 4.0 International license. Further distribution of this work must maintain attribution to the author(s) and the published article’s title, journal citation, and DOI. application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Mathematical Sciences Adaptive strategy Parrondo paradox |
spellingShingle |
Mathematical Sciences Adaptive strategy Parrondo paradox Cheong, Kang Hao Zhao, Jie Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning |
description |
Parrondo's paradox refers to the counterintuitive phenomenon whereby two losing strategies, when alternated in a certain manner, can result in a winning outcome. Understanding the optimal sequence in Parrondo's games is of significant importance for maximizing profits in various contexts. However, the current predefined sequences may not adapt well to changing environments, limiting their potential for achieving the best performance. We posit that the optimal strategy that determines which game to play should be learnable through experience. In this Letter, we propose an efficient and robust approach that leverages Q learning to adaptively learn the optimal sequence in Parrondo's games. Through extensive simulations of coin-tossing games, we demonstrate that the learned switching strategy in Parrondo's games outperforms other predefined sequences in terms of profit. Furthermore, the experimental results show that our proposed method can be easily adjusted to adapt to different cases of capital-dependent games and history-dependent games. |
author2 |
School of Physical and Mathematical Sciences |
author_facet |
School of Physical and Mathematical Sciences Cheong, Kang Hao Zhao, Jie |
format |
Article |
author |
Cheong, Kang Hao Zhao, Jie |
author_sort |
Cheong, Kang Hao |
title |
Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning |
title_short |
Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning |
title_full |
Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning |
title_fullStr |
Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning |
title_full_unstemmed |
Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning |
title_sort |
adaptive strategy optimization in game-theoretic paradigm using reinforcement learning |
publishDate |
2024 |
url |
https://hdl.handle.net/10356/181728 |
_version_ |
1819113018288504832 |