Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning

Parrondo's paradox refers to the counterintuitive phenomenon whereby two losing strategies, when alternated in a certain manner, can result in a winning outcome. Understanding the optimal sequence in Parrondo's games is of significant importance for maximizing profits in various contexts....

Full description

Saved in:
Bibliographic Details
Main Authors: Cheong, Kang Hao, Zhao, Jie
Other Authors: School of Physical and Mathematical Sciences
Format: Article
Language:English
Published: 2024
Subjects:
Online Access:https://hdl.handle.net/10356/181728
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-181728
record_format dspace
spelling sg-ntu-dr.10356-1817282024-12-16T15:35:56Z Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning Cheong, Kang Hao Zhao, Jie School of Physical and Mathematical Sciences Mathematical Sciences Adaptive strategy Parrondo paradox Parrondo's paradox refers to the counterintuitive phenomenon whereby two losing strategies, when alternated in a certain manner, can result in a winning outcome. Understanding the optimal sequence in Parrondo's games is of significant importance for maximizing profits in various contexts. However, the current predefined sequences may not adapt well to changing environments, limiting their potential for achieving the best performance. We posit that the optimal strategy that determines which game to play should be learnable through experience. In this Letter, we propose an efficient and robust approach that leverages Q learning to adaptively learn the optimal sequence in Parrondo's games. Through extensive simulations of coin-tossing games, we demonstrate that the learned switching strategy in Parrondo's games outperforms other predefined sequences in terms of profit. Furthermore, the experimental results show that our proposed method can be easily adjusted to adapt to different cases of capital-dependent games and history-dependent games. Ministry of Education (MOE) Published version This work was supported by the Singapore Ministry of Education (MOE) Academic Research Fund (AcRF) Tier 2 Grant No. MOET2EP50120-0021. 2024-12-16T04:41:31Z 2024-12-16T04:41:31Z 2024 Journal Article Cheong, K. H. & Zhao, J. (2024). Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning. Physical Review Research, 6(3), L032009-. https://dx.doi.org/10.1103/PhysRevResearch.6.L032009 2643-1564 https://hdl.handle.net/10356/181728 10.1103/PhysRevResearch.6.L032009 2-s2.0-85198852807 3 6 L032009 en MOET2EP50120-0021 Physical Review Research © 2024 the Authors. Published by the American Physical Society under the terms of the Creative Commons Attribution 4.0 International license. Further distribution of this work must maintain attribution to the author(s) and the published article’s title, journal citation, and DOI. application/pdf
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Mathematical Sciences
Adaptive strategy
Parrondo paradox
spellingShingle Mathematical Sciences
Adaptive strategy
Parrondo paradox
Cheong, Kang Hao
Zhao, Jie
Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning
description Parrondo's paradox refers to the counterintuitive phenomenon whereby two losing strategies, when alternated in a certain manner, can result in a winning outcome. Understanding the optimal sequence in Parrondo's games is of significant importance for maximizing profits in various contexts. However, the current predefined sequences may not adapt well to changing environments, limiting their potential for achieving the best performance. We posit that the optimal strategy that determines which game to play should be learnable through experience. In this Letter, we propose an efficient and robust approach that leverages Q learning to adaptively learn the optimal sequence in Parrondo's games. Through extensive simulations of coin-tossing games, we demonstrate that the learned switching strategy in Parrondo's games outperforms other predefined sequences in terms of profit. Furthermore, the experimental results show that our proposed method can be easily adjusted to adapt to different cases of capital-dependent games and history-dependent games.
author2 School of Physical and Mathematical Sciences
author_facet School of Physical and Mathematical Sciences
Cheong, Kang Hao
Zhao, Jie
format Article
author Cheong, Kang Hao
Zhao, Jie
author_sort Cheong, Kang Hao
title Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning
title_short Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning
title_full Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning
title_fullStr Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning
title_full_unstemmed Adaptive strategy optimization in game-theoretic paradigm using reinforcement learning
title_sort adaptive strategy optimization in game-theoretic paradigm using reinforcement learning
publishDate 2024
url https://hdl.handle.net/10356/181728
_version_ 1819113018288504832