Configurable mirror descent : Towards a unification of decision making

Decision-making problems, categorized as single-agent, e.g., Atari, cooperative multi-agent, e.g., Hanabi, competitive multi-agent, e.g., Hold’em poker, and mixed cooperative and competitive, e.g., football, are ubiquitous in the real world. Although various methods have been proposed to address the...

Full description

Saved in:

Bibliographic Details
Main Authors:	LI, Pengdeng, LI, Shuxin, YANG, Chang, WANG, Xinrun, CHAN, Hau, AN, Bo
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2024
Subjects:	Decision making categorization Decision making algorithm Reinforcement learning Machine learning Artificial Intelligence and Robotics Management Information Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/9829 https://ink.library.smu.edu.sg/context/sis_research/article/10829/viewcontent/li24an.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-10829
record_format	dspace
spelling	sg-smu-ink.sis_research-108292024-12-24T03:36:45Z Configurable mirror descent : Towards a unification of decision making LI, Pengdeng LI, Shuxin YANG, Chang WANG, Xinrun CHAN, Hau AN, Bo Decision-making problems, categorized as single-agent, e.g., Atari, cooperative multi-agent, e.g., Hanabi, competitive multi-agent, e.g., Hold’em poker, and mixed cooperative and competitive, e.g., football, are ubiquitous in the real world. Although various methods have been proposed to address the specific decision-making categories, these methods typically evolve independently and cannot generalize to other categories. Therefore, a fundamental question for decision-making is: Can we develop a single algorithm to tackle ALL categories of decision-making problems? There are several main challenges to address this question: i) different decision-making categories involve different numbers of agents and different relationships between agents, ii) different categories have different solution concepts and evaluation measures, and iii) there lacks a comprehensive benchmark covering all the categories. This work presents a preliminary attempt to address the question with three main contributions. i) We propose the generalized mirror descent (GMD), a generalization of MD variants, which considers multiple historical policies and works with a broader class of Bregman divergences. ii) We propose the configurable mirror descent (CMD) where a meta-controller is introduced to dynamically adjust the hyper-parameters in GMD conditional on the evaluation measures. iii) We construct the GameBench with 15 academic-friendly games across different decision-making categories. Extensive experiments demonstrate that CMD achieves empirically competitive or better outcomes compared to baselines while providing the capability of exploring diverse dimensions of decision making. 2024-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9829 https://ink.library.smu.edu.sg/context/sis_research/article/10829/viewcontent/li24an.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Decision making categorization Decision making algorithm Reinforcement learning Machine learning Artificial Intelligence and Robotics Management Information Systems
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Decision making categorization Decision making algorithm Reinforcement learning Machine learning Artificial Intelligence and Robotics Management Information Systems
spellingShingle	Decision making categorization Decision making algorithm Reinforcement learning Machine learning Artificial Intelligence and Robotics Management Information Systems LI, Pengdeng LI, Shuxin YANG, Chang WANG, Xinrun CHAN, Hau AN, Bo Configurable mirror descent : Towards a unification of decision making
description	Decision-making problems, categorized as single-agent, e.g., Atari, cooperative multi-agent, e.g., Hanabi, competitive multi-agent, e.g., Hold’em poker, and mixed cooperative and competitive, e.g., football, are ubiquitous in the real world. Although various methods have been proposed to address the specific decision-making categories, these methods typically evolve independently and cannot generalize to other categories. Therefore, a fundamental question for decision-making is: Can we develop a single algorithm to tackle ALL categories of decision-making problems? There are several main challenges to address this question: i) different decision-making categories involve different numbers of agents and different relationships between agents, ii) different categories have different solution concepts and evaluation measures, and iii) there lacks a comprehensive benchmark covering all the categories. This work presents a preliminary attempt to address the question with three main contributions. i) We propose the generalized mirror descent (GMD), a generalization of MD variants, which considers multiple historical policies and works with a broader class of Bregman divergences. ii) We propose the configurable mirror descent (CMD) where a meta-controller is introduced to dynamically adjust the hyper-parameters in GMD conditional on the evaluation measures. iii) We construct the GameBench with 15 academic-friendly games across different decision-making categories. Extensive experiments demonstrate that CMD achieves empirically competitive or better outcomes compared to baselines while providing the capability of exploring diverse dimensions of decision making.
format	text
author	LI, Pengdeng LI, Shuxin YANG, Chang WANG, Xinrun CHAN, Hau AN, Bo
author_facet	LI, Pengdeng LI, Shuxin YANG, Chang WANG, Xinrun CHAN, Hau AN, Bo
author_sort	LI, Pengdeng
title	Configurable mirror descent : Towards a unification of decision making
title_short	Configurable mirror descent : Towards a unification of decision making
title_full	Configurable mirror descent : Towards a unification of decision making
title_fullStr	Configurable mirror descent : Towards a unification of decision making
title_full_unstemmed	Configurable mirror descent : Towards a unification of decision making
title_sort	configurable mirror descent : towards a unification of decision making
publisher	Institutional Knowledge at Singapore Management University
publishDate	2024
url	https://ink.library.smu.edu.sg/sis_research/9829 https://ink.library.smu.edu.sg/context/sis_research/article/10829/viewcontent/li24an.pdf
_version_	1821237241550209024

Configurable mirror descent : Towards a unification of decision making

Similar Items