Neural regret-matching for distributed constraint optimization problems

Distributed constraint optimization problems (DCOPs) are a powerful model for multi-agent coordination and optimization, where information and controls are distributed among multiple agents by nature. Sampling-based algorithms are important incomplete techniques for solving medium-scale DCOPs. Howev...

Full description

Saved in:

Bibliographic Details
Main Authors:	DENG, Yanchen, YU, Runshen, WANG, Xinrun, AN, Bo
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2021
Subjects:	Agent-based and Multi-agent Systems Coordination and Cooperation Constraints and SAT: Constraint OptimizationConstraints and SAT Distributed Constraints Artificial Intelligence and Robotics Theory and Algorithms
Online Access:	https://ink.library.smu.edu.sg/sis_research/9142 https://ink.library.smu.edu.sg/context/sis_research/article/10145/viewcontent/Neural_Regret_Matching_pvoa.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-10145
record_format	dspace
spelling	sg-smu-ink.sis_research-101452024-08-01T09:23:27Z Neural regret-matching for distributed constraint optimization problems DENG, Yanchen YU, Runshen WANG, Xinrun AN, Bo Distributed constraint optimization problems (DCOPs) are a powerful model for multi-agent coordination and optimization, where information and controls are distributed among multiple agents by nature. Sampling-based algorithms are important incomplete techniques for solving medium-scale DCOPs. However, they use tables to exactly store all the information (e.g., costs, confidence bounds) to facilitate sampling, which limits their scalability. This paper tackles the limitation by incorporating deep neural networks in solving DCOPs for the first time and presents a neural-based sampling scheme built upon regret-matching. In the algorithm, each agent trains a neural network to approximate the regret related to its local problem and performs sampling according to the estimated regret. Furthermore, to ensure exploration we propose a regret rounding scheme that rounds small regret values to positive numbers. We theoretically show the regret bound of our algorithm and extensive evaluations indicate that our algorithm can scale up to large-scale DCOPs and significantly outperform the state-of-the-art methods. 2021-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9142 info:doi/10.24963/ijcai.2021/21 https://ink.library.smu.edu.sg/context/sis_research/article/10145/viewcontent/Neural_Regret_Matching_pvoa.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Agent-based and Multi-agent Systems Coordination and Cooperation Constraints and SAT: Constraint OptimizationConstraints and SAT Distributed Constraints Artificial Intelligence and Robotics Theory and Algorithms
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Agent-based and Multi-agent Systems Coordination and Cooperation Constraints and SAT: Constraint OptimizationConstraints and SAT Distributed Constraints Artificial Intelligence and Robotics Theory and Algorithms
spellingShingle	Agent-based and Multi-agent Systems Coordination and Cooperation Constraints and SAT: Constraint OptimizationConstraints and SAT Distributed Constraints Artificial Intelligence and Robotics Theory and Algorithms DENG, Yanchen YU, Runshen WANG, Xinrun AN, Bo Neural regret-matching for distributed constraint optimization problems
description	Distributed constraint optimization problems (DCOPs) are a powerful model for multi-agent coordination and optimization, where information and controls are distributed among multiple agents by nature. Sampling-based algorithms are important incomplete techniques for solving medium-scale DCOPs. However, they use tables to exactly store all the information (e.g., costs, confidence bounds) to facilitate sampling, which limits their scalability. This paper tackles the limitation by incorporating deep neural networks in solving DCOPs for the first time and presents a neural-based sampling scheme built upon regret-matching. In the algorithm, each agent trains a neural network to approximate the regret related to its local problem and performs sampling according to the estimated regret. Furthermore, to ensure exploration we propose a regret rounding scheme that rounds small regret values to positive numbers. We theoretically show the regret bound of our algorithm and extensive evaluations indicate that our algorithm can scale up to large-scale DCOPs and significantly outperform the state-of-the-art methods.
format	text
author	DENG, Yanchen YU, Runshen WANG, Xinrun AN, Bo
author_facet	DENG, Yanchen YU, Runshen WANG, Xinrun AN, Bo
author_sort	DENG, Yanchen
title	Neural regret-matching for distributed constraint optimization problems
title_short	Neural regret-matching for distributed constraint optimization problems
title_full	Neural regret-matching for distributed constraint optimization problems
title_fullStr	Neural regret-matching for distributed constraint optimization problems
title_full_unstemmed	Neural regret-matching for distributed constraint optimization problems
title_sort	neural regret-matching for distributed constraint optimization problems
publisher	Institutional Knowledge at Singapore Management University
publishDate	2021
url	https://ink.library.smu.edu.sg/sis_research/9142 https://ink.library.smu.edu.sg/context/sis_research/article/10145/viewcontent/Neural_Regret_Matching_pvoa.pdf
_version_	1814047754227286016

Neural regret-matching for distributed constraint optimization problems

Similar Items