Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions

Decentralized partially observable MDPs (DEC-POMDPs) provide a rich framework for modeling decision making by a team of agents. Despite rapid progress in this area, the limited scalability of solution techniques has restricted the applicability of the model. To overcome this computational barrier, r...

Full description

Saved in:

Bibliographic Details
Main Authors:	KUMAR, Akshat, ZILBERSTEIN, Shlomo
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2009
Subjects:	Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering
Online Access:	https://ink.library.smu.edu.sg/sis_research/2212 https://ink.library.smu.edu.sg/context/sis_research/article/3212/viewcontent/Constraint_Based_Dynamic_Programming_for_Decentralized_POMDPs_with_Structured_Interactions.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-3212
record_format	dspace
spelling	sg-smu-ink.sis_research-32122018-07-13T03:43:27Z Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions KUMAR, Akshat ZILBERSTEIN, Shlomo Decentralized partially observable MDPs (DEC-POMDPs) provide a rich framework for modeling decision making by a team of agents. Despite rapid progress in this area, the limited scalability of solution techniques has restricted the applicability of the model. To overcome this computational barrier, research has focused on restricted classes of DEC-POMDPs, which are easier to solve yet rich enough to capture many practical problems. We present CBDP, an efficient and scalable point-based dynamic programming algorithm for one such model called ND-POMDP (Network Distributed POMDP). Specifically, CBDP provides magnitudes of speedup in the policy computation and generates better quality solution for all test instances. It has linear complexity in the number of agents and horizon length. Furthermore, the complexity per horizon for the examined class of problems is exponential only in a small parameter that depends upon the interaction among the agents, achieving significant scalability for large, loosely coupled multi-agent systems. The efficiency of CBDP lies in exploiting the structure of interactions using constraint networks. These results extend significantly the effectiveness of decision-theoretic planning in multi-agent settings. 2009-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2212 https://ink.library.smu.edu.sg/context/sis_research/article/3212/viewcontent/Constraint_Based_Dynamic_Programming_for_Decentralized_POMDPs_with_Structured_Interactions.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering
spellingShingle	Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering KUMAR, Akshat ZILBERSTEIN, Shlomo Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
description	Decentralized partially observable MDPs (DEC-POMDPs) provide a rich framework for modeling decision making by a team of agents. Despite rapid progress in this area, the limited scalability of solution techniques has restricted the applicability of the model. To overcome this computational barrier, research has focused on restricted classes of DEC-POMDPs, which are easier to solve yet rich enough to capture many practical problems. We present CBDP, an efficient and scalable point-based dynamic programming algorithm for one such model called ND-POMDP (Network Distributed POMDP). Specifically, CBDP provides magnitudes of speedup in the policy computation and generates better quality solution for all test instances. It has linear complexity in the number of agents and horizon length. Furthermore, the complexity per horizon for the examined class of problems is exponential only in a small parameter that depends upon the interaction among the agents, achieving significant scalability for large, loosely coupled multi-agent systems. The efficiency of CBDP lies in exploiting the structure of interactions using constraint networks. These results extend significantly the effectiveness of decision-theoretic planning in multi-agent settings.
format	text
author	KUMAR, Akshat ZILBERSTEIN, Shlomo
author_facet	KUMAR, Akshat ZILBERSTEIN, Shlomo
author_sort	KUMAR, Akshat
title	Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
title_short	Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
title_full	Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
title_fullStr	Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
title_full_unstemmed	Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
title_sort	constraint-based dynamic programming for decentralized pomdps with structured interactions
publisher	Institutional Knowledge at Singapore Management University
publishDate	2009
url	https://ink.library.smu.edu.sg/sis_research/2212 https://ink.library.smu.edu.sg/context/sis_research/article/3212/viewcontent/Constraint_Based_Dynamic_Programming_for_Decentralized_POMDPs_with_Structured_Interactions.pdf
_version_	1770571884909297664

Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions

Similar Items