Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions

Decentralized partially observable MDPs (DEC-POMDPs) provide a rich framework for modeling decision making by a team of agents. Despite rapid progress in this area, the limited scalability of solution techniques has restricted the applicability of the model. To overcome this computational barrier, r...

Full description

Saved in:
Bibliographic Details
Main Authors: KUMAR, Akshat, ZILBERSTEIN, Shlomo
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2009
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2212
https://ink.library.smu.edu.sg/context/sis_research/article/3212/viewcontent/Constraint_Based_Dynamic_Programming_for_Decentralized_POMDPs_with_Structured_Interactions.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3212
record_format dspace
spelling sg-smu-ink.sis_research-32122018-07-13T03:43:27Z Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions KUMAR, Akshat ZILBERSTEIN, Shlomo Decentralized partially observable MDPs (DEC-POMDPs) provide a rich framework for modeling decision making by a team of agents. Despite rapid progress in this area, the limited scalability of solution techniques has restricted the applicability of the model. To overcome this computational barrier, research has focused on restricted classes of DEC-POMDPs, which are easier to solve yet rich enough to capture many practical problems. We present CBDP, an efficient and scalable point-based dynamic programming algorithm for one such model called ND-POMDP (Network Distributed POMDP). Specifically, CBDP provides magnitudes of speedup in the policy computation and generates better quality solution for all test instances. It has linear complexity in the number of agents and horizon length. Furthermore, the complexity per horizon for the examined class of problems is exponential only in a small parameter that depends upon the interaction among the agents, achieving significant scalability for large, loosely coupled multi-agent systems. The efficiency of CBDP lies in exploiting the structure of interactions using constraint networks. These results extend significantly the effectiveness of decision-theoretic planning in multi-agent settings. 2009-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2212 https://ink.library.smu.edu.sg/context/sis_research/article/3212/viewcontent/Constraint_Based_Dynamic_Programming_for_Decentralized_POMDPs_with_Structured_Interactions.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Artificial Intelligence and Robotics
Operations Research, Systems Engineering and Industrial Engineering
spellingShingle Artificial Intelligence and Robotics
Operations Research, Systems Engineering and Industrial Engineering
KUMAR, Akshat
ZILBERSTEIN, Shlomo
Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
description Decentralized partially observable MDPs (DEC-POMDPs) provide a rich framework for modeling decision making by a team of agents. Despite rapid progress in this area, the limited scalability of solution techniques has restricted the applicability of the model. To overcome this computational barrier, research has focused on restricted classes of DEC-POMDPs, which are easier to solve yet rich enough to capture many practical problems. We present CBDP, an efficient and scalable point-based dynamic programming algorithm for one such model called ND-POMDP (Network Distributed POMDP). Specifically, CBDP provides magnitudes of speedup in the policy computation and generates better quality solution for all test instances. It has linear complexity in the number of agents and horizon length. Furthermore, the complexity per horizon for the examined class of problems is exponential only in a small parameter that depends upon the interaction among the agents, achieving significant scalability for large, loosely coupled multi-agent systems. The efficiency of CBDP lies in exploiting the structure of interactions using constraint networks. These results extend significantly the effectiveness of decision-theoretic planning in multi-agent settings.
format text
author KUMAR, Akshat
ZILBERSTEIN, Shlomo
author_facet KUMAR, Akshat
ZILBERSTEIN, Shlomo
author_sort KUMAR, Akshat
title Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
title_short Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
title_full Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
title_fullStr Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
title_full_unstemmed Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
title_sort constraint-based dynamic programming for decentralized pomdps with structured interactions
publisher Institutional Knowledge at Singapore Management University
publishDate 2009
url https://ink.library.smu.edu.sg/sis_research/2212
https://ink.library.smu.edu.sg/context/sis_research/article/3212/viewcontent/Constraint_Based_Dynamic_Programming_for_Decentralized_POMDPs_with_Structured_Interactions.pdf
_version_ 1770571884909297664