Point-Based Backup for Decentralized POMPDs: Complexity and New Algorithms

Decentralized POMDPs provide an expressive framework for sequential multi-agent decision making. Despite their high complexity, there has been significant progress in scaling up existing algorithms, largely due to the use of point-based methods. Performing point-based backup is a fundamental operati...

Full description

Saved in:
Bibliographic Details
Main Authors: KUMAR, Akshat, ZILBERSTEIN, Shlomo
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2010
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2210
https://ink.library.smu.edu.sg/context/sis_research/article/3210/viewcontent/Point_Based_Backup_for_Decentralized_POMPDs__Complexity_and_New_Algorithms.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3210
record_format dspace
spelling sg-smu-ink.sis_research-32102018-07-13T03:43:45Z Point-Based Backup for Decentralized POMPDs: Complexity and New Algorithms KUMAR, Akshat ZILBERSTEIN, Shlomo Decentralized POMDPs provide an expressive framework for sequential multi-agent decision making. Despite their high complexity, there has been significant progress in scaling up existing algorithms, largely due to the use of point-based methods. Performing point-based backup is a fundamental operation in state-of-the-art algorithms. We show that even a single backup step in the multi-agent setting is NP-Complete. Despite this negative worst-case result, we present an efficient and scalable optimal algorithm as well as a principled approximation scheme. The optimal algorithm exploits recent advances in the weighted CSP literature to overcome the complexity of the backup operation. The polytime approximation scheme provides a constant factor approximation guarantee based on the number of belief points. In experiments on standard domains, the optimal approach provides significant speedup (up to 2 orders of magnitude) over the previous best optimal algorithm and is able to increase the number of belief points by more than a factor of 3. The approximation scheme also works well in practice, providing near-optimal solutions to the backup problem. 2010-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2210 https://ink.library.smu.edu.sg/context/sis_research/article/3210/viewcontent/Point_Based_Backup_for_Decentralized_POMPDs__Complexity_and_New_Algorithms.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Artificial Intelligence and Robotics
Operations Research, Systems Engineering and Industrial Engineering
spellingShingle Artificial Intelligence and Robotics
Operations Research, Systems Engineering and Industrial Engineering
KUMAR, Akshat
ZILBERSTEIN, Shlomo
Point-Based Backup for Decentralized POMPDs: Complexity and New Algorithms
description Decentralized POMDPs provide an expressive framework for sequential multi-agent decision making. Despite their high complexity, there has been significant progress in scaling up existing algorithms, largely due to the use of point-based methods. Performing point-based backup is a fundamental operation in state-of-the-art algorithms. We show that even a single backup step in the multi-agent setting is NP-Complete. Despite this negative worst-case result, we present an efficient and scalable optimal algorithm as well as a principled approximation scheme. The optimal algorithm exploits recent advances in the weighted CSP literature to overcome the complexity of the backup operation. The polytime approximation scheme provides a constant factor approximation guarantee based on the number of belief points. In experiments on standard domains, the optimal approach provides significant speedup (up to 2 orders of magnitude) over the previous best optimal algorithm and is able to increase the number of belief points by more than a factor of 3. The approximation scheme also works well in practice, providing near-optimal solutions to the backup problem.
format text
author KUMAR, Akshat
ZILBERSTEIN, Shlomo
author_facet KUMAR, Akshat
ZILBERSTEIN, Shlomo
author_sort KUMAR, Akshat
title Point-Based Backup for Decentralized POMPDs: Complexity and New Algorithms
title_short Point-Based Backup for Decentralized POMPDs: Complexity and New Algorithms
title_full Point-Based Backup for Decentralized POMPDs: Complexity and New Algorithms
title_fullStr Point-Based Backup for Decentralized POMPDs: Complexity and New Algorithms
title_full_unstemmed Point-Based Backup for Decentralized POMPDs: Complexity and New Algorithms
title_sort point-based backup for decentralized pompds: complexity and new algorithms
publisher Institutional Knowledge at Singapore Management University
publishDate 2010
url https://ink.library.smu.edu.sg/sis_research/2210
https://ink.library.smu.edu.sg/context/sis_research/article/3210/viewcontent/Point_Based_Backup_for_Decentralized_POMPDs__Complexity_and_New_Algorithms.pdf
_version_ 1770571884454215680