Anytime Planning for Decentralized POMDPs using Expectation Maximization

Decentralized POMDPs provide an expressive framework for multi-agent sequential decision making. While finite-horizon DECPOMDPs have enjoyed signifcant success, progress remains slow for the infinite-horizon case mainly due to the inherent complexity of optimizing stochastic controllers representing...

Full description

Saved in:
Bibliographic Details
Main Authors: KUMAR, Akshat, ZILBERSTEIN, Shlomo
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2010
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/2209
https://ink.library.smu.edu.sg/context/sis_research/article/3209/viewcontent/Anytime_Planning_for_Decentralized_POMDPs_using_Expectation_Maximization.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-3209
record_format dspace
spelling sg-smu-ink.sis_research-32092018-06-26T05:19:12Z Anytime Planning for Decentralized POMDPs using Expectation Maximization KUMAR, Akshat ZILBERSTEIN, Shlomo Decentralized POMDPs provide an expressive framework for multi-agent sequential decision making. While finite-horizon DECPOMDPs have enjoyed signifcant success, progress remains slow for the infinite-horizon case mainly due to the inherent complexity of optimizing stochastic controllers representing agent policies. We present a promising new class of algorithms for the infinite-horizon case, which recasts the optimization problem as inference in a mixture of DBNs. An attractive feature of this approach is the straightforward adoption of existing inference techniques in DBNs for solving DEC-POMDPs and supporting richer representations such as factored or continuous states and actions. We also derive the Expectation Maximization (EM) algorithm to optimize the joint policy represented as DBNs. Experiments on benchmark domains show that EM compares favorably against the state-of-the-art solvers. 2010-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2209 https://ink.library.smu.edu.sg/context/sis_research/article/3209/viewcontent/Anytime_Planning_for_Decentralized_POMDPs_using_Expectation_Maximization.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Artificial Intelligence and Robotics
Operations Research, Systems Engineering and Industrial Engineering
spellingShingle Artificial Intelligence and Robotics
Operations Research, Systems Engineering and Industrial Engineering
KUMAR, Akshat
ZILBERSTEIN, Shlomo
Anytime Planning for Decentralized POMDPs using Expectation Maximization
description Decentralized POMDPs provide an expressive framework for multi-agent sequential decision making. While finite-horizon DECPOMDPs have enjoyed signifcant success, progress remains slow for the infinite-horizon case mainly due to the inherent complexity of optimizing stochastic controllers representing agent policies. We present a promising new class of algorithms for the infinite-horizon case, which recasts the optimization problem as inference in a mixture of DBNs. An attractive feature of this approach is the straightforward adoption of existing inference techniques in DBNs for solving DEC-POMDPs and supporting richer representations such as factored or continuous states and actions. We also derive the Expectation Maximization (EM) algorithm to optimize the joint policy represented as DBNs. Experiments on benchmark domains show that EM compares favorably against the state-of-the-art solvers.
format text
author KUMAR, Akshat
ZILBERSTEIN, Shlomo
author_facet KUMAR, Akshat
ZILBERSTEIN, Shlomo
author_sort KUMAR, Akshat
title Anytime Planning for Decentralized POMDPs using Expectation Maximization
title_short Anytime Planning for Decentralized POMDPs using Expectation Maximization
title_full Anytime Planning for Decentralized POMDPs using Expectation Maximization
title_fullStr Anytime Planning for Decentralized POMDPs using Expectation Maximization
title_full_unstemmed Anytime Planning for Decentralized POMDPs using Expectation Maximization
title_sort anytime planning for decentralized pomdps using expectation maximization
publisher Institutional Knowledge at Singapore Management University
publishDate 2010
url https://ink.library.smu.edu.sg/sis_research/2209
https://ink.library.smu.edu.sg/context/sis_research/article/3209/viewcontent/Anytime_Planning_for_Decentralized_POMDPs_using_Expectation_Maximization.pdf
_version_ 1770571884085116928