Anytime Planning for Decentralized POMDPs using Expectation Maximization
Decentralized POMDPs provide an expressive framework for multi-agent sequential decision making. While finite-horizon DECPOMDPs have enjoyed signifcant success, progress remains slow for the infinite-horizon case mainly due to the inherent complexity of optimizing stochastic controllers representing...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2010
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2209 https://ink.library.smu.edu.sg/context/sis_research/article/3209/viewcontent/Anytime_Planning_for_Decentralized_POMDPs_using_Expectation_Maximization.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-3209 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-32092018-06-26T05:19:12Z Anytime Planning for Decentralized POMDPs using Expectation Maximization KUMAR, Akshat ZILBERSTEIN, Shlomo Decentralized POMDPs provide an expressive framework for multi-agent sequential decision making. While finite-horizon DECPOMDPs have enjoyed signifcant success, progress remains slow for the infinite-horizon case mainly due to the inherent complexity of optimizing stochastic controllers representing agent policies. We present a promising new class of algorithms for the infinite-horizon case, which recasts the optimization problem as inference in a mixture of DBNs. An attractive feature of this approach is the straightforward adoption of existing inference techniques in DBNs for solving DEC-POMDPs and supporting richer representations such as factored or continuous states and actions. We also derive the Expectation Maximization (EM) algorithm to optimize the joint policy represented as DBNs. Experiments on benchmark domains show that EM compares favorably against the state-of-the-art solvers. 2010-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2209 https://ink.library.smu.edu.sg/context/sis_research/article/3209/viewcontent/Anytime_Planning_for_Decentralized_POMDPs_using_Expectation_Maximization.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering |
spellingShingle |
Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering KUMAR, Akshat ZILBERSTEIN, Shlomo Anytime Planning for Decentralized POMDPs using Expectation Maximization |
description |
Decentralized POMDPs provide an expressive framework for multi-agent sequential decision making. While finite-horizon DECPOMDPs have enjoyed signifcant success, progress remains slow for the infinite-horizon case mainly due to the inherent complexity of optimizing stochastic controllers representing agent policies. We present a promising new class of algorithms for the infinite-horizon case, which recasts the optimization problem as inference in a mixture of DBNs. An attractive feature of this approach is the straightforward adoption of existing inference techniques in DBNs for solving DEC-POMDPs and supporting richer representations such as factored or continuous states and actions. We also derive the Expectation Maximization (EM) algorithm to optimize the joint policy represented as DBNs. Experiments on benchmark domains show that EM compares favorably against the state-of-the-art solvers. |
format |
text |
author |
KUMAR, Akshat ZILBERSTEIN, Shlomo |
author_facet |
KUMAR, Akshat ZILBERSTEIN, Shlomo |
author_sort |
KUMAR, Akshat |
title |
Anytime Planning for Decentralized POMDPs using Expectation Maximization |
title_short |
Anytime Planning for Decentralized POMDPs using Expectation Maximization |
title_full |
Anytime Planning for Decentralized POMDPs using Expectation Maximization |
title_fullStr |
Anytime Planning for Decentralized POMDPs using Expectation Maximization |
title_full_unstemmed |
Anytime Planning for Decentralized POMDPs using Expectation Maximization |
title_sort |
anytime planning for decentralized pomdps using expectation maximization |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2010 |
url |
https://ink.library.smu.edu.sg/sis_research/2209 https://ink.library.smu.edu.sg/context/sis_research/article/3209/viewcontent/Anytime_Planning_for_Decentralized_POMDPs_using_Expectation_Maximization.pdf |
_version_ |
1770571884085116928 |