Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents

The use of distributed POMDPs for cooperative teams has been severely limited by the incredibly large joint policy- space that results from combining the policy-spaces of the individual agents. However, much of the computational cost of exploring the entire joint policy space can be avoided by obser...

Full description

Saved in:

Bibliographic Details
Main Authors:	VELAGAPUDI, Prasanna, VARAKANTHAM, Pradeep Reddy, Sycara, Katia, Scerri, Paul
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2011
Subjects:	DEC-POMDP Uncertainty Multi-agent systems Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering
Online Access:	https://ink.library.smu.edu.sg/sis_research/1342 https://ink.library.smu.edu.sg/context/sis_research/article/2341/viewcontent/D_TREMOR.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-2341
record_format	dspace
spelling	sg-smu-ink.sis_research-23412017-11-19T13:52:14Z Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents VELAGAPUDI, Prasanna VARAKANTHAM, Pradeep Reddy Sycara, Katia Scerri, Paul The use of distributed POMDPs for cooperative teams has been severely limited by the incredibly large joint policy- space that results from combining the policy-spaces of the individual agents. However, much of the computational cost of exploring the entire joint policy space can be avoided by observing that in many domains important interactions between agents occur in a relatively small set of scenarios, previously defined as coordination locales (CLs) [11]. Moreover, even when numerous interactions might occur, given a set of individual policies there are relatively few actual interactions. Exploiting this observation and building on an existing model shaping algorithm, this paper presents D-TREMOR, an algorithm in which cooperative agents iteratively generate individual policies, identify and communicate possible interactions between their policies, shape their models based on this information and generate new policies. D-TREMOR has three properties that jointly distinguish it from previous DEC-POMDP work: (1) it is completely distributed; (2) it is scalable (allowing 100 agents to compute a \good" joint policy in under 6 hours) and (3) it has low communication overhead. D-TREMOR complements these traits with the following key contributions, which ensure improved scalability and solution quality: (a) techniques to ensure convergence; (b) faster approaches to detect and evaluate CLs; (c) heuristics to capture dependencies between CLs; and (d) novel shaping heuristics to aggregate effects of CLs. While the resulting policies are not globally optimal, empirical results show that agents have policies that effectively manage uncertainty and the joint policy is better than policies generated by independent solvers. 2011-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/1342 https://ink.library.smu.edu.sg/context/sis_research/article/2341/viewcontent/D_TREMOR.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University DEC-POMDP Uncertainty Multi-agent systems Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	DEC-POMDP Uncertainty Multi-agent systems Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering
spellingShingle	DEC-POMDP Uncertainty Multi-agent systems Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering VELAGAPUDI, Prasanna VARAKANTHAM, Pradeep Reddy Sycara, Katia Scerri, Paul Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents
description	The use of distributed POMDPs for cooperative teams has been severely limited by the incredibly large joint policy- space that results from combining the policy-spaces of the individual agents. However, much of the computational cost of exploring the entire joint policy space can be avoided by observing that in many domains important interactions between agents occur in a relatively small set of scenarios, previously defined as coordination locales (CLs) [11]. Moreover, even when numerous interactions might occur, given a set of individual policies there are relatively few actual interactions. Exploiting this observation and building on an existing model shaping algorithm, this paper presents D-TREMOR, an algorithm in which cooperative agents iteratively generate individual policies, identify and communicate possible interactions between their policies, shape their models based on this information and generate new policies. D-TREMOR has three properties that jointly distinguish it from previous DEC-POMDP work: (1) it is completely distributed; (2) it is scalable (allowing 100 agents to compute a \good" joint policy in under 6 hours) and (3) it has low communication overhead. D-TREMOR complements these traits with the following key contributions, which ensure improved scalability and solution quality: (a) techniques to ensure convergence; (b) faster approaches to detect and evaluate CLs; (c) heuristics to capture dependencies between CLs; and (d) novel shaping heuristics to aggregate effects of CLs. While the resulting policies are not globally optimal, empirical results show that agents have policies that effectively manage uncertainty and the joint policy is better than policies generated by independent solvers.
format	text
author	VELAGAPUDI, Prasanna VARAKANTHAM, Pradeep Reddy Sycara, Katia Scerri, Paul
author_facet	VELAGAPUDI, Prasanna VARAKANTHAM, Pradeep Reddy Sycara, Katia Scerri, Paul
author_sort	VELAGAPUDI, Prasanna
title	Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents
title_short	Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents
title_full	Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents
title_fullStr	Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents
title_full_unstemmed	Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents
title_sort	distributed model shaping for scaling to decentralized pomdps with hundreds of agents
publisher	Institutional Knowledge at Singapore Management University
publishDate	2011
url	https://ink.library.smu.edu.sg/sis_research/1342 https://ink.library.smu.edu.sg/context/sis_research/article/2341/viewcontent/D_TREMOR.pdf
_version_	1770570972413296640

Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents

Similar Items