Not All Agents are Equal: Scaling up Distributed POMDPs for Agent Networks

Many applications of networks of agents, including mobile sensor networks, unmanned air vehicles, autonomous underwater vehicles, involve 100s of agents acting collaboratively under uncertainty. Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are well-suited to address...

Full description

Saved in:
Bibliographic Details
Main Authors: MARECKI, Janusz, GUPTA, Tapana, VARAKANTHAM, Pradeep Reddy, Tambe, Milind, Yokoo, Makoto
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2008
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/950
http://dl.acm.org/citation.cfm?id=1402453
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-1949
record_format dspace
spelling sg-smu-ink.sis_research-19492010-12-15T08:06:06Z Not All Agents are Equal: Scaling up Distributed POMDPs for Agent Networks MARECKI, Janusz GUPTA, Tapana VARAKANTHAM, Pradeep Reddy Tambe, Milind Yokoo, Makoto Many applications of networks of agents, including mobile sensor networks, unmanned air vehicles, autonomous underwater vehicles, involve 100s of agents acting collaboratively under uncertainty. Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are well-suited to address such applications, but so far, only limited scale-ups of up to five agents have been demonstrated. This paper escalates the scale-up, presenting an algorithm called FANS, increasing the number of agents in distributed POMDPs for the first time into double digits. FANS is founded on finite state machines (FSMs) for policy representation and expoits these FSMs to provide three key contributions: (i) Not all agents within an agent network need the same expressivity of policy representation; FANS introduces novel heuristics to automatically vary the FSM size in different agents for scaleup; (ii) FANS illustrates efficient integration of its FSM-based policy search within algorithms that exploit agent network structure; (iii) FANS provides significant speedups in policy evaluation and heuristic computations within the network algorithms by exploiting the FSMs for dynamic programming. Experimental results show not only orders of magnitude improvements over previous best known algorithms for smaller-scale domains (with similar solution quality), but also a scale-up into double digits in terms of numbers of agents. 2008-01-01T08:00:00Z text https://ink.library.smu.edu.sg/sis_research/950 http://dl.acm.org/citation.cfm?id=1402453 Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Computer Sciences
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Computer Sciences
spellingShingle Computer Sciences
MARECKI, Janusz
GUPTA, Tapana
VARAKANTHAM, Pradeep Reddy
Tambe, Milind
Yokoo, Makoto
Not All Agents are Equal: Scaling up Distributed POMDPs for Agent Networks
description Many applications of networks of agents, including mobile sensor networks, unmanned air vehicles, autonomous underwater vehicles, involve 100s of agents acting collaboratively under uncertainty. Distributed Partially Observable Markov Decision Problems (Distributed POMDPs) are well-suited to address such applications, but so far, only limited scale-ups of up to five agents have been demonstrated. This paper escalates the scale-up, presenting an algorithm called FANS, increasing the number of agents in distributed POMDPs for the first time into double digits. FANS is founded on finite state machines (FSMs) for policy representation and expoits these FSMs to provide three key contributions: (i) Not all agents within an agent network need the same expressivity of policy representation; FANS introduces novel heuristics to automatically vary the FSM size in different agents for scaleup; (ii) FANS illustrates efficient integration of its FSM-based policy search within algorithms that exploit agent network structure; (iii) FANS provides significant speedups in policy evaluation and heuristic computations within the network algorithms by exploiting the FSMs for dynamic programming. Experimental results show not only orders of magnitude improvements over previous best known algorithms for smaller-scale domains (with similar solution quality), but also a scale-up into double digits in terms of numbers of agents.
format text
author MARECKI, Janusz
GUPTA, Tapana
VARAKANTHAM, Pradeep Reddy
Tambe, Milind
Yokoo, Makoto
author_facet MARECKI, Janusz
GUPTA, Tapana
VARAKANTHAM, Pradeep Reddy
Tambe, Milind
Yokoo, Makoto
author_sort MARECKI, Janusz
title Not All Agents are Equal: Scaling up Distributed POMDPs for Agent Networks
title_short Not All Agents are Equal: Scaling up Distributed POMDPs for Agent Networks
title_full Not All Agents are Equal: Scaling up Distributed POMDPs for Agent Networks
title_fullStr Not All Agents are Equal: Scaling up Distributed POMDPs for Agent Networks
title_full_unstemmed Not All Agents are Equal: Scaling up Distributed POMDPs for Agent Networks
title_sort not all agents are equal: scaling up distributed pomdps for agent networks
publisher Institutional Knowledge at Singapore Management University
publishDate 2008
url https://ink.library.smu.edu.sg/sis_research/950
http://dl.acm.org/citation.cfm?id=1402453
_version_ 1770570790765330432