Exploiting anonymity and homogeneity in factored Dec-MDPs through pre-computed binomial distributions

Recent work in decentralized stochastic planning for cooperative agents has focussed on exploiting omogeneity of agents and anonymity in interactions to solve problems with large numbers of agents. Due to a linear optimization formulation that computes joint policy and an objective that indirectly a...

Full description

Saved in:

Bibliographic Details
Main Authors:	RANJAN KUMAR, Rajiv, VARAKANTHAM, Pradeep
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2017
Subjects:	Theory and Algorithms
Online Access:	https://ink.library.smu.edu.sg/sis_research/3936 https://ink.library.smu.edu.sg/context/sis_research/article/4938/viewcontent/p732.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-4938
record_format	dspace
spelling	sg-smu-ink.sis_research-49382018-02-08T01:23:24Z Exploiting anonymity and homogeneity in factored Dec-MDPs through pre-computed binomial distributions RANJAN KUMAR, Rajiv VARAKANTHAM, Pradeep Recent work in decentralized stochastic planning for cooperative agents has focussed on exploiting omogeneity of agents and anonymity in interactions to solve problems with large numbers of agents. Due to a linear optimization formulation that computes joint policy and an objective that indirectly approximates joint expected reward with reward for expected number of agents in all state, action pairs, these approaches have ensured improved scalability. Such an objective closely approximates joint expected reward when there are many agents, due to law of large numbers. However, the performance deteriorates in problems with fewer agents. In this paper, we improve on the previous line of work by providing a linear optimization formulation that employs a more direct approximation of joint expected reward. The new approximation is based on offline computation of binomial distributions. Our new technique is not only able to improve quality performance on problems with large numbers of agents, but is able to perform on par with existing best approaches on problems with fewer agents. This is achieved without sacrificing on scalability/run-time performance of previous work. 2017-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3936 https://ink.library.smu.edu.sg/context/sis_research/article/4938/viewcontent/p732.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Theory and Algorithms
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Theory and Algorithms
spellingShingle	Theory and Algorithms RANJAN KUMAR, Rajiv VARAKANTHAM, Pradeep Exploiting anonymity and homogeneity in factored Dec-MDPs through pre-computed binomial distributions
description	Recent work in decentralized stochastic planning for cooperative agents has focussed on exploiting omogeneity of agents and anonymity in interactions to solve problems with large numbers of agents. Due to a linear optimization formulation that computes joint policy and an objective that indirectly approximates joint expected reward with reward for expected number of agents in all state, action pairs, these approaches have ensured improved scalability. Such an objective closely approximates joint expected reward when there are many agents, due to law of large numbers. However, the performance deteriorates in problems with fewer agents. In this paper, we improve on the previous line of work by providing a linear optimization formulation that employs a more direct approximation of joint expected reward. The new approximation is based on offline computation of binomial distributions. Our new technique is not only able to improve quality performance on problems with large numbers of agents, but is able to perform on par with existing best approaches on problems with fewer agents. This is achieved without sacrificing on scalability/run-time performance of previous work.
format	text
author	RANJAN KUMAR, Rajiv VARAKANTHAM, Pradeep
author_facet	RANJAN KUMAR, Rajiv VARAKANTHAM, Pradeep
author_sort	RANJAN KUMAR, Rajiv
title	Exploiting anonymity and homogeneity in factored Dec-MDPs through pre-computed binomial distributions
title_short	Exploiting anonymity and homogeneity in factored Dec-MDPs through pre-computed binomial distributions
title_full	Exploiting anonymity and homogeneity in factored Dec-MDPs through pre-computed binomial distributions
title_fullStr	Exploiting anonymity and homogeneity in factored Dec-MDPs through pre-computed binomial distributions
title_full_unstemmed	Exploiting anonymity and homogeneity in factored Dec-MDPs through pre-computed binomial distributions
title_sort	exploiting anonymity and homogeneity in factored dec-mdps through pre-computed binomial distributions
publisher	Institutional Knowledge at Singapore Management University
publishDate	2017
url	https://ink.library.smu.edu.sg/sis_research/3936 https://ink.library.smu.edu.sg/context/sis_research/article/4938/viewcontent/p732.pdf
_version_	1770573998375043072

Exploiting anonymity and homogeneity in factored Dec-MDPs through pre-computed binomial distributions

Similar Items