History-Based Controller Design and Optimization for Partially Observable MDPs
Partially observable MDPs provide an elegant framework forsequential decision making. Finite-state controllers (FSCs) are often used to represent policies for infinite-horizon problems as they offer a compact representation, simple-to-execute plans, and adjustable tradeoff between computational comp...
Saved in:
Main Authors: | , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2015
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2915 https://ink.library.smu.edu.sg/context/sis_research/article/3915/viewcontent/History_Based_Controller_Design_afv.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-3915 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-39152018-06-25T09:19:48Z History-Based Controller Design and Optimization for Partially Observable MDPs KUMAR, Akshat ZILBERSTEIN, Shlomo Partially observable MDPs provide an elegant framework forsequential decision making. Finite-state controllers (FSCs) are often used to represent policies for infinite-horizon problems as they offer a compact representation, simple-to-execute plans, and adjustable tradeoff between computational complexityand policy size. We develop novel connections between optimizing FSCs for POMDPs and the dual linear programfor MDPs. Building on that, we present a dual mixed integer linear program (MIP) for optimizing FSCs. To assign well-defined meaning to FSC nodes as well as aid in policy search, we show how to associate history-based features with each FSC node. Using this representation, we address another challenging problem, that of iteratively deciding which nodes to add to FSC to get a better policy. Using an efficient off-the-shelf MIP solver, we show that this new approach can find compact near-optimal FSCs for severallarge benchmark domains, and is competitive with previous best approaches. 2015-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2915 https://ink.library.smu.edu.sg/context/sis_research/article/3915/viewcontent/History_Based_Controller_Design_afv.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Computer Sciences Operations Research, Systems Engineering and Industrial Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Artificial Intelligence and Robotics Computer Sciences Operations Research, Systems Engineering and Industrial Engineering |
spellingShingle |
Artificial Intelligence and Robotics Computer Sciences Operations Research, Systems Engineering and Industrial Engineering KUMAR, Akshat ZILBERSTEIN, Shlomo History-Based Controller Design and Optimization for Partially Observable MDPs |
description |
Partially observable MDPs provide an elegant framework forsequential decision making. Finite-state controllers (FSCs) are often used to represent policies for infinite-horizon problems as they offer a compact representation, simple-to-execute plans, and adjustable tradeoff between computational complexityand policy size. We develop novel connections between optimizing FSCs for POMDPs and the dual linear programfor MDPs. Building on that, we present a dual mixed integer linear program (MIP) for optimizing FSCs. To assign well-defined meaning to FSC nodes as well as aid in policy search, we show how to associate history-based features with each FSC node. Using this representation, we address another challenging problem, that of iteratively deciding which nodes to add to FSC to get a better policy. Using an efficient off-the-shelf MIP solver, we show that this new approach can find compact near-optimal FSCs for severallarge benchmark domains, and is competitive with previous best approaches. |
format |
text |
author |
KUMAR, Akshat ZILBERSTEIN, Shlomo |
author_facet |
KUMAR, Akshat ZILBERSTEIN, Shlomo |
author_sort |
KUMAR, Akshat |
title |
History-Based Controller Design and Optimization for Partially Observable MDPs |
title_short |
History-Based Controller Design and Optimization for Partially Observable MDPs |
title_full |
History-Based Controller Design and Optimization for Partially Observable MDPs |
title_fullStr |
History-Based Controller Design and Optimization for Partially Observable MDPs |
title_full_unstemmed |
History-Based Controller Design and Optimization for Partially Observable MDPs |
title_sort |
history-based controller design and optimization for partially observable mdps |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2015 |
url |
https://ink.library.smu.edu.sg/sis_research/2915 https://ink.library.smu.edu.sg/context/sis_research/article/3915/viewcontent/History_Based_Controller_Design_afv.pdf |
_version_ |
1770572735323308032 |