Solving risk-sensitive POMDPs with and without cost observations

Partially Observable Markov Decision Processes (POMDPs) are often used to model planning problems under uncertainty. The goal in Risk-Sensitive POMDPs (RS-POMDPs) is to find a policy that maximizes the probability that the cumulative cost is within some user-defined cost threshold. In this paper, un...

Full description

Saved in:
Bibliographic Details
Main Authors: HOU, Ping, YEOH, William, Pradeep VARAKANTHAM
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2016
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/3605
https://ink.library.smu.edu.sg/context/sis_research/article/4606/viewcontent/RSPOMDP.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-4606
record_format dspace
spelling sg-smu-ink.sis_research-46062017-04-10T08:24:35Z Solving risk-sensitive POMDPs with and without cost observations HOU, Ping YEOH, William Pradeep VARAKANTHAM, Partially Observable Markov Decision Processes (POMDPs) are often used to model planning problems under uncertainty. The goal in Risk-Sensitive POMDPs (RS-POMDPs) is to find a policy that maximizes the probability that the cumulative cost is within some user-defined cost threshold. In this paper, unlike existing POMDP literature, we distinguish between the two cases of whether costs can or cannot be observed and show the empirical impact of cost observations. We also introduce a new search-based algorithm to solve RS-POMDPs and show that it is faster and more scalable than existing approaches in two synthetic domains and a taxi domain generated with real-world data. 2016-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/3605 https://ink.library.smu.edu.sg/context/sis_research/article/4606/viewcontent/RSPOMDP.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Theory and Algorithms
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Theory and Algorithms
spellingShingle Theory and Algorithms
HOU, Ping
YEOH, William
Pradeep VARAKANTHAM,
Solving risk-sensitive POMDPs with and without cost observations
description Partially Observable Markov Decision Processes (POMDPs) are often used to model planning problems under uncertainty. The goal in Risk-Sensitive POMDPs (RS-POMDPs) is to find a policy that maximizes the probability that the cumulative cost is within some user-defined cost threshold. In this paper, unlike existing POMDP literature, we distinguish between the two cases of whether costs can or cannot be observed and show the empirical impact of cost observations. We also introduce a new search-based algorithm to solve RS-POMDPs and show that it is faster and more scalable than existing approaches in two synthetic domains and a taxi domain generated with real-world data.
format text
author HOU, Ping
YEOH, William
Pradeep VARAKANTHAM,
author_facet HOU, Ping
YEOH, William
Pradeep VARAKANTHAM,
author_sort HOU, Ping
title Solving risk-sensitive POMDPs with and without cost observations
title_short Solving risk-sensitive POMDPs with and without cost observations
title_full Solving risk-sensitive POMDPs with and without cost observations
title_fullStr Solving risk-sensitive POMDPs with and without cost observations
title_full_unstemmed Solving risk-sensitive POMDPs with and without cost observations
title_sort solving risk-sensitive pomdps with and without cost observations
publisher Institutional Knowledge at Singapore Management University
publishDate 2016
url https://ink.library.smu.edu.sg/sis_research/3605
https://ink.library.smu.edu.sg/context/sis_research/article/4606/viewcontent/RSPOMDP.pdf
_version_ 1770573344849002496