Revisiting Risk-Sensitive MDPs: New Algorithms and Results
While Markov Decision Processes (MDPs) have been shown to be effective models for planning under uncertainty, theobjective to minimize the expected cumulative cost is inappropriate for high-stake planning problems. As such, Yu, Lin, and Yan (1998) introduced the Risk-Sensitive MDP (RSMDP) model, whe...
Saved in:
Main Authors: | , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2014
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2089 https://ink.library.smu.edu.sg/context/sis_research/article/3088/viewcontent/icaps12_rsmdp.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-3088 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-30882015-11-16T05:25:54Z Revisiting Risk-Sensitive MDPs: New Algorithms and Results HOU, Ping YEOH, William VARAKANTHAM, Pradeep Reddy While Markov Decision Processes (MDPs) have been shown to be effective models for planning under uncertainty, theobjective to minimize the expected cumulative cost is inappropriate for high-stake planning problems. As such, Yu, Lin, and Yan (1998) introduced the Risk-Sensitive MDP (RSMDP) model, where the objective is to find a policy that maximizes the probability that the cumulative cost is within some user-defined cost threshold. In this paper, we revisit this problem and introduce new algorithms that are based on classical techniques, such as depth-first search and dynamic programming, and a recently introduced technique called Topological Value Iteration (TVI). We demonstrate the applicability of our approach on randomly generated MDPs as well as domains from the ICAPS 2011 International Probabilistic Planning Competition (IPPC). 2014-06-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/2089 https://ink.library.smu.edu.sg/context/sis_research/article/3088/viewcontent/icaps12_rsmdp.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering |
spellingShingle |
Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering HOU, Ping YEOH, William VARAKANTHAM, Pradeep Reddy Revisiting Risk-Sensitive MDPs: New Algorithms and Results |
description |
While Markov Decision Processes (MDPs) have been shown to be effective models for planning under uncertainty, theobjective to minimize the expected cumulative cost is inappropriate for high-stake planning problems. As such, Yu, Lin, and Yan (1998) introduced the Risk-Sensitive MDP (RSMDP) model, where the objective is to find a policy that maximizes the probability that the cumulative cost is within some user-defined cost threshold. In this paper, we revisit this problem and introduce new algorithms that are based on classical techniques, such as depth-first search and dynamic programming, and a recently introduced technique called Topological Value Iteration (TVI). We demonstrate the applicability of our approach on randomly generated MDPs as well as domains from the ICAPS 2011 International Probabilistic Planning Competition (IPPC). |
format |
text |
author |
HOU, Ping YEOH, William VARAKANTHAM, Pradeep Reddy |
author_facet |
HOU, Ping YEOH, William VARAKANTHAM, Pradeep Reddy |
author_sort |
HOU, Ping |
title |
Revisiting Risk-Sensitive MDPs: New Algorithms and Results |
title_short |
Revisiting Risk-Sensitive MDPs: New Algorithms and Results |
title_full |
Revisiting Risk-Sensitive MDPs: New Algorithms and Results |
title_fullStr |
Revisiting Risk-Sensitive MDPs: New Algorithms and Results |
title_full_unstemmed |
Revisiting Risk-Sensitive MDPs: New Algorithms and Results |
title_sort |
revisiting risk-sensitive mdps: new algorithms and results |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2014 |
url |
https://ink.library.smu.edu.sg/sis_research/2089 https://ink.library.smu.edu.sg/context/sis_research/article/3088/viewcontent/icaps12_rsmdp.pdf |
_version_ |
1770571795782434816 |