Sampling based approaches for minimizing regret in uncertain Markov Decision Problems (MDPs)
Markov Decision Processes (MDPs) are an effective model to represent decision processes in the presence of transitional uncertainty and reward tradeoffs. However, due to the difficulty in exactly specifying the transition and reward functions in MDPs, researchers have proposed uncertain MDP models a...
Saved in:
Main Authors: | AHMED, Asrar, VARAKANTHAM, Pradeep, LOWALEKAR, Meghna, ADULYASAK, Yossiri, JAILLET, Patrick |
---|---|
格式: | text |
語言: | English |
出版: |
Institutional Knowledge at Singapore Management University
2017
|
主題: | |
在線閱讀: | https://ink.library.smu.edu.sg/sis_research/3937 https://ink.library.smu.edu.sg/context/sis_research/article/4939/viewcontent/Sampling_based_approach_regret_MDP_JAIR_pv.pdf |
標簽: |
添加標簽
沒有標簽, 成為第一個標記此記錄!
|
機構: | Singapore Management University |
語言: | English |
相似書籍
-
Regret based Robust Solutions for Uncertain Markov Decision Processes
由: AHMED, Asrar, et al.
出版: (2013) -
Solving Uncertain MDPs with Objectives that are Separable over Instantiations of Model Uncertainty
由: ADULYASAK, Yossiri, et al.
出版: (2015) -
Exploiting anonymity and homogeneity in factored Dec-MDPs through pre-computed binomial distributions
由: RANJAN KUMAR, Rajiv, et al.
出版: (2017) -
Online spatio-temporal matching in stochastic and dynamic domains
由: LOWALEKAR, Meghna, et al.
出版: (2018) -
Decentralized Stochastic Planning with Anonymity in Interactions
由: VARAKANTHAM, Pradeep, et al.
出版: (2014)