Bootstrapping Monte Carlo tree search with an imperfect heuristic
We consider the problem of using a heuristic policy to improve the value approximation by the Upper Confidence Bound applied in Trees (UCT) algorithm in non-adversarial settings such as planning with large-state space Markov Decision Processes. Current improvements to UCT focus on either changing th...
Saved in:
Main Authors: | Nguyen T., Lee W., Tze-Yun LEONG |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2012
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/2999 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
Bootstrapping Monte Carlo tree search with an imperfect heuristic
by: Nguyen, T.-H.D., et al.
Published: (2013) -
Stochastic gradient Hamiltonian Monte Carlo with variance reduction for Bayesian inference
by: LI, Zhize, et al.
Published: (2019) -
Water Quality Forecasting in Shrimp Cultures based on Monte Carlo Tree Search
by: Dhanachai Pinitsava, et al.
Published: (2022) -
Efficient meta neural heuristic for multi-objective combinatorial optimization
by: CHEN, Jinbiao, et al.
Published: (2023) -
Monte Carlo Tree Search and Delay-Aware Feedback Adaptation for Underwater Acoustic Link Tuning
by: Wu, Shuangshuang, et al.
Published: (2021)