Bootstrapping Monte Carlo tree search with an imperfect heuristic

We consider the problem of using a heuristic policy to improve the value approximation by the Upper Confidence Bounds applied to Trees (UCT) algorithm in non-adversarial settings such as planning with large state-space Markov Decision Processes. Current improvements to UCT focus on either changing th...

Bibliographic Details
Main Authors: Nguyen T., Lee W., Tze-Yun LEONG
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2012
Online Access: https://ink.library.smu.edu.sg/sis_research/2999
Institution: Singapore Management University