Bootstrapping Monte Carlo tree search with an imperfect heuristic

We consider the problem of using a heuristic policy to improve the value approximation by the Upper Confidence Bound applied in Trees (UCT) algorithm in non-adversarial settings such as planning with large-state space Markov Decision Processes. Current improvements to UCT focus on either changing th...

Full description

Saved in:

Bibliographic Details
Main Authors:	Nguyen T., Lee W., Tze-Yun LEONG
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2012
Subjects:	Databases and Information Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/2999
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

Internet

https://ink.library.smu.edu.sg/sis_research/2999

Bootstrapping Monte Carlo tree search with an imperfect heuristic

Internet

Similar Items