PARALLEL MONTE CARLO METHOD IN GRID WORLD (REINFORCEMENT LEARNING) USING CUDA DYNAMIC PARALLELISM
The parallel Monte Carlo method for reinforcement learning problems has been shown to accelerate the gain in agents’ experience quality per episode as the number of agents increases. Previous research has tested this with up to 16 parallel agents. The rapid development of GPGPU, especiall...
Main Author: | |
---|---|
Format: | Theses |
Language: | Indonesian |
Online Access: | https://digilib.itb.ac.id/gdl/view/39712 |
Institution: | Institut Teknologi Bandung |