#TITLE_ALTERNATIVE#
The study of machine learning, including reinforcement learning, is currently developing. The use of Monte Carlo simulation is one way that can be used to solve reinforcement learning problems by generating random episodes. Although it is well known, its implementation is still lacking compared to o...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/27773 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:27773 |
---|---|
spelling |
id-itb.:277732018-09-03T08:46:28Z#TITLE_ALTERNATIVE# NUR KARIMAH (NIM: 13514106 ), HASNA Indonesia Final Project INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/27773 The study of machine learning, including reinforcement learning, is currently developing. The use of Monte Carlo simulation is one way that can be used to solve reinforcement learning problems by generating random episodes. Although it is well known, its implementation is still lacking compared to other methods such as Dynamic Programming and Temporal-Difference Learning. In fact, the use of this Monte Carlo simulation has advantages over the two methods, including learning is done by using a real sample, so there is no bias. In this final project, Monte Carlo simulation is used for finding a solution to reinforcement learning problem, which is pathfinding case in a gridworld environment. Then, a study was carried out on the influence of learning parameters and seeds of random number generator to the solution earned. From the experimental results, some main points were obtained. First, the random number generator seed does not have a significant influence on the solution. Second, the learning parameters have an influence on the results, including the number of episodes and the number of steps that give better results, but the training time becomes longer. Then, there is also the value of ε, which is the probability of random action, which affects the results. The greater the value of ε, the more likely random action is chosen than the best action, the longer the training time is needed, but the greater the goal percentage achieved. <br /> text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
The study of machine learning, including reinforcement learning, is currently developing. The use of Monte Carlo simulation is one way that can be used to solve reinforcement learning problems by generating random episodes. Although it is well known, its implementation is still lacking compared to other methods such as Dynamic Programming and Temporal-Difference Learning. In fact, the use of this Monte Carlo simulation has advantages over the two methods, including learning is done by using a real sample, so there is no bias. In this final project, Monte Carlo simulation is used for finding a solution to reinforcement learning problem, which is pathfinding case in a gridworld environment. Then, a study was carried out on the influence of learning parameters and seeds of random number generator to the solution earned. From the experimental results, some main points were obtained. First, the random number generator seed does not have a significant influence on the solution. Second, the learning parameters have an influence on the results, including the number of episodes and the number of steps that give better results, but the training time becomes longer. Then, there is also the value of ε, which is the probability of random action, which affects the results. The greater the value of ε, the more likely random action is chosen than the best action, the longer the training time is needed, but the greater the goal percentage achieved. <br />
|
format |
Final Project |
author |
NUR KARIMAH (NIM: 13514106 ), HASNA |
spellingShingle |
NUR KARIMAH (NIM: 13514106 ), HASNA #TITLE_ALTERNATIVE# |
author_facet |
NUR KARIMAH (NIM: 13514106 ), HASNA |
author_sort |
NUR KARIMAH (NIM: 13514106 ), HASNA |
title |
#TITLE_ALTERNATIVE# |
title_short |
#TITLE_ALTERNATIVE# |
title_full |
#TITLE_ALTERNATIVE# |
title_fullStr |
#TITLE_ALTERNATIVE# |
title_full_unstemmed |
#TITLE_ALTERNATIVE# |
title_sort |
#title_alternative# |
url |
https://digilib.itb.ac.id/gdl/view/27773 |
_version_ |
1822021461423947776 |