#TITLE_ALTERNATIVE#

The study of machine learning, including reinforcement learning, is currently developing. The use of Monte Carlo simulation is one way that can be used to solve reinforcement learning problems by generating random episodes. Although it is well known, its implementation is still lacking compared to o...

Full description

Saved in:

Bibliographic Details
Main Author:	NUR KARIMAH (NIM: 13514106 ), HASNA
Format:	Final Project
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/27773
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Institut Teknologi Bandung
Language:	Indonesia

id	id-itb.:27773
spelling	id-itb.:277732018-09-03T08:46:28Z#TITLE_ALTERNATIVE# NUR KARIMAH (NIM: 13514106 ), HASNA Indonesia Final Project INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/27773 The study of machine learning, including reinforcement learning, is currently developing. The use of Monte Carlo simulation is one way that can be used to solve reinforcement learning problems by generating random episodes. Although it is well known, its implementation is still lacking compared to other methods such as Dynamic Programming and Temporal-Difference Learning. In fact, the use of this Monte Carlo simulation has advantages over the two methods, including learning is done by using a real sample, so there is no bias. In this final project, Monte Carlo simulation is used for finding a solution to reinforcement learning problem, which is pathfinding case in a gridworld environment. Then, a study was carried out on the influence of learning parameters and seeds of random number generator to the solution earned. From the experimental results, some main points were obtained. First, the random number generator seed does not have a significant influence on the solution. Second, the learning parameters have an influence on the results, including the number of episodes and the number of steps that give better results, but the training time becomes longer. Then, there is also the value of ε, which is the probability of random action, which affects the results. The greater the value of ε, the more likely random action is chosen than the best action, the longer the training time is needed, but the greater the goal percentage achieved. <br /> text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	The study of machine learning, including reinforcement learning, is currently developing. The use of Monte Carlo simulation is one way that can be used to solve reinforcement learning problems by generating random episodes. Although it is well known, its implementation is still lacking compared to other methods such as Dynamic Programming and Temporal-Difference Learning. In fact, the use of this Monte Carlo simulation has advantages over the two methods, including learning is done by using a real sample, so there is no bias. In this final project, Monte Carlo simulation is used for finding a solution to reinforcement learning problem, which is pathfinding case in a gridworld environment. Then, a study was carried out on the influence of learning parameters and seeds of random number generator to the solution earned. From the experimental results, some main points were obtained. First, the random number generator seed does not have a significant influence on the solution. Second, the learning parameters have an influence on the results, including the number of episodes and the number of steps that give better results, but the training time becomes longer. Then, there is also the value of ε, which is the probability of random action, which affects the results. The greater the value of ε, the more likely random action is chosen than the best action, the longer the training time is needed, but the greater the goal percentage achieved. <br />
format	Final Project
author	NUR KARIMAH (NIM: 13514106 ), HASNA
spellingShingle	NUR KARIMAH (NIM: 13514106 ), HASNA #TITLE_ALTERNATIVE#
author_facet	NUR KARIMAH (NIM: 13514106 ), HASNA
author_sort	NUR KARIMAH (NIM: 13514106 ), HASNA
title	#TITLE_ALTERNATIVE#
title_short	#TITLE_ALTERNATIVE#
title_full	#TITLE_ALTERNATIVE#
title_fullStr	#TITLE_ALTERNATIVE#
title_full_unstemmed	#TITLE_ALTERNATIVE#
title_sort	#title_alternative#
url	https://digilib.itb.ac.id/gdl/view/27773
_version_	1822021461423947776

#TITLE_ALTERNATIVE#

Similar Items