DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME
Reinforcement Learning is one of machine learning study where agents try to take action to maximize cumulative rewards. For that purpose, the agent saves the expected reward inside QTable. The need for memory for Q-Table can be replaced by using neural network to predict the Q-Value. This method is...
Saved in:
Main Author: | |
---|---|
Format: | Final Project |
Language: | Indonesia |
Online Access: | https://digilib.itb.ac.id/gdl/view/43577 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Institut Teknologi Bandung |
Language: | Indonesia |
id |
id-itb.:43577 |
---|---|
spelling |
id-itb.:435772019-09-27T14:03:40ZDEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME Fathur Rahman, Muhammad Indonesia Final Project reinforcement learning, deep q network, FightingICE INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/43577 Reinforcement Learning is one of machine learning study where agents try to take action to maximize cumulative rewards. For that purpose, the agent saves the expected reward inside QTable. The need for memory for Q-Table can be replaced by using neural network to predict the Q-Value. This method is called Deep Q Network. FightingICE is a fighting game created using Java. This game is created with the aim for artificial intelligence development. With that aim in the mind, this game is created so that it can help developer to develop their AI with its features. The Deep Q Network that have been created have input state taken from the game such as character information and projectile information. And the output from this network is Q(S, A) value for all available action in State S. This Network have 3 hidden layers with 256 unit within first hidden layer, 128 unit within second hidden layer and 64 unit within third hidden layer. The number of units in input and output layer is 150 unit and 41 unit. Activation function used is ReLU, while hidden layer uses linear activation function. This agent use epsilon greedy policy with 0.9 as its start value and 0.1 as its end value. Double Q-Learning is used for training the policy network. The result from this research is that agent using deep q network can beat random agent that considered as baseline agent. Furthermore, the agent can also compete with JayBot_GM, 3rd ranked agent during 2018 IEEE CIG Fighting Game AI Competition that use Genetic Algorithm and Monte Carlo Tree Search approach. text |
institution |
Institut Teknologi Bandung |
building |
Institut Teknologi Bandung Library |
continent |
Asia |
country |
Indonesia Indonesia |
content_provider |
Institut Teknologi Bandung |
collection |
Digital ITB |
language |
Indonesia |
description |
Reinforcement Learning is one of machine learning study where agents try to take action to maximize cumulative rewards. For that purpose, the agent saves the expected reward inside QTable. The need for memory for Q-Table can be replaced by using neural network to predict the Q-Value. This method is called Deep Q Network. FightingICE is a fighting game created using Java. This game is created with the aim for artificial intelligence development. With that aim in the mind, this game is created so that it can help developer to develop their AI with its features. The Deep Q Network that have been created have input state taken from the game such as character information and projectile information. And the output from this network is Q(S, A) value for all available action in State S. This Network have 3 hidden layers with 256 unit within first hidden layer, 128 unit within second hidden layer and 64 unit within third hidden layer. The number of units in input and output layer is 150 unit and 41 unit. Activation function used is ReLU, while hidden layer uses linear activation function. This agent use epsilon greedy policy with 0.9 as its start value and 0.1 as its end value. Double Q-Learning is used for training the policy network. The result from this research is that agent using deep q network can beat random agent that considered as baseline agent. Furthermore, the agent can also compete with JayBot_GM, 3rd ranked agent during 2018 IEEE CIG Fighting Game AI Competition that use Genetic Algorithm and Monte Carlo Tree Search approach. |
format |
Final Project |
author |
Fathur Rahman, Muhammad |
spellingShingle |
Fathur Rahman, Muhammad DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME |
author_facet |
Fathur Rahman, Muhammad |
author_sort |
Fathur Rahman, Muhammad |
title |
DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME |
title_short |
DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME |
title_full |
DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME |
title_fullStr |
DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME |
title_full_unstemmed |
DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME |
title_sort |
deep q network implementation for fightingice game |
url |
https://digilib.itb.ac.id/gdl/view/43577 |
_version_ |
1822270423800217600 |