DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME

Reinforcement Learning is one of machine learning study where agents try to take action to maximize cumulative rewards. For that purpose, the agent saves the expected reward inside QTable. The need for memory for Q-Table can be replaced by using neural network to predict the Q-Value. This method is...

Full description

Saved in:
Bibliographic Details
Main Author: Fathur Rahman, Muhammad
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/43577
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
id id-itb.:43577
spelling id-itb.:435772019-09-27T14:03:40ZDEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME Fathur Rahman, Muhammad Indonesia Final Project reinforcement learning, deep q network, FightingICE INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/43577 Reinforcement Learning is one of machine learning study where agents try to take action to maximize cumulative rewards. For that purpose, the agent saves the expected reward inside QTable. The need for memory for Q-Table can be replaced by using neural network to predict the Q-Value. This method is called Deep Q Network. FightingICE is a fighting game created using Java. This game is created with the aim for artificial intelligence development. With that aim in the mind, this game is created so that it can help developer to develop their AI with its features. The Deep Q Network that have been created have input state taken from the game such as character information and projectile information. And the output from this network is Q(S, A) value for all available action in State S. This Network have 3 hidden layers with 256 unit within first hidden layer, 128 unit within second hidden layer and 64 unit within third hidden layer. The number of units in input and output layer is 150 unit and 41 unit. Activation function used is ReLU, while hidden layer uses linear activation function. This agent use epsilon greedy policy with 0.9 as its start value and 0.1 as its end value. Double Q-Learning is used for training the policy network. The result from this research is that agent using deep q network can beat random agent that considered as baseline agent. Furthermore, the agent can also compete with JayBot_GM, 3rd ranked agent during 2018 IEEE CIG Fighting Game AI Competition that use Genetic Algorithm and Monte Carlo Tree Search approach. text
institution Institut Teknologi Bandung
building Institut Teknologi Bandung Library
continent Asia
country Indonesia
Indonesia
content_provider Institut Teknologi Bandung
collection Digital ITB
language Indonesia
description Reinforcement Learning is one of machine learning study where agents try to take action to maximize cumulative rewards. For that purpose, the agent saves the expected reward inside QTable. The need for memory for Q-Table can be replaced by using neural network to predict the Q-Value. This method is called Deep Q Network. FightingICE is a fighting game created using Java. This game is created with the aim for artificial intelligence development. With that aim in the mind, this game is created so that it can help developer to develop their AI with its features. The Deep Q Network that have been created have input state taken from the game such as character information and projectile information. And the output from this network is Q(S, A) value for all available action in State S. This Network have 3 hidden layers with 256 unit within first hidden layer, 128 unit within second hidden layer and 64 unit within third hidden layer. The number of units in input and output layer is 150 unit and 41 unit. Activation function used is ReLU, while hidden layer uses linear activation function. This agent use epsilon greedy policy with 0.9 as its start value and 0.1 as its end value. Double Q-Learning is used for training the policy network. The result from this research is that agent using deep q network can beat random agent that considered as baseline agent. Furthermore, the agent can also compete with JayBot_GM, 3rd ranked agent during 2018 IEEE CIG Fighting Game AI Competition that use Genetic Algorithm and Monte Carlo Tree Search approach.
format Final Project
author Fathur Rahman, Muhammad
spellingShingle Fathur Rahman, Muhammad
DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME
author_facet Fathur Rahman, Muhammad
author_sort Fathur Rahman, Muhammad
title DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME
title_short DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME
title_full DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME
title_fullStr DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME
title_full_unstemmed DEEP Q NETWORK IMPLEMENTATION FOR FIGHTINGICE GAME
title_sort deep q network implementation for fightingice game
url https://digilib.itb.ac.id/gdl/view/43577
_version_ 1822270423800217600