DEVELOPMENT OF CNN-BASED DEEP Q-LEARNING FOR PATH PLANNING SIMULATOR

This Final Project Report examines the development of a Deep Q-learning (DQL) model based on Convolutional Neural Network (CNN) for a path planning simulator. Path planning is the process of finding an optimal collision-free route from a starting point to an endpoint in a given environment. The Q...

Full description

Saved in:
Bibliographic Details
Main Author: Rionaldo Pasaribu, Jeremy
Format: Final Project
Language:Indonesia
Online Access:https://digilib.itb.ac.id/gdl/view/85048
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Institut Teknologi Bandung
Language: Indonesia
Description
Summary:This Final Project Report examines the development of a Deep Q-learning (DQL) model based on Convolutional Neural Network (CNN) for a path planning simulator. Path planning is the process of finding an optimal collision-free route from a starting point to an endpoint in a given environment. The Q-learning method used in traditional path planning often encounters challenges in storing large Q- tables as the number of states and actions increases. To address this issue, the DQL method is used, utilizing neural networks to replace the Q-table. This research aims to build a DQL model with CNN architecture and compare it to a Feed Forward Neural Network (FFNN) architecture. CNN is chosen for its ability to recognize spatial patterns in the simulation environment compared to FFNN. This study examines two DQL models: FFNN using the method based on Sumarudin et al. (2023) and CNN using an image-based method as a solution. The experimental results are conducted in two stages: training and testing for one maze and ten mazes. For a single maze, the FFNN model's success rate of 0.846 for testing surpasses the performance of the CNN model. For ten mazes, the CNN model's success rate of 0.5040 for testing exceeds the performance of the FFNN model. The experimental results show that the FFNN model performs better in testing on a single maze, but for testing on ten mazes, the CNN model demonstrates superior performance. This research suggests further exploration in testing with constantly changing mazes and hyperparameter optimization to achieve more effective model performance.