ROBUST MULTI-AGENT PATH PLANNING ALGORITHM FOR TARGET LOCALIZATION IN AN UNKNOWN ENVIRONMENT

This study proposes a new approach to improve the robustness of the path planning algorithm based on received signal strength (RSS) by defining several variables as the state of Q-learning. Usually, there are two approaches to defining state Q-learning in RSS-based target localization problems, both...

Full description

Saved in:

Bibliographic Details
Main Author:	Dawne, Axel
Format:	Final Project
Language:	Indonesia
Online Access:	https://digilib.itb.ac.id/gdl/view/73149
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Institut Teknologi Bandung
Language:	Indonesia

id	id-itb.:73149
spelling	id-itb.:731492023-06-15T13:43:16ZROBUST MULTI-AGENT PATH PLANNING ALGORITHM FOR TARGET LOCALIZATION IN AN UNKNOWN ENVIRONMENT Dawne, Axel Indonesia Final Project RSS, Q-learning, Boltzmann distribution, independent learners, cooperative multi-agent INSTITUT TEKNOLOGI BANDUNG https://digilib.itb.ac.id/gdl/view/73149 This study proposes a new approach to improve the robustness of the path planning algorithm based on received signal strength (RSS) by defining several variables as the state of Q-learning. Usually, there are two approaches to defining state Q-learning in RSS-based target localization problems, both of which only use one variable as state Q-learning. Either approach is potentially difficult to achieve convergence. While other approaches are based on unrealistic assumptions and conflict with radio-frequency (RF) wave propagation characteristics. In this study, it will be proven that if these assumptions are not met, the performance of the methods proposed by other studies will not be able to converge. In addition, this research proposes a new approach using two variables as state Q-learning. This approach is proven to be able to outperform approaches that only use one variable as the Q-learning state in terms of speed to achieve convergence and robustness. In addition, this study also proposes using the Boltzmann distribution to replace ?-greedy, which is commonly used as an exploratory-exploitation method in RSS-based target localization research. This aims to accelerate the learning process of the algorithm so that the algorithm does not require many training episodes to achieve convergence. The simulation results show that using the Boltzmann distribution can reduce the steps needed for an agent to reach the target by up to 70% in the first ten training episodes. Lastly, this research uses a cooperative multi-agent approach to solve the target localization problem based on RSS. In the approach taken, it is proposed to combine the Q-tables owned by each agent while still prioritizing the agent's own Q-value. The simulation results show that using this algorithm can reduce the number of steps by up to 18% compared to independent learners in the first ten training episodes text
institution	Institut Teknologi Bandung
building	Institut Teknologi Bandung Library
continent	Asia
country	Indonesia Indonesia
content_provider	Institut Teknologi Bandung
collection	Digital ITB
language	Indonesia
description	This study proposes a new approach to improve the robustness of the path planning algorithm based on received signal strength (RSS) by defining several variables as the state of Q-learning. Usually, there are two approaches to defining state Q-learning in RSS-based target localization problems, both of which only use one variable as state Q-learning. Either approach is potentially difficult to achieve convergence. While other approaches are based on unrealistic assumptions and conflict with radio-frequency (RF) wave propagation characteristics. In this study, it will be proven that if these assumptions are not met, the performance of the methods proposed by other studies will not be able to converge. In addition, this research proposes a new approach using two variables as state Q-learning. This approach is proven to be able to outperform approaches that only use one variable as the Q-learning state in terms of speed to achieve convergence and robustness. In addition, this study also proposes using the Boltzmann distribution to replace ?-greedy, which is commonly used as an exploratory-exploitation method in RSS-based target localization research. This aims to accelerate the learning process of the algorithm so that the algorithm does not require many training episodes to achieve convergence. The simulation results show that using the Boltzmann distribution can reduce the steps needed for an agent to reach the target by up to 70% in the first ten training episodes. Lastly, this research uses a cooperative multi-agent approach to solve the target localization problem based on RSS. In the approach taken, it is proposed to combine the Q-tables owned by each agent while still prioritizing the agent's own Q-value. The simulation results show that using this algorithm can reduce the number of steps by up to 18% compared to independent learners in the first ten training episodes
format	Final Project
author	Dawne, Axel
spellingShingle	Dawne, Axel ROBUST MULTI-AGENT PATH PLANNING ALGORITHM FOR TARGET LOCALIZATION IN AN UNKNOWN ENVIRONMENT
author_facet	Dawne, Axel
author_sort	Dawne, Axel
title	ROBUST MULTI-AGENT PATH PLANNING ALGORITHM FOR TARGET LOCALIZATION IN AN UNKNOWN ENVIRONMENT
title_short	ROBUST MULTI-AGENT PATH PLANNING ALGORITHM FOR TARGET LOCALIZATION IN AN UNKNOWN ENVIRONMENT
title_full	ROBUST MULTI-AGENT PATH PLANNING ALGORITHM FOR TARGET LOCALIZATION IN AN UNKNOWN ENVIRONMENT
title_fullStr	ROBUST MULTI-AGENT PATH PLANNING ALGORITHM FOR TARGET LOCALIZATION IN AN UNKNOWN ENVIRONMENT
title_full_unstemmed	ROBUST MULTI-AGENT PATH PLANNING ALGORITHM FOR TARGET LOCALIZATION IN AN UNKNOWN ENVIRONMENT
title_sort	robust multi-agent path planning algorithm for target localization in an unknown environment
url	https://digilib.itb.ac.id/gdl/view/73149
_version_	1822992855849762816

ROBUST MULTI-AGENT PATH PLANNING ALGORITHM FOR TARGET LOCALIZATION IN AN UNKNOWN ENVIRONMENT

Similar Items