Reinforcement learning for swarm systems

The application of deep reinforcement learning to swarm systems is currently an actively explored topic. Adapting multi-agent reinforcement learning algorithms to swarm systems is difficult because of dynamic neighbourhood sizes and the lack of agent identities. Hence a key component to building a g...

Full description

Saved in:
Bibliographic Details
Main Author: Arumugam, Ramaswamy
Other Authors: Zinovi Rabinovich
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/163384
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-163384
record_format dspace
spelling sg-ntu-dr.10356-1633842022-12-05T06:03:31Z Reinforcement learning for swarm systems Arumugam, Ramaswamy Zinovi Rabinovich School of Computer Science and Engineering zinovi@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence The application of deep reinforcement learning to swarm systems is currently an actively explored topic. Adapting multi-agent reinforcement learning algorithms to swarm systems is difficult because of dynamic neighbourhood sizes and the lack of agent identities. Hence a key component to building a good Swarm RL algorithm is an information summarization module. There is currently no consensus on the best way to summarize the information from an agent's neighbourhood. Therefore we explore various techniques for information summarization. We evaluate these techniques on two tasks - cover and cluster. We also introduce a new method for summarization based on selecting the top K most important pieces of information from an agent's observation. In this paper, we provide an experimental study of our algorithm and its efficacy. Bachelor of Engineering (Computer Science) 2022-12-05T06:03:31Z 2022-12-05T06:03:31Z 2022 Final Year Project (FYP) Arumugam, R. (2022). Reinforcement learning for swarm systems. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/163384 https://hdl.handle.net/10356/163384 en application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Arumugam, Ramaswamy
Reinforcement learning for swarm systems
description The application of deep reinforcement learning to swarm systems is currently an actively explored topic. Adapting multi-agent reinforcement learning algorithms to swarm systems is difficult because of dynamic neighbourhood sizes and the lack of agent identities. Hence a key component to building a good Swarm RL algorithm is an information summarization module. There is currently no consensus on the best way to summarize the information from an agent's neighbourhood. Therefore we explore various techniques for information summarization. We evaluate these techniques on two tasks - cover and cluster. We also introduce a new method for summarization based on selecting the top K most important pieces of information from an agent's observation. In this paper, we provide an experimental study of our algorithm and its efficacy.
author2 Zinovi Rabinovich
author_facet Zinovi Rabinovich
Arumugam, Ramaswamy
format Final Year Project
author Arumugam, Ramaswamy
author_sort Arumugam, Ramaswamy
title Reinforcement learning for swarm systems
title_short Reinforcement learning for swarm systems
title_full Reinforcement learning for swarm systems
title_fullStr Reinforcement learning for swarm systems
title_full_unstemmed Reinforcement learning for swarm systems
title_sort reinforcement learning for swarm systems
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/163384
_version_ 1751548556336955392