Reinforcement learning for swarm systems
The application of deep reinforcement learning to swarm systems is currently an actively explored topic. Adapting multi-agent reinforcement learning algorithms to swarm systems is difficult because of dynamic neighbourhood sizes and the lack of agent identities. Hence a key component to building a g...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Final Year Project |
Language: | English |
Published: |
Nanyang Technological University
2022
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/163384 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-163384 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1633842022-12-05T06:03:31Z Reinforcement learning for swarm systems Arumugam, Ramaswamy Zinovi Rabinovich School of Computer Science and Engineering zinovi@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence The application of deep reinforcement learning to swarm systems is currently an actively explored topic. Adapting multi-agent reinforcement learning algorithms to swarm systems is difficult because of dynamic neighbourhood sizes and the lack of agent identities. Hence a key component to building a good Swarm RL algorithm is an information summarization module. There is currently no consensus on the best way to summarize the information from an agent's neighbourhood. Therefore we explore various techniques for information summarization. We evaluate these techniques on two tasks - cover and cluster. We also introduce a new method for summarization based on selecting the top K most important pieces of information from an agent's observation. In this paper, we provide an experimental study of our algorithm and its efficacy. Bachelor of Engineering (Computer Science) 2022-12-05T06:03:31Z 2022-12-05T06:03:31Z 2022 Final Year Project (FYP) Arumugam, R. (2022). Reinforcement learning for swarm systems. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/163384 https://hdl.handle.net/10356/163384 en application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Arumugam, Ramaswamy Reinforcement learning for swarm systems |
description |
The application of deep reinforcement learning to swarm systems is currently an actively explored topic. Adapting multi-agent reinforcement learning algorithms to swarm systems is difficult because of dynamic neighbourhood sizes and the lack of agent identities. Hence a key component to building a good Swarm RL algorithm is an information summarization module. There is currently no consensus on the best way to summarize the information from an agent's neighbourhood. Therefore we explore various techniques for information summarization. We evaluate these techniques on two tasks - cover and cluster. We also introduce a new method for summarization based on selecting the top K most important pieces of information from an agent's observation. In this paper, we provide an experimental study of our algorithm and its efficacy. |
author2 |
Zinovi Rabinovich |
author_facet |
Zinovi Rabinovich Arumugam, Ramaswamy |
format |
Final Year Project |
author |
Arumugam, Ramaswamy |
author_sort |
Arumugam, Ramaswamy |
title |
Reinforcement learning for swarm systems |
title_short |
Reinforcement learning for swarm systems |
title_full |
Reinforcement learning for swarm systems |
title_fullStr |
Reinforcement learning for swarm systems |
title_full_unstemmed |
Reinforcement learning for swarm systems |
title_sort |
reinforcement learning for swarm systems |
publisher |
Nanyang Technological University |
publishDate |
2022 |
url |
https://hdl.handle.net/10356/163384 |
_version_ |
1751548556336955392 |