AATEAM : achieving the ad hoc teamwork by employing the attention mechanism
In the ad hoc teamwork setting, a team of agents needs to perform a task without prior coordination. The most advanced approach learns policies based on previous experiences and reuses one of the policies to interact with new teammates. However, the selected policy in many cases is sub-optimal. Swi...
Saved in:
Main Authors: | , , , |
---|---|
Other Authors: | |
Format: | Conference or Workshop Item |
Language: | English |
Published: |
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/144311 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-144311 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1443112020-10-28T02:36:55Z AATEAM : achieving the ad hoc teamwork by employing the attention mechanism Chen, Shuo Andrejczuk, Ewa Cao, Zhiguang Zhang, Jie School of Electrical and Electronic Engineering AAAI Conference on Artificial Intelligence Engineering::Electrical and electronic engineering Ad Hoc Teamwork Artificial Intelligence In the ad hoc teamwork setting, a team of agents needs to perform a task without prior coordination. The most advanced approach learns policies based on previous experiences and reuses one of the policies to interact with new teammates. However, the selected policy in many cases is sub-optimal. Switching between policies to adapt to new teammates’ behaviour takes time, which threatens the successful performance of a task. In this paper, we propose AATEAM – a method that uses the attention-based neural networks to cope with new teammates’ behaviour in real-time. We train one attention network per teammate type. The attention networks learn both to extract the temporal correlations from the sequence of states (i.e. contexts) and the mapping from contexts to actions. Each attention network also learns to predict a future state given the current context and its output action. The prediction accuracies help to determine which actions the ad hoc agent should take. We perform extensive experiments to show the effectiveness of our method. Accepted version 2020-10-28T02:36:55Z 2020-10-28T02:36:55Z 2020 Conference Paper Chen, S., Andrejczuk, E., Cao, Z., & Zhang, J. (2020). AATEAM : achieving the ad hoc teamwork by employing the attention mechanism. Proceedings of the AAAI Conference on Artificial Intelligence, 34(5). doi:10.1609/aaai.v34i05.6196 https://hdl.handle.net/10356/144311 10.1609/aaai.v34i05.6196 34 en © 2020 Association for the Advancement of Artificial Intelligence (AAAI). All rights reserved. This paper was published in Proceedings of the AAAI Conference on Artificial Intelligence and is made available with permission of Association for the Advancement of Artificial Intelligence (AAAI). application/pdf |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Electrical and electronic engineering Ad Hoc Teamwork Artificial Intelligence |
spellingShingle |
Engineering::Electrical and electronic engineering Ad Hoc Teamwork Artificial Intelligence Chen, Shuo Andrejczuk, Ewa Cao, Zhiguang Zhang, Jie AATEAM : achieving the ad hoc teamwork by employing the attention mechanism |
description |
In the ad hoc teamwork setting, a team of agents needs to perform a task without prior coordination. The most advanced approach learns policies based on previous experiences and reuses one of the policies to interact with new teammates.
However, the selected policy in many cases is sub-optimal. Switching between policies to adapt to new teammates’ behaviour takes time, which threatens the successful performance of a task. In this paper, we propose AATEAM –
a method that uses the attention-based neural networks to cope with new teammates’ behaviour in real-time. We train one attention network per teammate type. The attention networks learn both to extract the temporal correlations from the
sequence of states (i.e. contexts) and the mapping from contexts to actions. Each attention network also learns to predict a future state given the current context and its output action. The prediction accuracies help to determine which actions the
ad hoc agent should take. We perform extensive experiments to show the effectiveness of our method. |
author2 |
School of Electrical and Electronic Engineering |
author_facet |
School of Electrical and Electronic Engineering Chen, Shuo Andrejczuk, Ewa Cao, Zhiguang Zhang, Jie |
format |
Conference or Workshop Item |
author |
Chen, Shuo Andrejczuk, Ewa Cao, Zhiguang Zhang, Jie |
author_sort |
Chen, Shuo |
title |
AATEAM : achieving the ad hoc teamwork by employing the attention mechanism |
title_short |
AATEAM : achieving the ad hoc teamwork by employing the attention mechanism |
title_full |
AATEAM : achieving the ad hoc teamwork by employing the attention mechanism |
title_fullStr |
AATEAM : achieving the ad hoc teamwork by employing the attention mechanism |
title_full_unstemmed |
AATEAM : achieving the ad hoc teamwork by employing the attention mechanism |
title_sort |
aateam : achieving the ad hoc teamwork by employing the attention mechanism |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/144311 |
_version_ |
1683493252211146752 |