A hybrid agent architecture integrating desire, intention and reinforcement learning

This paper presents a hybrid agent architecture that integrates the behaviours of BDI agents, specifically desire and intention, with a neural network based reinforcement learner known as Temporal Difference-Fusion Architecture for Learning and COgNition (TD-FALCON). With the explicit maintenance of...

Bibliographic Details
Main Authors: TAN, Ah-hwee, ONG, Yew-Soon, TAPANUJ, Akejariyawong
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University 2011
Subjects:
Online Access: https://ink.library.smu.edu.sg/sis_research/5244
https://ink.library.smu.edu.sg/context/sis_research/article/6247/viewcontent/1_s2.0_S0957417411000650_main.pdf
Institution: Singapore Management University
id sg-smu-ink.sis_research-6247
record_format dspace
spelling sg-smu-ink.sis_research-6247 2020-07-23T18:23:02Z A hybrid agent architecture integrating desire, intention and reinforcement learning TAN, Ah-hwee ONG, Yew-Soon TAPANUJ, Akejariyawong This paper presents a hybrid agent architecture that integrates the behaviours of BDI agents, specifically desire and intention, with a neural network based reinforcement learner known as Temporal Difference-Fusion Architecture for Learning and COgNition (TD-FALCON). With the explicit maintenance of goals, the agent performs reinforcement learning with an awareness of its objectives instead of relying on external reinforcement signals. More importantly, the intention module equips the hybrid architecture with deliberative planning capabilities, enabling the agent to purposefully maintain an agenda of actions to perform and reducing the need to constantly sense the environment. Through reinforcement learning, plans can also be learned and evaluated without the rigidity of the user-defined plans used in traditional BDI systems. For intention and reinforcement learning to work cooperatively, two strategies are presented for combining the intention module and the reactive learning module for decision making in a real-time environment. Our case study, based on a minefield navigation domain, investigates how the desire and intention modules may cooperatively enhance the capability of a pure reinforcement learner. The empirical results show that the hybrid architecture is able to learn plans efficiently and tap both intentional and reactive action execution to yield robust performance.
2011-07-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5244 info:doi/10.1016/j.eswa.2011.01.045 https://ink.library.smu.edu.sg/context/sis_research/article/6247/viewcontent/1_s2.0_S0957417411000650_main.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University BDI architecture Reinforcement learning Plan learning Self-organizing neural networks Minefield navigation Computer and Systems Architecture Databases and Information Systems Software Engineering
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic BDI architecture
Reinforcement learning
Plan learning
Self-organizing neural networks
Minefield navigation
Computer and Systems Architecture
Databases and Information Systems
Software Engineering
spellingShingle BDI architecture
Reinforcement learning
Plan learning
Self-organizing neural networks
Minefield navigation
Computer and Systems Architecture
Databases and Information Systems
Software Engineering
TAN, Ah-hwee
ONG, Yew-Soon
TAPANUJ, Akejariyawong
A hybrid agent architecture integrating desire, intention and reinforcement learning
description This paper presents a hybrid agent architecture that integrates the behaviours of BDI agents, specifically desire and intention, with a neural network based reinforcement learner known as Temporal Difference-Fusion Architecture for Learning and COgNition (TD-FALCON). With the explicit maintenance of goals, the agent performs reinforcement learning with an awareness of its objectives instead of relying on external reinforcement signals. More importantly, the intention module equips the hybrid architecture with deliberative planning capabilities, enabling the agent to purposefully maintain an agenda of actions to perform and reducing the need to constantly sense the environment. Through reinforcement learning, plans can also be learned and evaluated without the rigidity of the user-defined plans used in traditional BDI systems. For intention and reinforcement learning to work cooperatively, two strategies are presented for combining the intention module and the reactive learning module for decision making in a real-time environment. Our case study, based on a minefield navigation domain, investigates how the desire and intention modules may cooperatively enhance the capability of a pure reinforcement learner. The empirical results show that the hybrid architecture is able to learn plans efficiently and tap both intentional and reactive action execution to yield robust performance.
format text
author TAN, Ah-hwee
ONG, Yew-Soon
TAPANUJ, Akejariyawong
author_facet TAN, Ah-hwee
ONG, Yew-Soon
TAPANUJ, Akejariyawong
author_sort TAN, Ah-hwee
title A hybrid agent architecture integrating desire, intention and reinforcement learning
title_short A hybrid agent architecture integrating desire, intention and reinforcement learning
title_full A hybrid agent architecture integrating desire, intention and reinforcement learning
title_fullStr A hybrid agent architecture integrating desire, intention and reinforcement learning
title_full_unstemmed A hybrid agent architecture integrating desire, intention and reinforcement learning
title_sort hybrid agent architecture integrating desire, intention and reinforcement learning
publisher Institutional Knowledge at Singapore Management University
publishDate 2011
url https://ink.library.smu.edu.sg/sis_research/5244
https://ink.library.smu.edu.sg/context/sis_research/article/6247/viewcontent/1_s2.0_S0957417411000650_main.pdf
_version_ 1770575347486556160