A self-organizing neural architecture integrating desire, intention and reinforcement learning

This paper presents a self-organizing neural architecture that integrates the features of belief, desire, and intention (BDI) systems with reinforcement learning. Based on fusion Adaptive Resonance Theory (fusion ART), the proposed architecture provides a unified treatment for both intentional and r...

Full description

Saved in:
Bibliographic Details
Main Authors: TAN, Ah-hwee, FENG, Yu-Hong, ONG, Yew-Soon
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2010
Subjects:
BDI
Online Access:https://ink.library.smu.edu.sg/sis_research/5217
https://ink.library.smu.edu.sg/context/sis_research/article/6220/viewcontent/1_s2.0_S0925231209004196_main.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
Description
Summary:This paper presents a self-organizing neural architecture that integrates the features of belief, desire, and intention (BDI) systems with reinforcement learning. Based on fusion Adaptive Resonance Theory (fusion ART), the proposed architecture provides a unified treatment for both intentional and reactive cognitive functionalities. Operating with a sense-act-learn paradigm, the low level reactive module is a fusion ART network that learns action and value policies across the sensory, motor, and feedback channels. During performance, the actions executed by the reactive module are tracked by a high level intention module (also a fusion ART network) that learns to associate sequences of actions with context and goals. The intention module equips the architecture with deliberative planning capabilities, enabling it to purposefully maintain an agenda of actions to perform and to reduce the need of constantly sensing the environment. Through reinforcement learning, plans can also be evaluated and refined without the rigidity of user-defined plans. We examine two strategies for combining the intention and reactive modules for decision making in a real time environment. Our experiments based on a minefield navigation domain show that the integrated architecture is able to learn plans efficiently, achieve good plan utilization, and combine both intentional and reactive action execution to yield a robust performance.