Curiosity-driven learning in artificial intelligence and its applications

The integration of neural structures and bio-functionality into machine learning (ML) models is an emerging trend that aims to develop human-level artificial intelligence, enabling intelligent agents to learn efficiently and perform better. Curiosity, as a fundamental element of human cognition,...

Full description

Saved in:

Bibliographic Details
Main Author:	Sun, Chenyu
Other Authors:	Miao Chun Yan
Format:	Thesis-Doctor of Philosophy
Language:	English
Published:	Nanyang Technological University 2023
Subjects:	Engineering::Computer science and engineering
Online Access:	https://hdl.handle.net/10356/172831
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-172831
record_format	dspace
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering
spellingShingle	Engineering::Computer science and engineering Sun, Chenyu Curiosity-driven learning in artificial intelligence and its applications
description	The integration of neural structures and bio-functionality into machine learning (ML) models is an emerging trend that aims to develop human-level artificial intelligence, enabling intelligent agents to learn efficiently and perform better. Curiosity, as a fundamental element of human cognition, is an important intrinsic motivation that drives human intelligence to seek interesting information and explore the world. Incorporating curiosity into computational frameworks is of great significance, as artificial curiosity provides a natural intrinsic motivation for efficient learning, bridging the gap between ML research and practical application scenarios, such as overfitting, poor generalization, limited training samples, low sample efficiency, multi-skill offline learning, and high computational costs. Firstly, a systematic review of existing curiosity-driven learning methods in the fields of Reinforcement Learning (RL), Recommendation, and Classification has identified that curiosity-driven learning has become increasingly popular with more challenging tasks to be addressed, where agents are self-motivated to learn novel knowledge. Secondly, to address the challenges of learning directly from high-dimensional observations in online RL, we propose a model-agnostic contrastive-curiosity-driven learning framework (CCLF). CCLF fully exploits sample importance and improves learning efficiency in a self-supervised manner through contrastive curiosity. This method prioritizes the experience replay, selects the most informative augmented inputs, and regularizes the Q-function and encoder to concentrate more on under-learned data. It also encourages the agent to explore with a curiosity-based reward. As a result, the agent can focus on more informative samples and learn representation invariances more efficiently, with significantly reduced augmented inputs. CCLF is designed to integrate with different RL algorithms and architectures seamlessly. It does not impose strict constraints on the underlying RL method, allowing it to be applied alongside a wide range of RL approaches. Thirdly, for offline RL in a multi-task setting, we propose a curiosity-driven unsupervised data collection (CUDC) method that expands the feature space using adaptive temporal distances for task-agnostic data collection. CUDC estimates the probability of the $k$-step future states being reachable from the current states and adapts how many steps into the future the dynamics model should predict. With this adaptive reachability mechanism, the feature representation can be diversified, and the agent can navigate itself to collect higher-quality data with curiosity. The collected dataset helps the offline RL agents perform multi-task learning more efficiently, improving their overall learning capabilities. Fourthly, we also propose a curiosity-driven single-hidden-layer feedforward neural network (CD-SLFN) to improve online sequential classification problems. Based on the psychological theory of human curiosity, the artificial curiosity is computationally defined and is integrated into a regularized SLFN to encourage curiosity-driven online learning. The proposed model can actively select the most representative data in a sequential manner and flexibly adapt the model complexity to avoid overfitting. Compared to other online classifiers, the proposed classifier with intrinsic motivation has superior generalization ability, especially in the early learning phase with limited data. The analysis conducted in this thesis demonstrates the feasibility and effectiveness of introducing curiosity-driven learning in various RL problems and online classification task. This approach promotes the development of artificial intelligence applications with more human-like behaviors.
author2	Miao Chun Yan
author_facet	Miao Chun Yan Sun, Chenyu
format	Thesis-Doctor of Philosophy
author	Sun, Chenyu
author_sort	Sun, Chenyu
title	Curiosity-driven learning in artificial intelligence and its applications
title_short	Curiosity-driven learning in artificial intelligence and its applications
title_full	Curiosity-driven learning in artificial intelligence and its applications
title_fullStr	Curiosity-driven learning in artificial intelligence and its applications
title_full_unstemmed	Curiosity-driven learning in artificial intelligence and its applications
title_sort	curiosity-driven learning in artificial intelligence and its applications
publisher	Nanyang Technological University
publishDate	2023
url	https://hdl.handle.net/10356/172831
_version_	1787590741504032768
spelling	sg-ntu-dr.10356-1728312024-01-04T06:32:51Z Curiosity-driven learning in artificial intelligence and its applications Sun, Chenyu Miao Chun Yan School of Computer Science and Engineering Joint NTU-UBC Research Centre of Excellence in Active Living for the Elderly (LILY) ASCYMiao@ntu.edu.sg Engineering::Computer science and engineering The integration of neural structures and bio-functionality into machine learning (ML) models is an emerging trend that aims to develop human-level artificial intelligence, enabling intelligent agents to learn efficiently and perform better. Curiosity, as a fundamental element of human cognition, is an important intrinsic motivation that drives human intelligence to seek interesting information and explore the world. Incorporating curiosity into computational frameworks is of great significance, as artificial curiosity provides a natural intrinsic motivation for efficient learning, bridging the gap between ML research and practical application scenarios, such as overfitting, poor generalization, limited training samples, low sample efficiency, multi-skill offline learning, and high computational costs. Firstly, a systematic review of existing curiosity-driven learning methods in the fields of Reinforcement Learning (RL), Recommendation, and Classification has identified that curiosity-driven learning has become increasingly popular with more challenging tasks to be addressed, where agents are self-motivated to learn novel knowledge. Secondly, to address the challenges of learning directly from high-dimensional observations in online RL, we propose a model-agnostic contrastive-curiosity-driven learning framework (CCLF). CCLF fully exploits sample importance and improves learning efficiency in a self-supervised manner through contrastive curiosity. This method prioritizes the experience replay, selects the most informative augmented inputs, and regularizes the Q-function and encoder to concentrate more on under-learned data. It also encourages the agent to explore with a curiosity-based reward. As a result, the agent can focus on more informative samples and learn representation invariances more efficiently, with significantly reduced augmented inputs. CCLF is designed to integrate with different RL algorithms and architectures seamlessly. It does not impose strict constraints on the underlying RL method, allowing it to be applied alongside a wide range of RL approaches. Thirdly, for offline RL in a multi-task setting, we propose a curiosity-driven unsupervised data collection (CUDC) method that expands the feature space using adaptive temporal distances for task-agnostic data collection. CUDC estimates the probability of the $k$-step future states being reachable from the current states and adapts how many steps into the future the dynamics model should predict. With this adaptive reachability mechanism, the feature representation can be diversified, and the agent can navigate itself to collect higher-quality data with curiosity. The collected dataset helps the offline RL agents perform multi-task learning more efficiently, improving their overall learning capabilities. Fourthly, we also propose a curiosity-driven single-hidden-layer feedforward neural network (CD-SLFN) to improve online sequential classification problems. Based on the psychological theory of human curiosity, the artificial curiosity is computationally defined and is integrated into a regularized SLFN to encourage curiosity-driven online learning. The proposed model can actively select the most representative data in a sequential manner and flexibly adapt the model complexity to avoid overfitting. Compared to other online classifiers, the proposed classifier with intrinsic motivation has superior generalization ability, especially in the early learning phase with limited data. The analysis conducted in this thesis demonstrates the feasibility and effectiveness of introducing curiosity-driven learning in various RL problems and online classification task. This approach promotes the development of artificial intelligence applications with more human-like behaviors. Doctor of Philosophy 2023-12-26T06:27:12Z 2023-12-26T06:27:12Z 2023 Thesis-Doctor of Philosophy Sun, C. (2023). Curiosity-driven learning in artificial intelligence and its applications. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/172831 https://hdl.handle.net/10356/172831 10.32657/10356/172831 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University

Curiosity-driven learning in artificial intelligence and its applications

Similar Items