Curiosity-driven learning in artificial intelligence and its applications

The integration of neural structures and bio-functionality into machine learning (ML) models is an emerging trend that aims to develop human-level artificial intelligence, enabling intelligent agents to learn efficiently and perform better. Curiosity, as a fundamental element of human cognition,...

Full description

Saved in:
Bibliographic Details
Main Author: Sun, Chenyu
Other Authors: Miao Chun Yan
Format: Thesis-Doctor of Philosophy
Language:English
Published: Nanyang Technological University 2023
Subjects:
Online Access:https://hdl.handle.net/10356/172831
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-172831
record_format dspace
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering
spellingShingle Engineering::Computer science and engineering
Sun, Chenyu
Curiosity-driven learning in artificial intelligence and its applications
description The integration of neural structures and bio-functionality into machine learning (ML) models is an emerging trend that aims to develop human-level artificial intelligence, enabling intelligent agents to learn efficiently and perform better. Curiosity, as a fundamental element of human cognition, is an important intrinsic motivation that drives human intelligence to seek interesting information and explore the world. Incorporating curiosity into computational frameworks is of great significance, as artificial curiosity provides a natural intrinsic motivation for efficient learning, bridging the gap between ML research and practical application scenarios, such as overfitting, poor generalization, limited training samples, low sample efficiency, multi-skill offline learning, and high computational costs. Firstly, a systematic review of existing curiosity-driven learning methods in the fields of Reinforcement Learning (RL), Recommendation, and Classification has identified that curiosity-driven learning has become increasingly popular with more challenging tasks to be addressed, where agents are self-motivated to learn novel knowledge. Secondly, to address the challenges of learning directly from high-dimensional observations in online RL, we propose a model-agnostic contrastive-curiosity-driven learning framework (CCLF). CCLF fully exploits sample importance and improves learning efficiency in a self-supervised manner through contrastive curiosity. This method prioritizes the experience replay, selects the most informative augmented inputs, and regularizes the Q-function and encoder to concentrate more on under-learned data. It also encourages the agent to explore with a curiosity-based reward. As a result, the agent can focus on more informative samples and learn representation invariances more efficiently, with significantly reduced augmented inputs. CCLF is designed to integrate with different RL algorithms and architectures seamlessly. It does not impose strict constraints on the underlying RL method, allowing it to be applied alongside a wide range of RL approaches. Thirdly, for offline RL in a multi-task setting, we propose a curiosity-driven unsupervised data collection (CUDC) method that expands the feature space using adaptive temporal distances for task-agnostic data collection. CUDC estimates the probability of the $k$-step future states being reachable from the current states and adapts how many steps into the future the dynamics model should predict. With this adaptive reachability mechanism, the feature representation can be diversified, and the agent can navigate itself to collect higher-quality data with curiosity. The collected dataset helps the offline RL agents perform multi-task learning more efficiently, improving their overall learning capabilities. Fourthly, we also propose a curiosity-driven single-hidden-layer feedforward neural network (CD-SLFN) to improve online sequential classification problems. Based on the psychological theory of human curiosity, the artificial curiosity is computationally defined and is integrated into a regularized SLFN to encourage curiosity-driven online learning. The proposed model can actively select the most representative data in a sequential manner and flexibly adapt the model complexity to avoid overfitting. Compared to other online classifiers, the proposed classifier with intrinsic motivation has superior generalization ability, especially in the early learning phase with limited data. The analysis conducted in this thesis demonstrates the feasibility and effectiveness of introducing curiosity-driven learning in various RL problems and online classification task. This approach promotes the development of artificial intelligence applications with more human-like behaviors.
author2 Miao Chun Yan
author_facet Miao Chun Yan
Sun, Chenyu
format Thesis-Doctor of Philosophy
author Sun, Chenyu
author_sort Sun, Chenyu
title Curiosity-driven learning in artificial intelligence and its applications
title_short Curiosity-driven learning in artificial intelligence and its applications
title_full Curiosity-driven learning in artificial intelligence and its applications
title_fullStr Curiosity-driven learning in artificial intelligence and its applications
title_full_unstemmed Curiosity-driven learning in artificial intelligence and its applications
title_sort curiosity-driven learning in artificial intelligence and its applications
publisher Nanyang Technological University
publishDate 2023
url https://hdl.handle.net/10356/172831
_version_ 1787590741504032768
spelling sg-ntu-dr.10356-1728312024-01-04T06:32:51Z Curiosity-driven learning in artificial intelligence and its applications Sun, Chenyu Miao Chun Yan School of Computer Science and Engineering Joint NTU-UBC Research Centre of Excellence in Active Living for the Elderly (LILY) ASCYMiao@ntu.edu.sg Engineering::Computer science and engineering The integration of neural structures and bio-functionality into machine learning (ML) models is an emerging trend that aims to develop human-level artificial intelligence, enabling intelligent agents to learn efficiently and perform better. Curiosity, as a fundamental element of human cognition, is an important intrinsic motivation that drives human intelligence to seek interesting information and explore the world. Incorporating curiosity into computational frameworks is of great significance, as artificial curiosity provides a natural intrinsic motivation for efficient learning, bridging the gap between ML research and practical application scenarios, such as overfitting, poor generalization, limited training samples, low sample efficiency, multi-skill offline learning, and high computational costs. Firstly, a systematic review of existing curiosity-driven learning methods in the fields of Reinforcement Learning (RL), Recommendation, and Classification has identified that curiosity-driven learning has become increasingly popular with more challenging tasks to be addressed, where agents are self-motivated to learn novel knowledge. Secondly, to address the challenges of learning directly from high-dimensional observations in online RL, we propose a model-agnostic contrastive-curiosity-driven learning framework (CCLF). CCLF fully exploits sample importance and improves learning efficiency in a self-supervised manner through contrastive curiosity. This method prioritizes the experience replay, selects the most informative augmented inputs, and regularizes the Q-function and encoder to concentrate more on under-learned data. It also encourages the agent to explore with a curiosity-based reward. As a result, the agent can focus on more informative samples and learn representation invariances more efficiently, with significantly reduced augmented inputs. CCLF is designed to integrate with different RL algorithms and architectures seamlessly. It does not impose strict constraints on the underlying RL method, allowing it to be applied alongside a wide range of RL approaches. Thirdly, for offline RL in a multi-task setting, we propose a curiosity-driven unsupervised data collection (CUDC) method that expands the feature space using adaptive temporal distances for task-agnostic data collection. CUDC estimates the probability of the $k$-step future states being reachable from the current states and adapts how many steps into the future the dynamics model should predict. With this adaptive reachability mechanism, the feature representation can be diversified, and the agent can navigate itself to collect higher-quality data with curiosity. The collected dataset helps the offline RL agents perform multi-task learning more efficiently, improving their overall learning capabilities. Fourthly, we also propose a curiosity-driven single-hidden-layer feedforward neural network (CD-SLFN) to improve online sequential classification problems. Based on the psychological theory of human curiosity, the artificial curiosity is computationally defined and is integrated into a regularized SLFN to encourage curiosity-driven online learning. The proposed model can actively select the most representative data in a sequential manner and flexibly adapt the model complexity to avoid overfitting. Compared to other online classifiers, the proposed classifier with intrinsic motivation has superior generalization ability, especially in the early learning phase with limited data. The analysis conducted in this thesis demonstrates the feasibility and effectiveness of introducing curiosity-driven learning in various RL problems and online classification task. This approach promotes the development of artificial intelligence applications with more human-like behaviors. Doctor of Philosophy 2023-12-26T06:27:12Z 2023-12-26T06:27:12Z 2023 Thesis-Doctor of Philosophy Sun, C. (2023). Curiosity-driven learning in artificial intelligence and its applications. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/172831 https://hdl.handle.net/10356/172831 10.32657/10356/172831 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University