Improving deep reinforcement learning with advanced exploration and transfer learning techniques

Deep reinforcement learning utilizes deep neural networks as the function approximator to model the reinforcement learning policy and enables the policy to be trained in an end-to-end manner. When applied to complex real world problems such as video games playing and natural language processing, the...

Full description

Saved in:
Bibliographic Details
Main Author: Yin, Haiyan
Other Authors: Pan Jialin, Sinno
Format: Thesis-Doctor of Philosophy
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/137772
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-137772
record_format dspace
spelling sg-ntu-dr.10356-1377722020-10-28T08:40:39Z Improving deep reinforcement learning with advanced exploration and transfer learning techniques Yin, Haiyan Pan Jialin, Sinno School of Computer Science and Engineering sinnopan@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Deep reinforcement learning utilizes deep neural networks as the function approximator to model the reinforcement learning policy and enables the policy to be trained in an end-to-end manner. When applied to complex real world problems such as video games playing and natural language processing, the deep reinforcement learning algorithms often engage tremendous parameters with intractable search space, which is a result from the low-level modelling of state space or the complex nature of the problem. Therefore, constructing an effective exploration strategy to search through the solution space is crucial for deriving a policy that can tackle challenging problems. Furthermore, considering the considerable amount of computational resource and time consumed for policy training, it is also crucial to develop the transferability of the algorithm to create versatile and generalizable policy. In this thesis, I present a study on improving the deep reinforcement learning algorithms from the perspectives of exploration and transfer learning. The study of exploration mainly focuses on solving hard exploration problems in Atari 2600 games suite and the partially observable navigation domains with extremely sparse rewards. The following three exploration algorithms are discussed: a planning-based algorithm with deep hashing techniques to improve the search efficiency, a distributed framework with an exploration incentivizing novelty model to increase the sample throughput while gathering more novel experiences, and a sequence-level novelty model designated for sparse rewarded partially observable domains. With the attempt to improve the generalization ability of the policy, I discuss two policy transfer algorithms, which work on multi-task policy distillation and zero-shot policy transfer tasks, respectively. The above mentioned study has been evaluated in video games playing domains with high dimensional pixel-level inputs. The testified domains consist of Atari 2600 games suite, ViZDoom and DeepMind Lab. As a result, the presented approaches demonstrate desirable properties for improving the policy performance with the advanced exploration or transfer learning mechanism. Finally, I conclude by discussing open questions and future directions in applying the presented exploration and transfer learning techniques in more general and practical scenarios. Doctor of Philosophy 2020-04-14T04:48:30Z 2020-04-14T04:48:30Z 2019 Thesis-Doctor of Philosophy Yin, H. (2019). Improving deep reinforcement learning with advanced exploration and transfer learning techniques. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/137772 10.32657/10356/137772 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Yin, Haiyan
Improving deep reinforcement learning with advanced exploration and transfer learning techniques
description Deep reinforcement learning utilizes deep neural networks as the function approximator to model the reinforcement learning policy and enables the policy to be trained in an end-to-end manner. When applied to complex real world problems such as video games playing and natural language processing, the deep reinforcement learning algorithms often engage tremendous parameters with intractable search space, which is a result from the low-level modelling of state space or the complex nature of the problem. Therefore, constructing an effective exploration strategy to search through the solution space is crucial for deriving a policy that can tackle challenging problems. Furthermore, considering the considerable amount of computational resource and time consumed for policy training, it is also crucial to develop the transferability of the algorithm to create versatile and generalizable policy. In this thesis, I present a study on improving the deep reinforcement learning algorithms from the perspectives of exploration and transfer learning. The study of exploration mainly focuses on solving hard exploration problems in Atari 2600 games suite and the partially observable navigation domains with extremely sparse rewards. The following three exploration algorithms are discussed: a planning-based algorithm with deep hashing techniques to improve the search efficiency, a distributed framework with an exploration incentivizing novelty model to increase the sample throughput while gathering more novel experiences, and a sequence-level novelty model designated for sparse rewarded partially observable domains. With the attempt to improve the generalization ability of the policy, I discuss two policy transfer algorithms, which work on multi-task policy distillation and zero-shot policy transfer tasks, respectively. The above mentioned study has been evaluated in video games playing domains with high dimensional pixel-level inputs. The testified domains consist of Atari 2600 games suite, ViZDoom and DeepMind Lab. As a result, the presented approaches demonstrate desirable properties for improving the policy performance with the advanced exploration or transfer learning mechanism. Finally, I conclude by discussing open questions and future directions in applying the presented exploration and transfer learning techniques in more general and practical scenarios.
author2 Pan Jialin, Sinno
author_facet Pan Jialin, Sinno
Yin, Haiyan
format Thesis-Doctor of Philosophy
author Yin, Haiyan
author_sort Yin, Haiyan
title Improving deep reinforcement learning with advanced exploration and transfer learning techniques
title_short Improving deep reinforcement learning with advanced exploration and transfer learning techniques
title_full Improving deep reinforcement learning with advanced exploration and transfer learning techniques
title_fullStr Improving deep reinforcement learning with advanced exploration and transfer learning techniques
title_full_unstemmed Improving deep reinforcement learning with advanced exploration and transfer learning techniques
title_sort improving deep reinforcement learning with advanced exploration and transfer learning techniques
publisher Nanyang Technological University
publishDate 2020
url https://hdl.handle.net/10356/137772
_version_ 1683493626849525760