Improving deep reinforcement learning with advanced exploration and transfer learning techniques

Deep reinforcement learning uses deep neural networks as function approximators to model the reinforcement learning policy, enabling the policy to be trained end to end. When applied to complex real-world problems such as video game playing and natural language processing, the...

Bibliographic Details
Main Author:	Yin, Haiyan
Other Authors:	Pan Jialin, Sinno
Format:	Thesis-Doctor of Philosophy
Language:	English
Published:	Nanyang Technological University 2020
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access:	https://hdl.handle.net/10356/137772
Institution:	Nanyang Technological University

id	sg-ntu-dr.10356-137772
record_format	dspace
spelling	sg-ntu-dr.10356-137772 2020-10-28T08:40:39Z Improving deep reinforcement learning with advanced exploration and transfer learning techniques Yin, Haiyan Pan Jialin, Sinno School of Computer Science and Engineering sinnopan@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Deep reinforcement learning uses deep neural networks as function approximators to model the reinforcement learning policy, enabling the policy to be trained end to end. When applied to complex real-world problems such as video game playing and natural language processing, deep reinforcement learning algorithms often involve a very large number of parameters and an intractable search space, a result of the low-level modelling of the state space or the complex nature of the problem. Therefore, constructing an effective exploration strategy to search the solution space is crucial for deriving a policy that can tackle challenging problems. Furthermore, given the considerable computational resources and time consumed by policy training, it is also crucial to develop the transferability of the algorithm so that it produces versatile and generalizable policies. In this thesis, I present a study on improving deep reinforcement learning algorithms from the perspectives of exploration and transfer learning. The study of exploration mainly focuses on solving hard exploration problems in the Atari 2600 game suite and in partially observable navigation domains with extremely sparse rewards. Three exploration algorithms are discussed: a planning-based algorithm with deep hashing techniques to improve search efficiency, a distributed framework with an exploration-incentivizing novelty model to increase sample throughput while gathering more novel experiences, and a sequence-level novelty model designed for sparse-reward, partially observable domains. 
To improve the generalization ability of the policy, I discuss two policy transfer algorithms, which address multi-task policy distillation and zero-shot policy transfer, respectively. These studies are evaluated in video game playing domains with high-dimensional pixel-level inputs. The test domains consist of the Atari 2600 game suite, ViZDoom and DeepMind Lab. The presented approaches demonstrate desirable properties for improving policy performance with the advanced exploration or transfer learning mechanisms. Finally, I conclude by discussing open questions and future directions in applying the presented exploration and transfer learning techniques in more general and practical scenarios. Doctor of Philosophy 2020-04-14T04:48:30Z 2020-04-14T04:48:30Z 2019 Thesis-Doctor of Philosophy Yin, H. (2019). Improving deep reinforcement learning with advanced exploration and transfer learning techniques. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/137772 10.32657/10356/137772 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Yin, Haiyan Improving deep reinforcement learning with advanced exploration and transfer learning techniques
description	Deep reinforcement learning uses deep neural networks as function approximators to model the reinforcement learning policy, enabling the policy to be trained end to end. When applied to complex real-world problems such as video game playing and natural language processing, deep reinforcement learning algorithms often involve a very large number of parameters and an intractable search space, a result of the low-level modelling of the state space or the complex nature of the problem. Therefore, constructing an effective exploration strategy to search the solution space is crucial for deriving a policy that can tackle challenging problems. Furthermore, given the considerable computational resources and time consumed by policy training, it is also crucial to develop the transferability of the algorithm so that it produces versatile and generalizable policies. In this thesis, I present a study on improving deep reinforcement learning algorithms from the perspectives of exploration and transfer learning. The study of exploration mainly focuses on solving hard exploration problems in the Atari 2600 game suite and in partially observable navigation domains with extremely sparse rewards. Three exploration algorithms are discussed: a planning-based algorithm with deep hashing techniques to improve search efficiency, a distributed framework with an exploration-incentivizing novelty model to increase sample throughput while gathering more novel experiences, and a sequence-level novelty model designed for sparse-reward, partially observable domains. To improve the generalization ability of the policy, I discuss two policy transfer algorithms, which address multi-task policy distillation and zero-shot policy transfer, respectively. These studies are evaluated in video game playing domains with high-dimensional pixel-level inputs. 
The test domains consist of the Atari 2600 game suite, ViZDoom and DeepMind Lab. The presented approaches demonstrate desirable properties for improving policy performance with the advanced exploration or transfer learning mechanisms. Finally, I conclude by discussing open questions and future directions in applying the presented exploration and transfer learning techniques in more general and practical scenarios.
author2	Pan Jialin, Sinno
author_facet	Pan Jialin, Sinno Yin, Haiyan
format	Thesis-Doctor of Philosophy
author	Yin, Haiyan
author_sort	Yin, Haiyan
title	Improving deep reinforcement learning with advanced exploration and transfer learning techniques
title_short	Improving deep reinforcement learning with advanced exploration and transfer learning techniques
title_full	Improving deep reinforcement learning with advanced exploration and transfer learning techniques
title_fullStr	Improving deep reinforcement learning with advanced exploration and transfer learning techniques
title_full_unstemmed	Improving deep reinforcement learning with advanced exploration and transfer learning techniques
title_sort	improving deep reinforcement learning with advanced exploration and transfer learning techniques
publisher	Nanyang Technological University
publishDate	2020
url	https://hdl.handle.net/10356/137772
_version_	1683493626849525760

Similar Items