Leveraging deep generative models for non-parametric distributions in reinforcement learning

This thesis explores the use of deep generative models to enhance distribution representations in reinforcement learning (RL), leading to improved exploration, stability, and performance. It focuses on two roles of distributions in RL: policy distributions and action distributions. For policy distri...

Full description

Saved in:

Bibliographic Details
Main Author:	Tang, Shi Yuan
Other Authors:	Zhang Jie
Format:	Thesis-Doctor of Philosophy
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Computer and Information Science
Online Access:	https://hdl.handle.net/10356/173455
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-173455
record_format	dspace
spelling	sg-ntu-dr.10356-1734552024-03-07T08:52:05Z Leveraging deep generative models for non-parametric distributions in reinforcement learning Tang, Shi Yuan Zhang Jie School of Computer Science and Engineering ZhangJ@ntu.edu.sg Computer and Information Science This thesis explores the use of deep generative models to enhance distribution representations in reinforcement learning (RL), leading to improved exploration, stability, and performance. It focuses on two roles of distributions in RL: policy distributions and action distributions. For policy distributions, an adversarial hypernetwork (AH) architecture enables multi-policy learning, allowing algorithms to converge to diverse local optima. The AH framework is generalized to single-policy RL algorithms, with a self-distillation mechanism for better learning efficiency. In the second part, the thesis investigates the benefits of using deep generative Fully-parameterized Quantile Function (FQF) in the actor of Soft Actor-Critic (SAC) to overcome uni-modality assumptions in stochastic policy implementations. The thesis demonstrates the theoretical boundedness of the entropy regularization term in FQF. Overall, this work proposes leveraging deep generative models to address performance inconsistencies and limitations of traditional modality assumptions in RL distributions. Doctor of Philosophy 2024-02-05T08:53:43Z 2024-02-05T08:53:43Z 2023 Thesis-Doctor of Philosophy Tang, S. Y. (2023). Leveraging deep generative models for non-parametric distributions in reinforcement learning. Doctoral thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/173455 https://hdl.handle.net/10356/173455 10.32657/10356/173455 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Computer and Information Science
spellingShingle	Computer and Information Science Tang, Shi Yuan Leveraging deep generative models for non-parametric distributions in reinforcement learning
description	This thesis explores the use of deep generative models to enhance distribution representations in reinforcement learning (RL), leading to improved exploration, stability, and performance. It focuses on two roles of distributions in RL: policy distributions and action distributions. For policy distributions, an adversarial hypernetwork (AH) architecture enables multi-policy learning, allowing algorithms to converge to diverse local optima. The AH framework is generalized to single-policy RL algorithms, with a self-distillation mechanism for better learning efficiency. In the second part, the thesis investigates the benefits of using deep generative Fully-parameterized Quantile Function (FQF) in the actor of Soft Actor-Critic (SAC) to overcome uni-modality assumptions in stochastic policy implementations. The thesis demonstrates the theoretical boundedness of the entropy regularization term in FQF. Overall, this work proposes leveraging deep generative models to address performance inconsistencies and limitations of traditional modality assumptions in RL distributions.
author2	Zhang Jie
author_facet	Zhang Jie Tang, Shi Yuan
format	Thesis-Doctor of Philosophy
author	Tang, Shi Yuan
author_sort	Tang, Shi Yuan
title	Leveraging deep generative models for non-parametric distributions in reinforcement learning
title_short	Leveraging deep generative models for non-parametric distributions in reinforcement learning
title_full	Leveraging deep generative models for non-parametric distributions in reinforcement learning
title_fullStr	Leveraging deep generative models for non-parametric distributions in reinforcement learning
title_full_unstemmed	Leveraging deep generative models for non-parametric distributions in reinforcement learning
title_sort	leveraging deep generative models for non-parametric distributions in reinforcement learning
publisher	Nanyang Technological University
publishDate	2024
url	https://hdl.handle.net/10356/173455
_version_	1794549355297898496

Leveraging deep generative models for non-parametric distributions in reinforcement learning

Similar Items