Combining PSR theory with distributional reinforcement learning

This work focuses on using Distributional Reinforcement Learning (DRL) in a partially observable environment that is modelled via Predictive State Representation Theory (PSR). We aim to integrate the beneﬁts of DRL and PSR to obtain a model-based reinforcement learning method that is capable of prov...

Full description

Saved in:

Bibliographic Details
Main Author:	Zhou, Jingzhe
Other Authors:	Zinovi Rabinovich
Format:	Thesis-Master by Research
Language:	English
Published:	Nanyang Technological University 2020
Subjects:	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Online Access:	https://hdl.handle.net/10356/139946
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Nanyang Technological University
Language:	English

id	sg-ntu-dr.10356-139946
record_format	dspace
spelling	sg-ntu-dr.10356-1399462020-10-28T08:29:13Z Combining PSR theory with distributional reinforcement learning Zhou, Jingzhe Zinovi Rabinovich School of Computer Science and Engineering zinovi@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence This work focuses on using Distributional Reinforcement Learning (DRL) in a partially observable environment that is modelled via Predictive State Representation Theory (PSR). We aim to integrate the beneﬁts of DRL and PSR to obtain a model-based reinforcement learning method that is capable of providing complete (distributional) performance information about a policy using an observation-only environment model. PSR theory is one of the advanced techniques used to model a dynamical system on a partially observable environment. Unlike traditional partially observable Markov models, such as POMDP, which capture the uncertainty of the environment using belief states, PSR model describes the partially observable environment based on probabilities of executable and observable future events. Distributional Reinforcement Learning (DRL), proposed by MG Bellemare, is a learning paradigm that aims to improve learning by modelling the rewards as probability distributions instead of scalar expectations. Master of Engineering 2020-05-22T13:01:14Z 2020-05-22T13:01:14Z 2020 Thesis-Master by Research Zhou, J. (2020). Combining PSR theory with distributional reinforcement learning. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/139946 10.32657/10356/139946 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University
institution	Nanyang Technological University
building	NTU Library
continent	Asia
country	Singapore Singapore
content_provider	NTU Library
collection	DR-NTU
language	English
topic	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle	Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Zhou, Jingzhe Combining PSR theory with distributional reinforcement learning
description	This work focuses on using Distributional Reinforcement Learning (DRL) in a partially observable environment that is modelled via Predictive State Representation Theory (PSR). We aim to integrate the beneﬁts of DRL and PSR to obtain a model-based reinforcement learning method that is capable of providing complete (distributional) performance information about a policy using an observation-only environment model. PSR theory is one of the advanced techniques used to model a dynamical system on a partially observable environment. Unlike traditional partially observable Markov models, such as POMDP, which capture the uncertainty of the environment using belief states, PSR model describes the partially observable environment based on probabilities of executable and observable future events. Distributional Reinforcement Learning (DRL), proposed by MG Bellemare, is a learning paradigm that aims to improve learning by modelling the rewards as probability distributions instead of scalar expectations.
author2	Zinovi Rabinovich
author_facet	Zinovi Rabinovich Zhou, Jingzhe
format	Thesis-Master by Research
author	Zhou, Jingzhe
author_sort	Zhou, Jingzhe
title	Combining PSR theory with distributional reinforcement learning
title_short	Combining PSR theory with distributional reinforcement learning
title_full	Combining PSR theory with distributional reinforcement learning
title_fullStr	Combining PSR theory with distributional reinforcement learning
title_full_unstemmed	Combining PSR theory with distributional reinforcement learning
title_sort	combining psr theory with distributional reinforcement learning
publisher	Nanyang Technological University
publishDate	2020
url	https://hdl.handle.net/10356/139946
_version_	1683494293176582144

Combining PSR theory with distributional reinforcement learning

Similar Items