Combining PSR theory with distributional reinforcement learning
This work focuses on using Distributional Reinforcement Learning (DRL) in a partially observable environment that is modelled via Predictive State Representation Theory (PSR). We aim to integrate the benefits of DRL and PSR to obtain a model-based reinforcement learning method that is capable of prov...
Saved in:
Main Author: | |
---|---|
Other Authors: | |
Format: | Thesis-Master by Research |
Language: | English |
Published: |
Nanyang Technological University
2020
|
Subjects: | |
Online Access: | https://hdl.handle.net/10356/139946 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Nanyang Technological University |
Language: | English |
id |
sg-ntu-dr.10356-139946 |
---|---|
record_format |
dspace |
spelling |
sg-ntu-dr.10356-1399462020-10-28T08:29:13Z Combining PSR theory with distributional reinforcement learning Zhou, Jingzhe Zinovi Rabinovich School of Computer Science and Engineering zinovi@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence This work focuses on using Distributional Reinforcement Learning (DRL) in a partially observable environment that is modelled via Predictive State Representation Theory (PSR). We aim to integrate the benefits of DRL and PSR to obtain a model-based reinforcement learning method that is capable of providing complete (distributional) performance information about a policy using an observation-only environment model. PSR theory is one of the advanced techniques used to model a dynamical system on a partially observable environment. Unlike traditional partially observable Markov models, such as POMDP, which capture the uncertainty of the environment using belief states, PSR model describes the partially observable environment based on probabilities of executable and observable future events. Distributional Reinforcement Learning (DRL), proposed by MG Bellemare, is a learning paradigm that aims to improve learning by modelling the rewards as probability distributions instead of scalar expectations. Master of Engineering 2020-05-22T13:01:14Z 2020-05-22T13:01:14Z 2020 Thesis-Master by Research Zhou, J. (2020). Combining PSR theory with distributional reinforcement learning. Master's thesis, Nanyang Technological University, Singapore. https://hdl.handle.net/10356/139946 10.32657/10356/139946 en This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License (CC BY-NC 4.0). application/pdf Nanyang Technological University |
institution |
Nanyang Technological University |
building |
NTU Library |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
NTU Library |
collection |
DR-NTU |
language |
English |
topic |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence |
spellingShingle |
Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence Zhou, Jingzhe Combining PSR theory with distributional reinforcement learning |
description |
This work focuses on using Distributional Reinforcement Learning (DRL) in a partially observable environment that is modelled via Predictive State Representation Theory (PSR). We aim to integrate the benefits of DRL and PSR to obtain a model-based reinforcement learning method that is capable of providing complete (distributional) performance information about a policy using an observation-only environment model. PSR theory is one of the advanced techniques used to model a dynamical system on a partially observable environment. Unlike traditional partially observable Markov models, such as POMDP, which capture the uncertainty of the environment using belief states, PSR model describes the partially observable environment based on probabilities of executable and observable future events. Distributional Reinforcement Learning (DRL), proposed by MG Bellemare, is a learning paradigm that aims to improve learning by modelling the rewards as probability distributions instead of scalar expectations. |
author2 |
Zinovi Rabinovich |
author_facet |
Zinovi Rabinovich Zhou, Jingzhe |
format |
Thesis-Master by Research |
author |
Zhou, Jingzhe |
author_sort |
Zhou, Jingzhe |
title |
Combining PSR theory with distributional reinforcement learning |
title_short |
Combining PSR theory with distributional reinforcement learning |
title_full |
Combining PSR theory with distributional reinforcement learning |
title_fullStr |
Combining PSR theory with distributional reinforcement learning |
title_full_unstemmed |
Combining PSR theory with distributional reinforcement learning |
title_sort |
combining psr theory with distributional reinforcement learning |
publisher |
Nanyang Technological University |
publishDate |
2020 |
url |
https://hdl.handle.net/10356/139946 |
_version_ |
1683494293176582144 |