World model with PSR components

The world model framework is a successful and compact model that can quickly learn the spatial and temporal representation of the environment and then the policy to solve the task. It comprises of three components – a VAE that compresses visual information to abstract representations, an internal mo...

Full description

Saved in:
Bibliographic Details
Main Author: Tng, Jun Wei
Other Authors: Zinovi Rabinovich
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2022
Subjects:
Online Access:https://hdl.handle.net/10356/162992
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
id sg-ntu-dr.10356-162992
record_format dspace
spelling sg-ntu-dr.10356-1629922022-11-14T11:55:24Z World model with PSR components Tng, Jun Wei Zinovi Rabinovich School of Computer Science and Engineering zinovi@ntu.edu.sg Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence The world model framework is a successful and compact model that can quickly learn the spatial and temporal representation of the environment and then the policy to solve the task. It comprises of three components – a VAE that compresses visual information to abstract representations, an internal model for predicting the next observation frame, and a controller that decides on an action based on its policy. We investigate the use of PSRNN as an alternative internal model for the world model framework. The model was evaluated on two different environments and its performance was compared to that of MDN-RNN. It was found that when visual data was encoded to small latent spaces, PSRNN performed better than MDN-RNN on both environments. However, both agents did not manage to solve the tasks in both environments, which were likely due to the limitation of the controller model in the world model framework. Bachelor of Engineering (Computer Science) 2022-11-14T11:54:24Z 2022-11-14T11:54:24Z 2022 Final Year Project (FYP) Tng, J. W. (2022). World model with PSR components. Final Year Project (FYP), Nanyang Technological University, Singapore. https://hdl.handle.net/10356/162992 https://hdl.handle.net/10356/162992 en SCSE21-0788 application/pdf Nanyang Technological University
institution Nanyang Technological University
building NTU Library
continent Asia
country Singapore
Singapore
content_provider NTU Library
collection DR-NTU
language English
topic Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
spellingShingle Engineering::Computer science and engineering::Computing methodologies::Artificial intelligence
Tng, Jun Wei
World model with PSR components
description The world model framework is a successful and compact model that can quickly learn the spatial and temporal representation of the environment and then the policy to solve the task. It comprises of three components – a VAE that compresses visual information to abstract representations, an internal model for predicting the next observation frame, and a controller that decides on an action based on its policy. We investigate the use of PSRNN as an alternative internal model for the world model framework. The model was evaluated on two different environments and its performance was compared to that of MDN-RNN. It was found that when visual data was encoded to small latent spaces, PSRNN performed better than MDN-RNN on both environments. However, both agents did not manage to solve the tasks in both environments, which were likely due to the limitation of the controller model in the world model framework.
author2 Zinovi Rabinovich
author_facet Zinovi Rabinovich
Tng, Jun Wei
format Final Year Project
author Tng, Jun Wei
author_sort Tng, Jun Wei
title World model with PSR components
title_short World model with PSR components
title_full World model with PSR components
title_fullStr World model with PSR components
title_full_unstemmed World model with PSR components
title_sort world model with psr components
publisher Nanyang Technological University
publishDate 2022
url https://hdl.handle.net/10356/162992
_version_ 1751548511756746752