Non-repetitive gaming experience as a curriculum design problem

In this project I seek to investigate problems where the distributions of trajectories of an agent through the environment is to be optimized, as opposed to optimizing for an end state. As part of this effort I developed an environment modelling a simulated player at a Lottery Game in a casino. A Dr...

Full description

Saved in:
Bibliographic Details
Main Author: Quek, Yufei
Other Authors: Zinovi Rabinovich
Format: Final Year Project
Language:English
Published: Nanyang Technological University 2020
Subjects:
Online Access:https://hdl.handle.net/10356/138141
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Nanyang Technological University
Language: English
Description
Summary:In this project I seek to investigate problems where the distributions of trajectories of an agent through the environment is to be optimized, as opposed to optimizing for an end state. As part of this effort I developed an environment modelling a simulated player at a Lottery Game in a casino. A Drama Manager is able to take certain actions which affects the environment and thus indirectly affect the player’s experience. By formulating the player’s experience – trajectory, or sequence of states – through the lottery game as a Markov Decision Problem, we have a well-studied framework on which existing algorithms can be applied to solve. Concurrently with the environment development, I built a Drama Manager that learns to solve the environment as a proof of concept. The complexity of the environment and the Drama Manager are increased concurrently, arriving at a complex stochastic environment that supports trajectory-based learning approaches and provides conflicting optimization goals. Correspondingly, the Drama Manager is able to make progress towards solving the final form of the environment.