Time-inconsistency in reinforcement learning: an equilibrium control paradigm

Time inconsistency (TIC) describes a situation in which a plan, consisting of current and future actions, that is optimal today may no longer be optimal in the future. In reinforcement learning (RL), TIC often arises as we encode realistic human preferences or specific behaviors into an agent's...

Bibliographic Details
Main Author:	Lesmana, Nixie Sapphira
Other Authors:	Patrick Pun Chi Seng
Format:	Thesis-Doctor of Philosophy
Language:	English
Published:	Nanyang Technological University 2024
Subjects:	Science::Mathematics::Applied mathematics; Engineering::Computer science and engineering::Computing methodologies
Online Access:	https://hdl.handle.net/10356/173187
Institution:	Nanyang Technological University

Similar Items