Time-inconsistency in reinforcement learning: an equilibrium control paradigm

Time inconsistency (TIC) describes a situation in which a plan, consisting of current and future actions, that is optimal today may no longer be optimal in the future. In reinforcement learning (RL), TIC often arises as we encode realistic human preferences or specific behaviors into an agent's...

Bibliographic Details
Main Author: Lesmana, Nixie Sapphira
Other Authors: Patrick Pun Chi Seng
Format: Thesis-Doctor of Philosophy
Language: English
Published: Nanyang Technological University 2024
Online Access: https://hdl.handle.net/10356/173187
Institution: Nanyang Technological University