Imitate the good and avoid the bad: An incremental approach to safe reinforcement learning

A popular framework for enforcing safe actions in Reinforcement Learning (RL) is Constrained RL, where trajectory based constraints on expected cost (or other cost measures) are employed to enforce safety and more importantly these constraints are enforced while maximizing expected reward. Most rece...

全面介紹

Saved in:
書目詳細資料
Main Authors: HOANG, Minh Huy, TIEN, Mai Anh, VARAKANTHAM, Pradeep
格式: text
語言:English
出版: Institutional Knowledge at Singapore Management University 2024
主題:
在線閱讀:https://ink.library.smu.edu.sg/sis_research/8594
https://ink.library.smu.edu.sg/context/sis_research/article/9597/viewcontent/imitate_the_good.pdf
標簽: 添加標簽
沒有標簽, 成為第一個標記此記錄!