Imitate the good and avoid the bad: An incremental approach to safe reinforcement learning
A popular framework for enforcing safe actions in Reinforcement Learning (RL) is Constrained RL, where trajectory based constraints on expected cost (or other cost measures) are employed to enforce safety and more importantly these constraints are enforced while maximizing expected reward. Most rece...
Saved in:
Main Authors: | , , |
---|---|
格式: | text |
語言: | English |
出版: |
Institutional Knowledge at Singapore Management University
2024
|
主題: | |
在線閱讀: | https://ink.library.smu.edu.sg/sis_research/8594 https://ink.library.smu.edu.sg/context/sis_research/article/9597/viewcontent/imitate_the_good.pdf |
標簽: |
添加標簽
沒有標簽, 成為第一個標記此記錄!
|