Imitate the good and avoid the bad: An incremental approach to safe reinforcement learning

A popular framework for enforcing safe actions in Reinforcement Learning (RL) is Constrained RL, where trajectory based constraints on expected cost (or other cost measures) are employed to enforce safety and more importantly these constraints are enforced while maximizing expected reward. Most rece...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	HOANG, Minh Huy, TIEN, Mai Anh, VARAKANTHAM, Pradeep
التنسيق:	text
اللغة:	English
منشور في:	Institutional Knowledge at Singapore Management University 2024
الموضوعات:	Databases and Information Systems Theory and Algorithms
الوصول للمادة أونلاين:	https://ink.library.smu.edu.sg/sis_research/8594 https://ink.library.smu.edu.sg/context/sis_research/article/9597/viewcontent/imitate_the_good.pdf
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Singapore Management University
اللغة:	English

الانترنت

https://ink.library.smu.edu.sg/sis_research/8594
https://ink.library.smu.edu.sg/context/sis_research/article/9597/viewcontent/imitate_the_good.pdf

Imitate the good and avoid the bad: An incremental approach to safe reinforcement learning

الانترنت

مواد مشابهة