Safe MDP planning by learning temporal patterns of undesirable trajectories and averting negative side effects

In safe MDP planning, a cost function based on the current state and action is often used to specify safety aspects. In real world, often the state representation used may lack sufficient fidelity to specify such safety constraints. Operating based on an incomplete model can often produce unintended...

Full description

Saved in:
Bibliographic Details
Main Authors: LOW, Siow Meng, KUMAR, Akshat, SANNER, Scott
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2023
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/8604
https://ink.library.smu.edu.sg/context/sis_research/article/9607/viewcontent/2304.03081.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English