Safe MDP planning by learning temporal patterns of undesirable trajectories and averting negative side effects
In safe MDP planning, a cost function based on the current state and action is often used to specify safety aspects. In real world, often the state representation used may lack sufficient fidelity to specify such safety constraints. Operating based on an incomplete model can often produce unintended...
Saved in:
Main Authors: | LOW, Siow Meng, KUMAR, Akshat, SANNER, Scott |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2023
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/8604 https://ink.library.smu.edu.sg/context/sis_research/article/9607/viewcontent/2304.03081.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
Similar Items
-
Sample-efficient iterative lower bound optimization of deep reactive policies for planning in continuous MDPs
by: LOW, Siow Meng, et al.
Published: (2022) -
Trajectory optimization for safe navigation in maritime traffic using historical data
by: BASRUR, Chaithanya, et al.
Published: (2022) -
Trajectory similarity learning with auxiliary supervision and optimal matching
by: ZHANG, Hanyuan, et al.
Published: (2020) -
Belief-evidence fusion in a hybrid intelligent system
by: Marcos, Nelson, et al.
Published: (2004) -
MEMS ultrasonic transducers for safe, low-power and portable eye-blinking monitoring
by: SUN, Sheng, et al.
Published: (2022)