Safe MDP planning by learning temporal patterns of undesirable trajectories and averting negative side effects

Safe MDP planning by learning temporal patterns of undesirable trajectories and averting negative side effects

In safe MDP planning, a cost function based on the current state and action is often used to specify safety aspects. In real world, often the state representation used may lack sufficient fidelity to specify such safety constraints. Operating based on an incomplete model can often produce unintended...

Full description

Saved in:

Bibliographic Details
Main Authors:	LOW, Siow Meng, KUMAR, Akshat, SANNER, Scott
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2023
Subjects:	Artificial intelligence Cost functions Lagrange multipliers Learning systems Artificial Intelligence and Robotics Databases and Information Systems Programming Languages and Compilers
Online Access:	https://ink.library.smu.edu.sg/sis_research/8604 https://ink.library.smu.edu.sg/context/sis_research/article/9607/viewcontent/2304.03081.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Sample-efficient iterative lower bound optimization of deep reactive policies for planning in continuous MDPs
by: LOW, Siow Meng, et al.
Published: (2022)

BiST: Bi-directional spatio-temporal reasoning for video-grounded dialogues
by: LE, Hung, et al.
Published: (2020)

Trajectory optimization for safe navigation in maritime traffic using historical data
by: BASRUR, Chaithanya, et al.
Published: (2022)

Trajectory similarity learning with auxiliary supervision and optimal matching
by: ZHANG, Hanyuan, et al.
Published: (2020)

Belief-evidence fusion in a hybrid intelligent system
by: Marcos, Nelson, et al.
Published: (2004)

MEMS ultrasonic transducers for safe, low-power and portable eye-blinking monitoring
by: SUN, Sheng, et al.
Published: (2022)

Eye-tracking monitoring based on PMUT arrays
by: SUN, Sheng, et al.
Published: (2021)

Learning and evaluating Chinese idiom embeddings
by: TAN, Minghuan, et al.
Published: (2021)

Sound and complete witnesses for template-based verification of LTL properties on polynomial programs
by: CHATTERJEE, Krishnendu, et al.
Published: (2024)

ReEvo: Large language models as hyper-heuristics with reflective evolution
by: YE, Haoran, et al.
Published: (2024)

Reverse modeling in large language models
by: YU, Sicheng, et al.
Published: (2025)

Frame-voyager: Learning to query frames for video large language models
by: YU, Sicheng, et al.
Published: (2025)

UniConv: A unified conversational neural architecture for multi-domain task-oriented dialogues
by: LE, Hung, et al.
Published: (2020)

Video-grounded dialogues with pretrained generation language models
by: LE, Hung, et al.
Published: (2020)

Dynamic topic models for temporal document networks
by: ZHANG, Ce, et al.
Published: (2022)

Tests of Functional Form and Heteroscedasticity
by: YANG, Zhenlin, et al.
Published: (2003)

Inferring door locations from a teammate's trajectory in stealth human-robot team operations
by: OH, Jean, et al.
Published: (2015)

Superminds at work: The promise of human-AI collaboration
by: MALONE, Thomas W.
Published: (2024)

Using constraint programming and graph representation learning for generating interpretable cloud security policies
by: KAZDAGLI, Mikhail, et al.
Published: (2022)

A mixed-integer linear programming reduction of disjoint bilinear programs via symbolic variable elimination
by: JEONG, Jihwan, et al.
Published: (2023)

FlowPG: Action-constrained policy gradient with normalizing flows
by: BRAHMANAGE JANAKA CHATHURANGA THILAKARATHNA,, et al.
Published: (2023)

A balanced view of artificial intelligence
by: Ngo, Courtney Anne M.
Published: (2018)

Can we regulate artificial intelligence?
by: Javier, Cholo E.
Published: (2024)

How to think like an AI
by: Lugtu, Reynaldo C., Jr.
Published: (2024)

Is AI making us smarter or dumber?
by: Lugtu, Reynaldo C.
Published: (2024)

The rise of generation AI
by: Lugtu, Reynaldo C., Jr.
Published: (2025)

Teaching use of AI with meta-reflections
by: Aure, Patrick Adriel H.
Published: (2023)

Tracing Linguistic Relations in Winning and Losing Sides of Explicit Opposing Groups
by: Sanli, Ceyda, et al.
Published: (2017)

How students deal with AI -- one thoughtful question at a time
by: Aure, Patrick Adriel H.
Published: (2025)

Visualization for analyzing trajectory-based metaheuristic search algorithms
by: HALIM, Steven, et al.
Published: (2006)

Parameter Learning for Latent Network Diffusion
by: WU, Xiaojian, et al.
Published: (2013)

Closing the data-decisions loop: Deploying artificial intelligence for dynamic resource management
by: VARAKANTHAM, Pradeep
Published: (2020)

Side profile facial recognition using CNN
by: Varthamanan Manisha
Published: (2024)

Decompiling x86 Deep Neural Network executables
by: LIU, Zhibo, et al.
Published: (2023)

Mobile AI-generated content (AIGC) services (Mobile)
by: Yap, Xuan Ying
Published: (2024)

FloTra: Flower-shape trajectory mining for instance-specific parameter tuning
by: LINDAWATI, Lindawati, et al.
Published: (2013)

Probabilistic Inference Based Message-Passing for Resource Constrained DCOPs
by: GHOSH, Supriyo, et al.
Published: (2015)

Intelligent trajectory prediction algorithm design for dynamic obstacles under factory environments
by: Tan, Melvis Min Da
Published: (2024)

ChatGPT's impact
by: Lim, Donald Patrick L.
Published: (2023)

Authentic and insightful use of generative AI
by: Aure, Patrick Adriel H.
Published: (2023)