Optimizing expectation with guarantees in POMDPs

A standard objective in partially-observable Markov decision processes (POMDPs) is to find a policy that maximizes the expected discounted-sum payoff. However, such policies may still permit unlikely but highly undesirable outcomes, which is especially problematic in safety-critical applications. Re...

Bibliographic Details
Main Authors:	CHATTERJEE, Krishnendu; PEREZ, Guillermo A.; RASKIN, Jean-François; ZIKELIC, Dorde
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2017
Subjects:	Artificial Intelligence and Robotics
Online Access:	https://ink.library.smu.edu.sg/sis_research/9071 https://ink.library.smu.edu.sg/context/sis_research/article/10074/viewcontent/11046_13_14574_1_2_20201228.pdf

Similar Items

Solving long-run average reward robust MDPs via stochastic games
by: CHATTERJEE, Krishnendu, et al.
Published: (2024)

Anytime Planning for Decentralized POMDPs using Expectation Maximization
by: KUMAR, Akshat, et al.
Published: (2010)

Sound and complete witnesses for template-based verification of LTL properties on polynomial programs
by: CHATTERJEE, Krishnendu, et al.
Published: (2024)

Networked Distributed POMDPs: A Synthesis of Distributed Constraint Optimization and POMDPs
by: NAIR, Ranjit, et al.
Published: (2005)

Automated Generation of Interaction Graphs for Value-Factored Decentralized POMDPs
by: YEOH, William, et al.
Published: (2013)

Certified policy verification and synthesis for MDPs under distributional reach-avoidance properties
by: AKSHAY, S., et al.
Published: (2024)

Towards Efficient Computation of Quality Bounded Solutions in POMDPs: Expected Value Approximation and Dynamic Disjunctive Beliefs
by: VARAKANTHAM, Pradeep Reddy, et al.
Published: (2007)

Reachability Poorman discrete-bidding games
by: AVNI, Guy, et al.
Published: (2023)

Scalable verification of quantized neural networks
by: HENZINGER, Thomas A., et al.
Published: (2021)

Message-Passing Algorithms for Large Structured Decentralized POMDPs
by: KUMAR, Akshat, et al.
Published: (2011)

Offline RL with discrete proxy representations for generalizability in POMDPs
by: GU, Pengjie, et al.
Published: (2023)

Constraint-Based Dynamic Programming for Decentralized POMDPs with Structured Interactions
by: KUMAR, Akshat, et al.
Published: (2009)

Implementation Techniques for Solving POMDPs in Personal Assistant Domains
by: VARAKANTHAM, Pradeep Reddy, et al.
Published: (2006)

Winning back the CUP for Distributed POMDPs: Planning over continuous belief spaces
by: VARAKANTHAM, Pradeep, et al.
Published: (2006)

Letting loose a SPIDER on a network of POMDPs: Generating quality guaranteed policies
by: VARAKANTHAM, Pradeep Reddy, et al.
Published: (2007)

Dual formulations for optimizing Dec-POMDP controllers
by: Akshat KUMAR, et al.
Published: (2016)

Learning control policies for stochastic systems with reach-avoid guarantees
by: ZIKELIC, Dorde, et al.
Published: (2023)

Prioritized Shaping of Models for Solving DEC-POMDPs
by: VARAKANTHAM, Pradeep Reddy, et al.
Published: (2012)

Introducing Communication in Dis-POMDPs with Locality of Interaction
by: TASAKI, Makoto, et al.
Published: (2010)

A POMDP model for guiding taxi cruising in a congested urban city
by: AGUSSURJA, Lucas, et al.
Published: (2011)

Scalable distributional robustness in a class of non convex optimization with guarantees
by: BOSE, Avinandan, et al.
Published: (2022)

Sample-efficient iterative lower bound optimization of deep reactive policies for planning in continuous MDPs
by: LOW, Siow Meng, et al.
Published: (2022)

Distributed Model Shaping for Scaling to Decentralized POMDPs with hundreds of agents
by: VELAGAPUDI, Prasanna, et al.
Published: (2011)

CAPIR: Collaborative action planning with intention recognition
by: Nguyen T., et al.
Published: (2011)

Compositional policy learning in stochastic control systems with formal guarantees
by: ZIKELIC, Dorde, et al.
Published: (2024)

Inferring door locations from a teammate's trajectory in stealth human-robot team operations
by: OH, Jean, et al.
Published: (2015)

Trust oriented decision making via POMDPs
by: Aravazhi Irissappane, Athirai
Published: (2016)

A balanced view of artificial intelligence
by: Ngo, Courtney Anne M.
Published: (2018)

Can we regulate artificial intelligence?
by: Javier, Cholo E.
Published: (2024)

How to think like an AI
by: Lugtu, Reynaldo C., Jr.
Published: (2024)

Is AI making us smarter or dumber?
by: Lugtu, Reynaldo C.
Published: (2024)

The rise of generation AI
by: Lugtu, Reynaldo C., Jr.
Published: (2025)

Teaching use of AI with meta-reflections
by: Aure, Patrick Adriel H.
Published: (2023)

Adaptive decision support for structured organizations: A case for OrgPOMDPs
by: VARAKANTHAM, Pradeep Reddy, et al.
Published: (2011)

Exploiting Belief Bounds: Practical POMDPs for Personal Assistant Agents
by: VARAKANTHAM, Pradeep, et al.
Published: (2005)

ChatGPT's impact
by: Lim, Donald Patrick L.
Published: (2023)

Authentic and insightful use of generative AI
by: Aure, Patrick Adriel H.
Published: (2023)

Vaccinating against the AI chatbot hype
by: Teehankee, Benito L.
Published: (2024)

An approach for self-training audio event detectors using web data
by: ELIZALDE, Benjamin, et al.
Published: (2017)

The wonders of the spreadsheet tool for data management and insights
by: CHEONG, Michelle L. F.
Published: (2017)