FlowPG: Action-constrained policy gradient with normalizing flows

FlowPG: Action-constrained policy gradient with normalizing flows

Action-constrained reinforcement learning (ACRL) is a popular approach for solving safety-critical and resource-allocation related decision making problems. A major challenge in ACRL is to ensure agent taking a valid action satisfying constraints in each RL step. Commonly used approach of using a pr...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلفون الرئيسيون:	BRAHMANAGE JANAKA CHATHURANGA THILAKARATHNA, LING, Jiajing, KUMAR, Akshat
التنسيق:	text
اللغة:	English
منشور في:	Institutional Knowledge at Singapore Management University 2023
الموضوعات:	Artificial Intelligence and Robotics Databases and Information Systems
الوصول للمادة أونلاين:	https://ink.library.smu.edu.sg/sis_research/8551 https://ink.library.smu.edu.sg/context/sis_research/article/9554/viewcontent/11351_flowpg_action_constrained_poli.pdf
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!
المؤسسة:	Singapore Management University
اللغة:	English

مواد مشابهة

Automata-guided control-flow-sensitive fuzz driver generation
بواسطة: ZHANG, Cen, وآخرون
منشور في: (2023)

Sample-efficient iterative lower bound optimization of deep reactive policies for planning in continuous MDPs
بواسطة: LOW, Siow Meng, وآخرون
منشور في: (2022)

Probabilistic Inference Based Message-Passing for Resource Constrained DCOPs
بواسطة: GHOSH, Supriyo, وآخرون
منشور في: (2015)

Constrained multiagent reinforcement learning for large agent population
بواسطة: LING, Jiajing, وآخرون
منشور في: (2022)

Towards gradient-based time-series explanations through a spatiotemporal attention network
بواسطة: LEE, Min Hun
منشور في: (2024)

CAPIR: Collaborative action planning with intention recognition
بواسطة: Nguyen T.,, وآخرون
منشور في: (2011)

Constrained multiagent reinforcement learning for large agent population
بواسطة: LING, Jiajing, وآخرون
منشور في: (2023)

Parameter Learning for Latent Network Diffusion
بواسطة: WU, Xiaojian, وآخرون
منشور في: (2013)

Handling long and richly constrained tasks through constrained hierarchical reinforcement learning
بواسطة: LU, Yuxiao, وآخرون
منشور في: (2024)

Constrained reinforcement learning in hard exploration problems
بواسطة: PATHMANATHAN, Pankayaraj, وآخرون
منشور في: (2023)

Accurate generation of trigger-action programs with domain-adapted sequence-to-sequence learning
بواسطة: IMAM NUR BANI YUSUF,, وآخرون
منشور في: (2022)

FireEye: Cybersecurity in action
بواسطة: Singapore Management University
منشور في: (2021)

Using constraint programming and graph representation learning for generating interpretable cloud security policies
بواسطة: KAZDAGLI, Mikhail, وآخرون
منشور في: (2022)

Improving patient flow in emergency department through dynamic priority queue
بواسطة: TAN, Kar Way, وآخرون
منشور في: (2012)

Collective Diffusion Over Networks: Models and Inference
بواسطة: KUMAR, Akshat, وآخرون
منشور في: (2013)

Automated Generation of Interaction Graphs for Value-Factored Decentralized POMDPs
بواسطة: YEOH, William, وآخرون
منشور في: (2013)

Lagrangian Relaxation Techniques for Scalable Spatial Conservation Planning
بواسطة: KUMAR, Akshat, وآخرون
منشور في: (2012)

H-DPOP: Using Hard Constraints for Search Space Pruning in DCOP
بواسطة: KUMAR, Akshat, وآخرون
منشور في: (2008)

Probabilistic Inference Techniques for Scalable Multiagent Decision Making
بواسطة: Akshat KUMAR,, وآخرون
منشور في: (2015)

Building action sets in a deep reinforcement learner
بواسطة: WANG, Yongzhao, وآخرون
منشور في: (2021)

Learning and Controlling Network Diffusion in Dependent Cascade Models
بواسطة: DU, Jiali, وآخرون
منشور في: (2015)

Robust decision making for stochastic network design
بواسطة: Akshat KUMAR,, وآخرون
منشور في: (2016)

Optimization Approaches for Solving Chance Constrained Stochastic Orienteering Problems
بواسطة: VARAKANTHAM, Pradeep, وآخرون
منشور في: (2013)

A balanced view of artificial intelligence
بواسطة: Ngo, Courtney Anne M.
منشور في: (2018)

Can we regulate artificial intelligence?
بواسطة: Javier, Cholo E.
منشور في: (2024)

How to think like an AI
بواسطة: Lugtu, Reynaldo C., Jr.
منشور في: (2024)

Is AI making us smarter or dumber?
بواسطة: Lugtu, Reynaldo C.
منشور في: (2024)

The rise of generation AI
بواسطة: Lugtu, Reynaldo C., Jr.
منشور في: (2025)

Teaching use of AI with meta-reflections
بواسطة: Aure, Patrick Adriel H.
منشور في: (2023)

Generative flows with invertible attentions
بواسطة: SUKTHANKER, Rhea Sanjay, وآخرون
منشور في: (2022)

Imitating cost-constrained behaviors in reinforcement learning
بواسطة: SHAO, Qian, وآخرون
منشور في: (2024)

Influence Diagrams With Memory States: Representation and Algorithms
بواسطة: WU, Xiaojian, وآخرون
منشور في: (2011)

An approach for self-training audio event detectors using web data
بواسطة: ELIZALDE, Benjamin, وآخرون
منشور في: (2017)

An Artificial Immune System based Approach for English Grammar Correction
بواسطة: KUMAR, Akshat, وآخرون
منشور في: (2007)

ChatGPT's impact
بواسطة: Lim, Donald Patrick L.
منشور في: (2023)

Authentic and insightful use of generative AI
بواسطة: Aure, Patrick Adriel H.
منشور في: (2023)

Vaccinating against the AI chatbot hype
بواسطة: Teehankee, Benito L.
منشور في: (2024)

Collective multiagent sequential decision making under uncertainty
بواسطة: NGUYEN, Duc Thien, وآخرون
منشور في: (2017)

Approximate Inference in Collective Graphical Models
بواسطة: SHELDON, Daniel, وآخرون
منشور في: (2013)

Message-Passing Algorithms for Quadratic Programming Formulations of MAP Estimation
بواسطة: KUMAR, Akshat, وآخرون
منشور في: (2011)