FlowPG: Action-constrained policy gradient with normalizing flows

Action-constrained reinforcement learning (ACRL) is a popular approach for solving safety-critical and resource-allocation decision-making problems. A major challenge in ACRL is ensuring that the agent takes a valid action that satisfies the constraints at each RL step. The commonly used approach of using a pr...

Bibliographic Details
Main Authors:	BRAHMANAGE JANAKA CHATHURANGA THILAKARATHNA; LING, Jiajing; KUMAR, Akshat
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University, 2023
Subjects:	Artificial Intelligence and Robotics; Databases and Information Systems
Online Access:	https://ink.library.smu.edu.sg/sis_research/8551 https://ink.library.smu.edu.sg/context/sis_research/article/9554/viewcontent/11351_flowpg_action_constrained_poli.pdf
Institution:	Singapore Management University

Similar Items

Automata-guided control-flow-sensitive fuzz driver generation
by: ZHANG, Cen, et al.
Published: (2023)

Sample-efficient iterative lower bound optimization of deep reactive policies for planning in continuous MDPs
by: LOW, Siow Meng, et al.
Published: (2022)

Probabilistic Inference Based Message-Passing for Resource Constrained DCOPs
by: GHOSH, Supriyo, et al.
Published: (2015)

Constrained multiagent reinforcement learning for large agent population
by: LING, Jiajing, et al.
Published: (2022)

Towards gradient-based time-series explanations through a spatiotemporal attention network
by: LEE, Min Hun
Published: (2024)

CAPIR: Collaborative action planning with intention recognition
by: Nguyen T., et al.
Published: (2011)

Constrained multiagent reinforcement learning for large agent population
by: LING, Jiajing, et al.
Published: (2023)

Parameter Learning for Latent Network Diffusion
by: WU, Xiaojian, et al.
Published: (2013)

Handling long and richly constrained tasks through constrained hierarchical reinforcement learning
by: LU, Yuxiao, et al.
Published: (2024)

Constrained reinforcement learning in hard exploration problems
by: PATHMANATHAN, Pankayaraj, et al.
Published: (2023)

Accurate generation of trigger-action programs with domain-adapted sequence-to-sequence learning
by: IMAM NUR BANI YUSUF, et al.
Published: (2022)

FireEye: Cybersecurity in action
by: Singapore Management University
Published: (2021)

Using constraint programming and graph representation learning for generating interpretable cloud security policies
by: KAZDAGLI, Mikhail, et al.
Published: (2022)

Improving patient flow in emergency department through dynamic priority queue
by: TAN, Kar Way, et al.
Published: (2012)

Collective Diffusion Over Networks: Models and Inference
by: KUMAR, Akshat, et al.
Published: (2013)

Automated Generation of Interaction Graphs for Value-Factored Decentralized POMDPs
by: YEOH, William, et al.
Published: (2013)

Lagrangian Relaxation Techniques for Scalable Spatial Conservation Planning
by: KUMAR, Akshat, et al.
Published: (2012)

H-DPOP: Using Hard Constraints for Search Space Pruning in DCOP
by: KUMAR, Akshat, et al.
Published: (2008)

Probabilistic Inference Techniques for Scalable Multiagent Decision Making
by: Akshat KUMAR, et al.
Published: (2015)

Building action sets in a deep reinforcement learner
by: WANG, Yongzhao, et al.
Published: (2021)

Learning and Controlling Network Diffusion in Dependent Cascade Models
by: DU, Jiali, et al.
Published: (2015)

Robust decision making for stochastic network design
by: Akshat KUMAR, et al.
Published: (2016)

Optimization Approaches for Solving Chance Constrained Stochastic Orienteering Problems
by: VARAKANTHAM, Pradeep, et al.
Published: (2013)

A balanced view of artificial intelligence
by: Ngo, Courtney Anne M.
Published: (2018)

Can we regulate artificial intelligence?
by: Javier, Cholo E.
Published: (2024)

How to think like an AI
by: Lugtu, Reynaldo C., Jr.
Published: (2024)

Is AI making us smarter or dumber?
by: Lugtu, Reynaldo C.
Published: (2024)

The rise of generation AI
by: Lugtu, Reynaldo C., Jr.
Published: (2025)

Teaching use of AI with meta-reflections
by: Aure, Patrick Adriel H.
Published: (2023)

Generative flows with invertible attentions
by: SUKTHANKER, Rhea Sanjay, et al.
Published: (2022)

Imitating cost-constrained behaviors in reinforcement learning
by: SHAO, Qian, et al.
Published: (2024)

Influence Diagrams With Memory States: Representation and Algorithms
by: WU, Xiaojian, et al.
Published: (2011)

An approach for self-training audio event detectors using web data
by: ELIZALDE, Benjamin, et al.
Published: (2017)

An Artificial Immune System based Approach for English Grammar Correction
by: KUMAR, Akshat, et al.
Published: (2007)

Collective multiagent sequential decision making under uncertainty
by: NGUYEN, Duc Thien, et al.
Published: (2017)

ChatGPT's impact
by: Lim, Donald Patrick L.
Published: (2023)

Authentic and insightful use of generative AI
by: Aure, Patrick Adriel H.
Published: (2023)

Vaccinating against the AI chatbot hype
by: Teehankee, Benito L.
Published: (2024)

Approximate Inference in Collective Graphical Models
by: SHELDON, Daniel, et al.
Published: (2013)

Message-Passing Algorithms for Quadratic Programming Formulations of MAP Estimation
by: KUMAR, Akshat, et al.
Published: (2011)