Policy gradient with value function approximation for collective multiagent planning
Decentralized (PO)MDPs provide an expressive framework for sequential decision making in a multiagent system. Given their computational complexity, recent research has focused on tractable yet practical subclasses of Dec-POMDPs. We address such a subclass called CDec-POMDP where the collective behav...
Main Authors: NGUYEN, Duc Thien; KUMAR, Akshat; LAU, Hoong Chuin
Format: text
Language: English
Published: Institutional Knowledge at Singapore Management University, 2017
Online Access: https://ink.library.smu.edu.sg/sis_research/3871
https://ink.library.smu.edu.sg/context/sis_research/article/4873/viewcontent/7019_policy_gradient_with_value_function_approximation_for_collective_multiagent_planning.pdf
Institution: Singapore Management University
Similar Items
- Credit assignment for collective multiagent RL with global rewards
  by: NGUYEN, Duc Thien, et al. Published: (2018)
- Collective multiagent sequential decision making under uncertainty
  by: NGUYEN, Duc Thien, et al. Published: (2017)
- Explaining sequences of actions in multi-agent deep reinforcement learning models
  by: KHAING, Phyo Wai, et al. Published: (2024)
- Benchmarking MARL on long horizon sequential multi-objective tasks
  by: GENG, Minghong, et al. Published: (2024)
- Hierarchical multiagent reinforcement learning for maritime traffic management
  by: SINGH, Arambam James, et al. Published: (2020)