Reinforcement learning for zone based multiagent pathfinding under uncertainty

We address the problem of multiple agents finding their paths from respective sources to destination nodes in a graph (also called MAPF). Most existing approaches assume that all agents move at fixed speed, and that a single node accommodates only a single agent. Motivated by the emerging applicatio...

Full description

Saved in:

Bibliographic Details
Main Authors:	LING, Jiajing, GUPTA, Tarun, KUMAR, Akshat
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2020
Subjects:	Autonomous agents Functions Multi agent systems Reinforcement learning Scheduling Traffic congestion Travel time Artificial Intelligence and Robotics Theory and Algorithms
Online Access:	https://ink.library.smu.edu.sg/sis_research/5963 https://ink.library.smu.edu.sg/context/sis_research/article/6966/viewcontent/Reinforcement_Learning_for_Zone_Based_Multiagent_Pathfinding_under_Uncertainty.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-6966
record_format	dspace
spelling	sg-smu-ink.sis_research-69662021-05-25T06:27:01Z Reinforcement learning for zone based multiagent pathfinding under uncertainty LING, Jiajing GUPTA, Tarun KUMAR, Akshat We address the problem of multiple agents finding their paths from respective sources to destination nodes in a graph (also called MAPF). Most existing approaches assume that all agents move at fixed speed, and that a single node accommodates only a single agent. Motivated by the emerging applications of autonomous vehicles such as drone traffic management, we present zone-based path finding (or ZBPF) where agents move among zones, and agents' movements require uncertain travel time. Furthermore, each zone can accommodate multiple agents (as per its capacity). We also develop a simulator for ZBPF which provides a clean interface from the simulation environment to learning algorithms. We develop a novel formulation of the ZBPF problem using difference-of-convex functions (DC) programming. The resulting approach can be used for policy learning using samples from the simulator. We also present a multiagent credit assignment scheme that helps our learning approach converge faster. Empirical results in a number of 2D and 3D instances show that our approach can effectively minimize congestion in zones, while ensuring agents reach their final destinations. 2020-10-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5963 https://ink.library.smu.edu.sg/context/sis_research/article/6966/viewcontent/Reinforcement_Learning_for_Zone_Based_Multiagent_Pathfinding_under_Uncertainty.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Autonomous agents Functions Multi agent systems Reinforcement learning Scheduling Traffic congestion Travel time Artificial Intelligence and Robotics Theory and Algorithms
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Autonomous agents Functions Multi agent systems Reinforcement learning Scheduling Traffic congestion Travel time Artificial Intelligence and Robotics Theory and Algorithms
spellingShingle	Autonomous agents Functions Multi agent systems Reinforcement learning Scheduling Traffic congestion Travel time Artificial Intelligence and Robotics Theory and Algorithms LING, Jiajing GUPTA, Tarun KUMAR, Akshat Reinforcement learning for zone based multiagent pathfinding under uncertainty
description	We address the problem of multiple agents finding their paths from respective sources to destination nodes in a graph (also called MAPF). Most existing approaches assume that all agents move at fixed speed, and that a single node accommodates only a single agent. Motivated by the emerging applications of autonomous vehicles such as drone traffic management, we present zone-based path finding (or ZBPF) where agents move among zones, and agents' movements require uncertain travel time. Furthermore, each zone can accommodate multiple agents (as per its capacity). We also develop a simulator for ZBPF which provides a clean interface from the simulation environment to learning algorithms. We develop a novel formulation of the ZBPF problem using difference-of-convex functions (DC) programming. The resulting approach can be used for policy learning using samples from the simulator. We also present a multiagent credit assignment scheme that helps our learning approach converge faster. Empirical results in a number of 2D and 3D instances show that our approach can effectively minimize congestion in zones, while ensuring agents reach their final destinations.
format	text
author	LING, Jiajing GUPTA, Tarun KUMAR, Akshat
author_facet	LING, Jiajing GUPTA, Tarun KUMAR, Akshat
author_sort	LING, Jiajing
title	Reinforcement learning for zone based multiagent pathfinding under uncertainty
title_short	Reinforcement learning for zone based multiagent pathfinding under uncertainty
title_full	Reinforcement learning for zone based multiagent pathfinding under uncertainty
title_fullStr	Reinforcement learning for zone based multiagent pathfinding under uncertainty
title_full_unstemmed	Reinforcement learning for zone based multiagent pathfinding under uncertainty
title_sort	reinforcement learning for zone based multiagent pathfinding under uncertainty
publisher	Institutional Knowledge at Singapore Management University
publishDate	2020
url	https://ink.library.smu.edu.sg/sis_research/5963 https://ink.library.smu.edu.sg/context/sis_research/article/6966/viewcontent/Reinforcement_Learning_for_Zone_Based_Multiagent_Pathfinding_under_Uncertainty.pdf
_version_	1770575706861862912

Reinforcement learning for zone based multiagent pathfinding under uncertainty

Similar Items