Learning to send reinforcements: Coordinating multi-agent dynamic police patrol dispatching and rescheduling via reinforcement learning

We address the problem of coordinating multiple agents in a dynamic police patrol scheduling via a Reinforcement Learning (RL) approach. Our approach utilizes Multi-Agent Value Function Approximation (MAVFA) with a rescheduling heuristic to learn dispatching and rescheduling policies jointly. Often,...

Full description

Saved in:

Bibliographic Details
Main Authors:	JOE, Waldy, LAU, Hoong Chuin
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2023
Subjects:	Agent-based and Multi-agent Systems Multi-agent learning Planning and Scheduling Learning in planning and scheduling police agents Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering Theory and Algorithms
Online Access:	https://ink.library.smu.edu.sg/sis_research/8103 https://ink.library.smu.edu.sg/context/sis_research/article/9106/viewcontent/SendReinforcements_pvoa.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-9106
record_format	dspace
spelling	sg-smu-ink.sis_research-91062023-09-07T07:15:22Z Learning to send reinforcements: Coordinating multi-agent dynamic police patrol dispatching and rescheduling via reinforcement learning JOE, Waldy LAU, Hoong Chuin We address the problem of coordinating multiple agents in a dynamic police patrol scheduling via a Reinforcement Learning (RL) approach. Our approach utilizes Multi-Agent Value Function Approximation (MAVFA) with a rescheduling heuristic to learn dispatching and rescheduling policies jointly. Often, police operations are divided into multiple sectors for more effective and efficient operations. In a dynamic setting, incidents occur throughout the day across different sectors, disrupting initially-planned patrol schedules. To maximize policing effectiveness, police agents from different sectors cooperate by sending reinforcements to support one another in their incident response and even routine patrol. This poses an interesting research challenge on how to make such complex decision of dispatching and rescheduling involving multiple agents in a coordinated fashion within an operationally reasonable time. Unlike existing Multi-Agent RL (MARL) approaches which solve similar problems by either decomposing the problem or action into multiple components, our approach learns the dispatching and rescheduling policies jointly without any decomposition step. In addition, instead of directly searching over the joint action space, we incorporate an iterative best response procedure as a decentralized optimization heuristic and an explicit coordination mechanism for a scalable and coordinated decision-making. We evaluate our approach against the commonly adopted two-stage approach and conduct a series of ablation studies to ascertain the effectiveness of our proposed learning and coordination mechanisms. 2023-08-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/8103 info:doi/10.24963/ijcai.2023/18 https://ink.library.smu.edu.sg/context/sis_research/article/9106/viewcontent/SendReinforcements_pvoa.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Agent-based and Multi-agent Systems Multi-agent learning Planning and Scheduling Learning in planning and scheduling police agents Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering Theory and Algorithms
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Agent-based and Multi-agent Systems Multi-agent learning Planning and Scheduling Learning in planning and scheduling police agents Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering Theory and Algorithms
spellingShingle	Agent-based and Multi-agent Systems Multi-agent learning Planning and Scheduling Learning in planning and scheduling police agents Artificial Intelligence and Robotics Operations Research, Systems Engineering and Industrial Engineering Theory and Algorithms JOE, Waldy LAU, Hoong Chuin Learning to send reinforcements: Coordinating multi-agent dynamic police patrol dispatching and rescheduling via reinforcement learning
description	We address the problem of coordinating multiple agents in a dynamic police patrol scheduling via a Reinforcement Learning (RL) approach. Our approach utilizes Multi-Agent Value Function Approximation (MAVFA) with a rescheduling heuristic to learn dispatching and rescheduling policies jointly. Often, police operations are divided into multiple sectors for more effective and efficient operations. In a dynamic setting, incidents occur throughout the day across different sectors, disrupting initially-planned patrol schedules. To maximize policing effectiveness, police agents from different sectors cooperate by sending reinforcements to support one another in their incident response and even routine patrol. This poses an interesting research challenge on how to make such complex decision of dispatching and rescheduling involving multiple agents in a coordinated fashion within an operationally reasonable time. Unlike existing Multi-Agent RL (MARL) approaches which solve similar problems by either decomposing the problem or action into multiple components, our approach learns the dispatching and rescheduling policies jointly without any decomposition step. In addition, instead of directly searching over the joint action space, we incorporate an iterative best response procedure as a decentralized optimization heuristic and an explicit coordination mechanism for a scalable and coordinated decision-making. We evaluate our approach against the commonly adopted two-stage approach and conduct a series of ablation studies to ascertain the effectiveness of our proposed learning and coordination mechanisms.
format	text
author	JOE, Waldy LAU, Hoong Chuin
author_facet	JOE, Waldy LAU, Hoong Chuin
author_sort	JOE, Waldy
title	Learning to send reinforcements: Coordinating multi-agent dynamic police patrol dispatching and rescheduling via reinforcement learning
title_short	Learning to send reinforcements: Coordinating multi-agent dynamic police patrol dispatching and rescheduling via reinforcement learning
title_full	Learning to send reinforcements: Coordinating multi-agent dynamic police patrol dispatching and rescheduling via reinforcement learning
title_fullStr	Learning to send reinforcements: Coordinating multi-agent dynamic police patrol dispatching and rescheduling via reinforcement learning
title_full_unstemmed	Learning to send reinforcements: Coordinating multi-agent dynamic police patrol dispatching and rescheduling via reinforcement learning
title_sort	learning to send reinforcements: coordinating multi-agent dynamic police patrol dispatching and rescheduling via reinforcement learning
publisher	Institutional Knowledge at Singapore Management University
publishDate	2023
url	https://ink.library.smu.edu.sg/sis_research/8103 https://ink.library.smu.edu.sg/context/sis_research/article/9106/viewcontent/SendReinforcements_pvoa.pdf
_version_	1779157155159474176

Learning to send reinforcements: Coordinating multi-agent dynamic police patrol dispatching and rescheduling via reinforcement learning

Similar Items