Grasper: A generalist pursuer for pursuit-evasion problems

Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in Policy-Space Response Oracles (PSRO) to improve...

Full description

Saved in:

Bibliographic Details
Main Authors:	LI, Pengdeng, LI, Shuxin, WANG, Xinrun, CERNY, Jakub, ZHANG, Youzhi, McAleer, Stephen, CHAN, Hau, AN, Bo
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2024
Subjects:	Generalizability Hypernetwork Multi-Agent Learning Pre-training and Fine-tuning Pursuit-Evasion Problems Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing
Online Access:	https://ink.library.smu.edu.sg/sis_research/9130 https://ink.library.smu.edu.sg/context/sis_research/article/10133/viewcontent/p1147_pvoa_cc_by.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-10133
record_format	dspace
spelling	sg-smu-ink.sis_research-101332024-08-01T09:29:54Z Grasper: A generalist pursuer for pursuit-evasion problems LI, Pengdeng LI, Shuxin WANG, Xinrun CERNY, Jakub ZHANG, Youzhi McAleer, Stephen CHAN, Hau AN, Bo Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in Policy-Space Response Oracles (PSRO) to improve scalability in solving large-scale PEGs. However, these methods primarily focus on specific PEGs with fixed initial conditions that may vary substantially in real-world scenarios, which significantly hinders the applicability of the traditional methods. To address this issue, we introduce Grasper, a GeneRAlist purSuer for Pursuit-Evasion pRoblems, capable of efficiently generating pursuer policies tailored to specific PEGs. Our contributions are threefold: First, we present a novel architecture that offers high-quality solutions for diverse PEGs, comprising critical components such as (i) a graph neural network (GNN) to encode PEGs into hidden vectors, and (ii) a hypernetwork to generate pursuer policies based on these hidden vectors. As a second contribution, we develop an efficient three-stage training method involving (i) a pre-pretraining stage for learning robust PEG representations through self-supervised graph learning techniques like graph masked auto-encoder (Graph-MAE), (ii) a pre-training stage utilizing heuristic-guided multi-task pre-training (HMP) where heuristic-derived reference policies (e.g., through Dijkstra’s algorithm) regularize pursuer policies, and (iii) a fine-tuning stage that employs PSRO to generate pursuer policies on designated PEGs. Finally, we perform extensive experiments on synthetic and real-world maps, showcasing Grasper’s significant superiority over baselines in terms of solution quality and generalizability. We demonstrate that Grasper provides a versatile approach for solving pursuit-evasion problems across a broad range of scenarios, enabling practical deployment in real-world situations. 2024-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9130 https://ink.library.smu.edu.sg/context/sis_research/article/10133/viewcontent/p1147_pvoa_cc_by.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Generalizability Hypernetwork Multi-Agent Learning Pre-training and Fine-tuning Pursuit-Evasion Problems Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Generalizability Hypernetwork Multi-Agent Learning Pre-training and Fine-tuning Pursuit-Evasion Problems Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing
spellingShingle	Generalizability Hypernetwork Multi-Agent Learning Pre-training and Fine-tuning Pursuit-Evasion Problems Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing LI, Pengdeng LI, Shuxin WANG, Xinrun CERNY, Jakub ZHANG, Youzhi McAleer, Stephen CHAN, Hau AN, Bo Grasper: A generalist pursuer for pursuit-evasion problems
description	Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in Policy-Space Response Oracles (PSRO) to improve scalability in solving large-scale PEGs. However, these methods primarily focus on specific PEGs with fixed initial conditions that may vary substantially in real-world scenarios, which significantly hinders the applicability of the traditional methods. To address this issue, we introduce Grasper, a GeneRAlist purSuer for Pursuit-Evasion pRoblems, capable of efficiently generating pursuer policies tailored to specific PEGs. Our contributions are threefold: First, we present a novel architecture that offers high-quality solutions for diverse PEGs, comprising critical components such as (i) a graph neural network (GNN) to encode PEGs into hidden vectors, and (ii) a hypernetwork to generate pursuer policies based on these hidden vectors. As a second contribution, we develop an efficient three-stage training method involving (i) a pre-pretraining stage for learning robust PEG representations through self-supervised graph learning techniques like graph masked auto-encoder (Graph-MAE), (ii) a pre-training stage utilizing heuristic-guided multi-task pre-training (HMP) where heuristic-derived reference policies (e.g., through Dijkstra’s algorithm) regularize pursuer policies, and (iii) a fine-tuning stage that employs PSRO to generate pursuer policies on designated PEGs. Finally, we perform extensive experiments on synthetic and real-world maps, showcasing Grasper’s significant superiority over baselines in terms of solution quality and generalizability. We demonstrate that Grasper provides a versatile approach for solving pursuit-evasion problems across a broad range of scenarios, enabling practical deployment in real-world situations.
format	text
author	LI, Pengdeng LI, Shuxin WANG, Xinrun CERNY, Jakub ZHANG, Youzhi McAleer, Stephen CHAN, Hau AN, Bo
author_facet	LI, Pengdeng LI, Shuxin WANG, Xinrun CERNY, Jakub ZHANG, Youzhi McAleer, Stephen CHAN, Hau AN, Bo
author_sort	LI, Pengdeng
title	Grasper: A generalist pursuer for pursuit-evasion problems
title_short	Grasper: A generalist pursuer for pursuit-evasion problems
title_full	Grasper: A generalist pursuer for pursuit-evasion problems
title_fullStr	Grasper: A generalist pursuer for pursuit-evasion problems
title_full_unstemmed	Grasper: A generalist pursuer for pursuit-evasion problems
title_sort	grasper: a generalist pursuer for pursuit-evasion problems
publisher	Institutional Knowledge at Singapore Management University
publishDate	2024
url	https://ink.library.smu.edu.sg/sis_research/9130 https://ink.library.smu.edu.sg/context/sis_research/article/10133/viewcontent/p1147_pvoa_cc_by.pdf
_version_	1814047750583484416

Grasper: A generalist pursuer for pursuit-evasion problems

Similar Items