Grasper: A generalist pursuer for pursuit-evasion problems

Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in Policy-Space Response Oracles (PSRO) to improve...

Full description

Saved in:
Bibliographic Details
Main Authors: LI, Pengdeng, LI, Shuxin, WANG, Xinrun, CERNY, Jakub, ZHANG, Youzhi, McAleer, Stephen, CHAN, Hau, AN, Bo
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2024
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/9130
https://ink.library.smu.edu.sg/context/sis_research/article/10133/viewcontent/p1147_pvoa_cc_by.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-10133
record_format dspace
spelling sg-smu-ink.sis_research-101332024-08-01T09:29:54Z Grasper: A generalist pursuer for pursuit-evasion problems LI, Pengdeng LI, Shuxin WANG, Xinrun CERNY, Jakub ZHANG, Youzhi McAleer, Stephen CHAN, Hau AN, Bo Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in Policy-Space Response Oracles (PSRO) to improve scalability in solving large-scale PEGs. However, these methods primarily focus on specific PEGs with fixed initial conditions that may vary substantially in real-world scenarios, which significantly hinders the applicability of the traditional methods. To address this issue, we introduce Grasper, a GeneRAlist purSuer for Pursuit-Evasion pRoblems, capable of efficiently generating pursuer policies tailored to specific PEGs. Our contributions are threefold: First, we present a novel architecture that offers high-quality solutions for diverse PEGs, comprising critical components such as (i) a graph neural network (GNN) to encode PEGs into hidden vectors, and (ii) a hypernetwork to generate pursuer policies based on these hidden vectors. As a second contribution, we develop an efficient three-stage training method involving (i) a pre-pretraining stage for learning robust PEG representations through self-supervised graph learning techniques like graph masked auto-encoder (Graph-MAE), (ii) a pre-training stage utilizing heuristic-guided multi-task pre-training (HMP) where heuristic-derived reference policies (e.g., through Dijkstra’s algorithm) regularize pursuer policies, and (iii) a fine-tuning stage that employs PSRO to generate pursuer policies on designated PEGs. Finally, we perform extensive experiments on synthetic and real-world maps, showcasing Grasper’s significant superiority over baselines in terms of solution quality and generalizability. We demonstrate that Grasper provides a versatile approach for solving pursuit-evasion problems across a broad range of scenarios, enabling practical deployment in real-world situations. 2024-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9130 https://ink.library.smu.edu.sg/context/sis_research/article/10133/viewcontent/p1147_pvoa_cc_by.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Generalizability Hypernetwork Multi-Agent Learning Pre-training and Fine-tuning Pursuit-Evasion Problems Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Generalizability
Hypernetwork
Multi-Agent Learning
Pre-training and Fine-tuning
Pursuit-Evasion Problems
Artificial Intelligence and Robotics
Numerical Analysis and Scientific Computing
spellingShingle Generalizability
Hypernetwork
Multi-Agent Learning
Pre-training and Fine-tuning
Pursuit-Evasion Problems
Artificial Intelligence and Robotics
Numerical Analysis and Scientific Computing
LI, Pengdeng
LI, Shuxin
WANG, Xinrun
CERNY, Jakub
ZHANG, Youzhi
McAleer, Stephen
CHAN, Hau
AN, Bo
Grasper: A generalist pursuer for pursuit-evasion problems
description Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in Policy-Space Response Oracles (PSRO) to improve scalability in solving large-scale PEGs. However, these methods primarily focus on specific PEGs with fixed initial conditions that may vary substantially in real-world scenarios, which significantly hinders the applicability of the traditional methods. To address this issue, we introduce Grasper, a GeneRAlist purSuer for Pursuit-Evasion pRoblems, capable of efficiently generating pursuer policies tailored to specific PEGs. Our contributions are threefold: First, we present a novel architecture that offers high-quality solutions for diverse PEGs, comprising critical components such as (i) a graph neural network (GNN) to encode PEGs into hidden vectors, and (ii) a hypernetwork to generate pursuer policies based on these hidden vectors. As a second contribution, we develop an efficient three-stage training method involving (i) a pre-pretraining stage for learning robust PEG representations through self-supervised graph learning techniques like graph masked auto-encoder (Graph-MAE), (ii) a pre-training stage utilizing heuristic-guided multi-task pre-training (HMP) where heuristic-derived reference policies (e.g., through Dijkstra’s algorithm) regularize pursuer policies, and (iii) a fine-tuning stage that employs PSRO to generate pursuer policies on designated PEGs. Finally, we perform extensive experiments on synthetic and real-world maps, showcasing Grasper’s significant superiority over baselines in terms of solution quality and generalizability. We demonstrate that Grasper provides a versatile approach for solving pursuit-evasion problems across a broad range of scenarios, enabling practical deployment in real-world situations.
format text
author LI, Pengdeng
LI, Shuxin
WANG, Xinrun
CERNY, Jakub
ZHANG, Youzhi
McAleer, Stephen
CHAN, Hau
AN, Bo
author_facet LI, Pengdeng
LI, Shuxin
WANG, Xinrun
CERNY, Jakub
ZHANG, Youzhi
McAleer, Stephen
CHAN, Hau
AN, Bo
author_sort LI, Pengdeng
title Grasper: A generalist pursuer for pursuit-evasion problems
title_short Grasper: A generalist pursuer for pursuit-evasion problems
title_full Grasper: A generalist pursuer for pursuit-evasion problems
title_fullStr Grasper: A generalist pursuer for pursuit-evasion problems
title_full_unstemmed Grasper: A generalist pursuer for pursuit-evasion problems
title_sort grasper: a generalist pursuer for pursuit-evasion problems
publisher Institutional Knowledge at Singapore Management University
publishDate 2024
url https://ink.library.smu.edu.sg/sis_research/9130
https://ink.library.smu.edu.sg/context/sis_research/article/10133/viewcontent/p1147_pvoa_cc_by.pdf
_version_ 1814047750583484416