Grasper: A generalist pursuer for pursuit-evasion problems
Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in Policy-Space Response Oracles (PSRO) to improve...
Saved in:
Main Authors: | , , , , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2024
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/9130 https://ink.library.smu.edu.sg/context/sis_research/article/10133/viewcontent/p1147_pvoa_cc_by.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-10133 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-101332024-08-01T09:29:54Z Grasper: A generalist pursuer for pursuit-evasion problems LI, Pengdeng LI, Shuxin WANG, Xinrun CERNY, Jakub ZHANG, Youzhi McAleer, Stephen CHAN, Hau AN, Bo Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in Policy-Space Response Oracles (PSRO) to improve scalability in solving large-scale PEGs. However, these methods primarily focus on specific PEGs with fixed initial conditions that may vary substantially in real-world scenarios, which significantly hinders the applicability of the traditional methods. To address this issue, we introduce Grasper, a GeneRAlist purSuer for Pursuit-Evasion pRoblems, capable of efficiently generating pursuer policies tailored to specific PEGs. Our contributions are threefold: First, we present a novel architecture that offers high-quality solutions for diverse PEGs, comprising critical components such as (i) a graph neural network (GNN) to encode PEGs into hidden vectors, and (ii) a hypernetwork to generate pursuer policies based on these hidden vectors. As a second contribution, we develop an efficient three-stage training method involving (i) a pre-pretraining stage for learning robust PEG representations through self-supervised graph learning techniques like graph masked auto-encoder (Graph-MAE), (ii) a pre-training stage utilizing heuristic-guided multi-task pre-training (HMP) where heuristic-derived reference policies (e.g., through Dijkstra’s algorithm) regularize pursuer policies, and (iii) a fine-tuning stage that employs PSRO to generate pursuer policies on designated PEGs. Finally, we perform extensive experiments on synthetic and real-world maps, showcasing Grasper’s significant superiority over baselines in terms of solution quality and generalizability. We demonstrate that Grasper provides a versatile approach for solving pursuit-evasion problems across a broad range of scenarios, enabling practical deployment in real-world situations. 2024-05-01T07:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/9130 https://ink.library.smu.edu.sg/context/sis_research/article/10133/viewcontent/p1147_pvoa_cc_by.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Generalizability Hypernetwork Multi-Agent Learning Pre-training and Fine-tuning Pursuit-Evasion Problems Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Generalizability Hypernetwork Multi-Agent Learning Pre-training and Fine-tuning Pursuit-Evasion Problems Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing |
spellingShingle |
Generalizability Hypernetwork Multi-Agent Learning Pre-training and Fine-tuning Pursuit-Evasion Problems Artificial Intelligence and Robotics Numerical Analysis and Scientific Computing LI, Pengdeng LI, Shuxin WANG, Xinrun CERNY, Jakub ZHANG, Youzhi McAleer, Stephen CHAN, Hau AN, Bo Grasper: A generalist pursuer for pursuit-evasion problems |
description |
Pursuit-evasion games (PEGs) model interactions between a team of pursuers and an evader in graph-based environments such as urban street networks. Recent advancements have demonstrated the effectiveness of the pre-training and fine-tuning paradigm in Policy-Space Response Oracles (PSRO) to improve scalability in solving large-scale PEGs. However, these methods primarily focus on specific PEGs with fixed initial conditions that may vary substantially in real-world scenarios, which significantly hinders the applicability of the traditional methods. To address this issue, we introduce Grasper, a GeneRAlist purSuer for Pursuit-Evasion pRoblems, capable of efficiently generating pursuer policies tailored to specific PEGs. Our contributions are threefold: First, we present a novel architecture that offers high-quality solutions for diverse PEGs, comprising critical components such as (i) a graph neural network (GNN) to encode PEGs into hidden vectors, and (ii) a hypernetwork to generate pursuer policies based on these hidden vectors. As a second contribution, we develop an efficient three-stage training method involving (i) a pre-pretraining stage for learning robust PEG representations through self-supervised graph learning techniques like graph masked auto-encoder (Graph-MAE), (ii) a pre-training stage utilizing heuristic-guided multi-task pre-training (HMP) where heuristic-derived reference policies (e.g., through Dijkstra’s algorithm) regularize pursuer policies, and (iii) a fine-tuning stage that employs PSRO to generate pursuer policies on designated PEGs. Finally, we perform extensive experiments on synthetic and real-world maps, showcasing Grasper’s significant superiority over baselines in terms of solution quality and generalizability. We demonstrate that Grasper provides a versatile approach for solving pursuit-evasion problems across a broad range of scenarios, enabling practical deployment in real-world situations. |
format |
text |
author |
LI, Pengdeng LI, Shuxin WANG, Xinrun CERNY, Jakub ZHANG, Youzhi McAleer, Stephen CHAN, Hau AN, Bo |
author_facet |
LI, Pengdeng LI, Shuxin WANG, Xinrun CERNY, Jakub ZHANG, Youzhi McAleer, Stephen CHAN, Hau AN, Bo |
author_sort |
LI, Pengdeng |
title |
Grasper: A generalist pursuer for pursuit-evasion problems |
title_short |
Grasper: A generalist pursuer for pursuit-evasion problems |
title_full |
Grasper: A generalist pursuer for pursuit-evasion problems |
title_fullStr |
Grasper: A generalist pursuer for pursuit-evasion problems |
title_full_unstemmed |
Grasper: A generalist pursuer for pursuit-evasion problems |
title_sort |
grasper: a generalist pursuer for pursuit-evasion problems |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2024 |
url |
https://ink.library.smu.edu.sg/sis_research/9130 https://ink.library.smu.edu.sg/context/sis_research/article/10133/viewcontent/p1147_pvoa_cc_by.pdf |
_version_ |
1814047750583484416 |