Solving online threat screening games using constrained action space reinforcement learning
Large-scale screening for potential threats with limited resources and capacity for screening is a problem of interest at airports, seaports, and other ports of entry. Adversaries can observe screening procedures and arrive at a time when there will be gaps in screening due to limited resource capac...
Saved in:
Main Authors: | , , , , |
---|---|
Format: | text |
Language: | English |
Published: |
Institutional Knowledge at Singapore Management University
2020
|
Subjects: | |
Online Access: | https://ink.library.smu.edu.sg/sis_research/5077 https://ink.library.smu.edu.sg/context/sis_research/article/6080/viewcontent/AAAI_ShahS.10043.pdf |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Institution: | Singapore Management University |
Language: | English |
id |
sg-smu-ink.sis_research-6080 |
---|---|
record_format |
dspace |
spelling |
sg-smu-ink.sis_research-60802021-06-10T09:04:25Z Solving online threat screening games using constrained action space reinforcement learning SHAH, Sanket SINHA, Arunesh VARAKANTHAM, Pradeep PERRAULT, Andrew TAMBE, Millind Large-scale screening for potential threats with limited resources and capacity for screening is a problem of interest at airports, seaports, and other ports of entry. Adversaries can observe screening procedures and arrive at a time when there will be gaps in screening due to limited resource capacities. To capture this game between ports and adversaries, this problem has been previously represented as a Stackelberg game, referred to as a Threat Screening Game (TSG). Given the significant complexity associated with solving TSGs and uncertainty in arrivals of customers, existing work has assumed that screenees arrive and are allocated security resources at the beginning of the time-window. In practice, screenees such as airport passengers arrive in bursts correlated with flight time and are not bound by fixed timewindows. To address this, we propose an online threat screening model in which the screening strategy is determined adaptively as a passenger arrives while satisfying a hard bound on acceptable risk of not screening a threat. To solve the online problem, we first reformulate it as a Markov Decision Process (MDP) in which the hard bound on risk translates to a constraint on the action space and then solve the resultant MDP using Deep Reinforcement Learning (DRL). To this end, we provide a novel way to efficiently enforce linear inequality constraints on the action output in DRL. We show that our solution allows us to significantly reduce screenee wait time without compromising on the risk. 2020-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5077 info:doi/10.1609/aaai.v34i02.5599 https://ink.library.smu.edu.sg/context/sis_research/article/6080/viewcontent/AAAI_ShahS.10043.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Airport passenger Linear inequality constraints Markov Decision Processes Potential threats Resource capacity Screening procedures Screening strategy Stackelberg Games Artificial Intelligence and Robotics |
institution |
Singapore Management University |
building |
SMU Libraries |
continent |
Asia |
country |
Singapore Singapore |
content_provider |
SMU Libraries |
collection |
InK@SMU |
language |
English |
topic |
Airport passenger Linear inequality constraints Markov Decision Processes Potential threats Resource capacity Screening procedures Screening strategy Stackelberg Games Artificial Intelligence and Robotics |
spellingShingle |
Airport passenger Linear inequality constraints Markov Decision Processes Potential threats Resource capacity Screening procedures Screening strategy Stackelberg Games Artificial Intelligence and Robotics SHAH, Sanket SINHA, Arunesh VARAKANTHAM, Pradeep PERRAULT, Andrew TAMBE, Millind Solving online threat screening games using constrained action space reinforcement learning |
description |
Large-scale screening for potential threats with limited resources and capacity for screening is a problem of interest at airports, seaports, and other ports of entry. Adversaries can observe screening procedures and arrive at a time when there will be gaps in screening due to limited resource capacities. To capture this game between ports and adversaries, this problem has been previously represented as a Stackelberg game, referred to as a Threat Screening Game (TSG). Given the significant complexity associated with solving TSGs and uncertainty in arrivals of customers, existing work has assumed that screenees arrive and are allocated security resources at the beginning of the time-window. In practice, screenees such as airport passengers arrive in bursts correlated with flight time and are not bound by fixed timewindows. To address this, we propose an online threat screening model in which the screening strategy is determined adaptively as a passenger arrives while satisfying a hard bound on acceptable risk of not screening a threat. To solve the online problem, we first reformulate it as a Markov Decision Process (MDP) in which the hard bound on risk translates to a constraint on the action space and then solve the resultant MDP using Deep Reinforcement Learning (DRL). To this end, we provide a novel way to efficiently enforce linear inequality constraints on the action output in DRL. We show that our solution allows us to significantly reduce screenee wait time without compromising on the risk. |
format |
text |
author |
SHAH, Sanket SINHA, Arunesh VARAKANTHAM, Pradeep PERRAULT, Andrew TAMBE, Millind |
author_facet |
SHAH, Sanket SINHA, Arunesh VARAKANTHAM, Pradeep PERRAULT, Andrew TAMBE, Millind |
author_sort |
SHAH, Sanket |
title |
Solving online threat screening games using constrained action space reinforcement learning |
title_short |
Solving online threat screening games using constrained action space reinforcement learning |
title_full |
Solving online threat screening games using constrained action space reinforcement learning |
title_fullStr |
Solving online threat screening games using constrained action space reinforcement learning |
title_full_unstemmed |
Solving online threat screening games using constrained action space reinforcement learning |
title_sort |
solving online threat screening games using constrained action space reinforcement learning |
publisher |
Institutional Knowledge at Singapore Management University |
publishDate |
2020 |
url |
https://ink.library.smu.edu.sg/sis_research/5077 https://ink.library.smu.edu.sg/context/sis_research/article/6080/viewcontent/AAAI_ShahS.10043.pdf |
_version_ |
1770575211316379648 |