Solving online threat screening games using constrained action space reinforcement learning

Large-scale screening for potential threats with limited resources and capacity for screening is a problem of interest at airports, seaports, and other ports of entry. Adversaries can observe screening procedures and arrive at a time when there will be gaps in screening due to limited resource capac...

Full description

Saved in:
Bibliographic Details
Main Authors: SHAH, Sanket, SINHA, Arunesh, VARAKANTHAM, Pradeep, PERRAULT, Andrew, TAMBE, Millind
Format: text
Language:English
Published: Institutional Knowledge at Singapore Management University 2020
Subjects:
Online Access:https://ink.library.smu.edu.sg/sis_research/5077
https://ink.library.smu.edu.sg/context/sis_research/article/6080/viewcontent/AAAI_ShahS.10043.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
Institution: Singapore Management University
Language: English
id sg-smu-ink.sis_research-6080
record_format dspace
spelling sg-smu-ink.sis_research-60802021-06-10T09:04:25Z Solving online threat screening games using constrained action space reinforcement learning SHAH, Sanket SINHA, Arunesh VARAKANTHAM, Pradeep PERRAULT, Andrew TAMBE, Millind Large-scale screening for potential threats with limited resources and capacity for screening is a problem of interest at airports, seaports, and other ports of entry. Adversaries can observe screening procedures and arrive at a time when there will be gaps in screening due to limited resource capacities. To capture this game between ports and adversaries, this problem has been previously represented as a Stackelberg game, referred to as a Threat Screening Game (TSG). Given the significant complexity associated with solving TSGs and uncertainty in arrivals of customers, existing work has assumed that screenees arrive and are allocated security resources at the beginning of the time-window. In practice, screenees such as airport passengers arrive in bursts correlated with flight time and are not bound by fixed timewindows. To address this, we propose an online threat screening model in which the screening strategy is determined adaptively as a passenger arrives while satisfying a hard bound on acceptable risk of not screening a threat. To solve the online problem, we first reformulate it as a Markov Decision Process (MDP) in which the hard bound on risk translates to a constraint on the action space and then solve the resultant MDP using Deep Reinforcement Learning (DRL). To this end, we provide a novel way to efficiently enforce linear inequality constraints on the action output in DRL. We show that our solution allows us to significantly reduce screenee wait time without compromising on the risk. 2020-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5077 info:doi/10.1609/aaai.v34i02.5599 https://ink.library.smu.edu.sg/context/sis_research/article/6080/viewcontent/AAAI_ShahS.10043.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Airport passenger Linear inequality constraints Markov Decision Processes Potential threats Resource capacity Screening procedures Screening strategy Stackelberg Games Artificial Intelligence and Robotics
institution Singapore Management University
building SMU Libraries
continent Asia
country Singapore
Singapore
content_provider SMU Libraries
collection InK@SMU
language English
topic Airport passenger
Linear inequality constraints
Markov Decision Processes
Potential threats
Resource capacity
Screening procedures
Screening strategy
Stackelberg Games
Artificial Intelligence and Robotics
spellingShingle Airport passenger
Linear inequality constraints
Markov Decision Processes
Potential threats
Resource capacity
Screening procedures
Screening strategy
Stackelberg Games
Artificial Intelligence and Robotics
SHAH, Sanket
SINHA, Arunesh
VARAKANTHAM, Pradeep
PERRAULT, Andrew
TAMBE, Millind
Solving online threat screening games using constrained action space reinforcement learning
description Large-scale screening for potential threats with limited resources and capacity for screening is a problem of interest at airports, seaports, and other ports of entry. Adversaries can observe screening procedures and arrive at a time when there will be gaps in screening due to limited resource capacities. To capture this game between ports and adversaries, this problem has been previously represented as a Stackelberg game, referred to as a Threat Screening Game (TSG). Given the significant complexity associated with solving TSGs and uncertainty in arrivals of customers, existing work has assumed that screenees arrive and are allocated security resources at the beginning of the time-window. In practice, screenees such as airport passengers arrive in bursts correlated with flight time and are not bound by fixed timewindows. To address this, we propose an online threat screening model in which the screening strategy is determined adaptively as a passenger arrives while satisfying a hard bound on acceptable risk of not screening a threat. To solve the online problem, we first reformulate it as a Markov Decision Process (MDP) in which the hard bound on risk translates to a constraint on the action space and then solve the resultant MDP using Deep Reinforcement Learning (DRL). To this end, we provide a novel way to efficiently enforce linear inequality constraints on the action output in DRL. We show that our solution allows us to significantly reduce screenee wait time without compromising on the risk.
format text
author SHAH, Sanket
SINHA, Arunesh
VARAKANTHAM, Pradeep
PERRAULT, Andrew
TAMBE, Millind
author_facet SHAH, Sanket
SINHA, Arunesh
VARAKANTHAM, Pradeep
PERRAULT, Andrew
TAMBE, Millind
author_sort SHAH, Sanket
title Solving online threat screening games using constrained action space reinforcement learning
title_short Solving online threat screening games using constrained action space reinforcement learning
title_full Solving online threat screening games using constrained action space reinforcement learning
title_fullStr Solving online threat screening games using constrained action space reinforcement learning
title_full_unstemmed Solving online threat screening games using constrained action space reinforcement learning
title_sort solving online threat screening games using constrained action space reinforcement learning
publisher Institutional Knowledge at Singapore Management University
publishDate 2020
url https://ink.library.smu.edu.sg/sis_research/5077
https://ink.library.smu.edu.sg/context/sis_research/article/6080/viewcontent/AAAI_ShahS.10043.pdf
_version_ 1770575211316379648