Solving online threat screening games using constrained action space reinforcement learning

Large-scale screening for potential threats with limited resources and capacity for screening is a problem of interest at airports, seaports, and other ports of entry. Adversaries can observe screening procedures and arrive at a time when there will be gaps in screening due to limited resource capac...

Full description

Saved in:

Bibliographic Details
Main Authors:	SHAH, Sanket, SINHA, Arunesh, VARAKANTHAM, Pradeep, PERRAULT, Andrew, TAMBE, Millind
Format:	text
Language:	English
Published:	Institutional Knowledge at Singapore Management University 2020
Subjects:	Airport passenger Linear inequality constraints Markov Decision Processes Potential threats Resource capacity Screening procedures Screening strategy Stackelberg Games Artificial Intelligence and Robotics
Online Access:	https://ink.library.smu.edu.sg/sis_research/5077 https://ink.library.smu.edu.sg/context/sis_research/article/6080/viewcontent/AAAI_ShahS.10043.pdf
Tags:	Add Tag No Tags, Be the first to tag this record!
Institution:	Singapore Management University
Language:	English

id	sg-smu-ink.sis_research-6080
record_format	dspace
spelling	sg-smu-ink.sis_research-60802021-06-10T09:04:25Z Solving online threat screening games using constrained action space reinforcement learning SHAH, Sanket SINHA, Arunesh VARAKANTHAM, Pradeep PERRAULT, Andrew TAMBE, Millind Large-scale screening for potential threats with limited resources and capacity for screening is a problem of interest at airports, seaports, and other ports of entry. Adversaries can observe screening procedures and arrive at a time when there will be gaps in screening due to limited resource capacities. To capture this game between ports and adversaries, this problem has been previously represented as a Stackelberg game, referred to as a Threat Screening Game (TSG). Given the significant complexity associated with solving TSGs and uncertainty in arrivals of customers, existing work has assumed that screenees arrive and are allocated security resources at the beginning of the time-window. In practice, screenees such as airport passengers arrive in bursts correlated with flight time and are not bound by fixed timewindows. To address this, we propose an online threat screening model in which the screening strategy is determined adaptively as a passenger arrives while satisfying a hard bound on acceptable risk of not screening a threat. To solve the online problem, we first reformulate it as a Markov Decision Process (MDP) in which the hard bound on risk translates to a constraint on the action space and then solve the resultant MDP using Deep Reinforcement Learning (DRL). To this end, we provide a novel way to efficiently enforce linear inequality constraints on the action output in DRL. We show that our solution allows us to significantly reduce screenee wait time without compromising on the risk. 2020-02-01T08:00:00Z text application/pdf https://ink.library.smu.edu.sg/sis_research/5077 info:doi/10.1609/aaai.v34i02.5599 https://ink.library.smu.edu.sg/context/sis_research/article/6080/viewcontent/AAAI_ShahS.10043.pdf http://creativecommons.org/licenses/by-nc-nd/4.0/ Research Collection School Of Computing and Information Systems eng Institutional Knowledge at Singapore Management University Airport passenger Linear inequality constraints Markov Decision Processes Potential threats Resource capacity Screening procedures Screening strategy Stackelberg Games Artificial Intelligence and Robotics
institution	Singapore Management University
building	SMU Libraries
continent	Asia
country	Singapore Singapore
content_provider	SMU Libraries
collection	InK@SMU
language	English
topic	Airport passenger Linear inequality constraints Markov Decision Processes Potential threats Resource capacity Screening procedures Screening strategy Stackelberg Games Artificial Intelligence and Robotics
spellingShingle	Airport passenger Linear inequality constraints Markov Decision Processes Potential threats Resource capacity Screening procedures Screening strategy Stackelberg Games Artificial Intelligence and Robotics SHAH, Sanket SINHA, Arunesh VARAKANTHAM, Pradeep PERRAULT, Andrew TAMBE, Millind Solving online threat screening games using constrained action space reinforcement learning
description	Large-scale screening for potential threats with limited resources and capacity for screening is a problem of interest at airports, seaports, and other ports of entry. Adversaries can observe screening procedures and arrive at a time when there will be gaps in screening due to limited resource capacities. To capture this game between ports and adversaries, this problem has been previously represented as a Stackelberg game, referred to as a Threat Screening Game (TSG). Given the significant complexity associated with solving TSGs and uncertainty in arrivals of customers, existing work has assumed that screenees arrive and are allocated security resources at the beginning of the time-window. In practice, screenees such as airport passengers arrive in bursts correlated with flight time and are not bound by fixed timewindows. To address this, we propose an online threat screening model in which the screening strategy is determined adaptively as a passenger arrives while satisfying a hard bound on acceptable risk of not screening a threat. To solve the online problem, we first reformulate it as a Markov Decision Process (MDP) in which the hard bound on risk translates to a constraint on the action space and then solve the resultant MDP using Deep Reinforcement Learning (DRL). To this end, we provide a novel way to efficiently enforce linear inequality constraints on the action output in DRL. We show that our solution allows us to significantly reduce screenee wait time without compromising on the risk.
format	text
author	SHAH, Sanket SINHA, Arunesh VARAKANTHAM, Pradeep PERRAULT, Andrew TAMBE, Millind
author_facet	SHAH, Sanket SINHA, Arunesh VARAKANTHAM, Pradeep PERRAULT, Andrew TAMBE, Millind
author_sort	SHAH, Sanket
title	Solving online threat screening games using constrained action space reinforcement learning
title_short	Solving online threat screening games using constrained action space reinforcement learning
title_full	Solving online threat screening games using constrained action space reinforcement learning
title_fullStr	Solving online threat screening games using constrained action space reinforcement learning
title_full_unstemmed	Solving online threat screening games using constrained action space reinforcement learning
title_sort	solving online threat screening games using constrained action space reinforcement learning
publisher	Institutional Knowledge at Singapore Management University
publishDate	2020
url	https://ink.library.smu.edu.sg/sis_research/5077 https://ink.library.smu.edu.sg/context/sis_research/article/6080/viewcontent/AAAI_ShahS.10043.pdf
_version_	1770575211316379648

Solving online threat screening games using constrained action space reinforcement learning

Similar Items